Preface


This book is written primarily for college students who have no previous 
background in operations research and who intend careers as administrators, 
consultants, executives, or managers in business, nonprofit enterprises, or
government. The broad topic coverage also should make the text helpful for students
who seek careers as teachers and researchers as well as for practitioners who desire 
an up-to-date review of operations research. The book can be used in half-year 
or full-year introductory courses for juniors, seniors, or graduates in business, 
economics, and engineering curricula. 

The central goal of the book is to answer the question, “What are the
fundamental ideas of operations research?” The text does not presuppose any advanced
training in business administration, industrial engineering, mathematics,
statistics, probability theory, or economics. Therefore, the main ideas do not rely
on the reader’s being expert in these areas. The text does assume, however, that 
the reader is not entirely naive about such subjects. 


MATHEMATICS PREREQUISITES 

We have consistently tried to motivate the technical developments and to 
make the mathematical logic clear. But, frankly, we did not try to make the 
mathematics palatable for a student who is congenitally averse to science.

The differential calculus is employed only in Chaps. 14, 15, and 20, and the 
integral calculus only in Chap. 20 and Appendix III. Nevertheless, much of the 
text assumes a mathematical sophistication comparable to that acquired in 
standard college-level introductory calculus or a no-nonsense finite mathematics 
course. Elementary probability theory is first needed in Chap. 16. The three 
appendices and much of the optional material, designated by either an asterisk 
or smaller print, relate to technical details and often require a noticeably higher 
level of mathematical maturity. 


PRINCIPAL OBJECTIVES

Beginning students frequently ask, “What must I learn about operations 
research if I intend to become a manager rather than a specialist?” and “What 
must I learn about operations research given that I want to apply it to real 
problems?” Regrettably, a single introductory operations research course cannot 
completely answer either question. But such a course can better answer the 
first rather than the second question. 

In the context of these two questions, the book’s principal objectives are

• To introduce the important ideas in operations research which are both
fundamental and long lasting. 

• To provide those students not going beyond a single introductory course 
with enough understanding and confidence to appreciate the strengths 
and inherent limitations of the operations research approach.

• To prepare and motivate future specialists to continue in their study by 
having an insightful overview of operations research. 

• To demonstrate the cohesiveness of operations research methodology. 


COVERAGE AND EMPHASIS 

Obviously, an author of an introductory text must exercise considerable 
judgment in selecting topics and in choosing the depth of treatment. In this 
regard, we have been influenced considerably by more than 15 years of firsthand
experience in applying operations research to actual situations in business
and government. The examples in the text often are scaled-down versions of
real problems that we have encountered. Of equal importance in selecting topics
is our accumulated experience teaching hundreds of undergraduates and
graduates, many of whom now are using operations research to aid managerial
decision making or are, in turn, educating other college students. The specific 
models that we chose, their complexity, and sequential development all reflect 
our observations of how most students come to appreciate and understand the 
pivotal concepts in operations research. 

To permit an instructor to select the topics that are most appropriate for his 
class, we have included material at several different levels of complexity. 
Material designated by an asterisk, or appearing between the symbols ► and ◄ and set in
smaller print, may be skipped without loss of continuity.

Throughout, the book stresses the insights for decision making that stem from 
operations research analysis. The text puts into focus the value of information 
derived from an operations research solution and attempts to give an accurate 
representation of how a hypothetical example would be applied in a real situation.


The text develops the student’s skill in formulating and building models and, 
specifically, in translating a verbal description of a decision problem into an 
equivalent mathematical model. The book also explains the importance and the 
degree of severity of a model’s assumptions, the connection between the starting 
assumptions and the derived results, and the seriousness of the assumptions for 
practical purposes. Finally, the text demonstrates by means of important
examples the process and usefulness of constructing analogies, finding multiple
interpretations of models, and deriving significant special cases from general models.


SUGGESTIONS TO STUDENTS 

If you have taken courses in both the calculus and elementary statistics or 
probability theory, you will recall that you could understand most of the calculus 
text without much help from the instructor whereas you needed the guidance 
of the instructor to fully understand the statistics or probability theory text. This 
is a very common occurrence among students. Can you suggest why? You will 
find operations research to be more like statistics and probability theory than 
the calculus in that you can expect, as a matter of course, to rely on your instructor 
to amplify the ideas and techniques in this book. Since the concepts can be
intricate, you may have to study several pages before comprehending the full idea.
So be prepared for patient reading and some rereading. 

We have tried to avoid giving numerical examples that are so misleadingly 
simple that you are not sufficiently prepared for the exercises at the end of each 
chapter. As a result, you will find it advantageous to examine carefully the 
numerical illustrations in the chapters. Here and there, the text will ask you to 
verify certain calculations and conclusions. Obviously, the purpose of these 
suggestions is not to validate the accuracy of the author’s arithmetic or logic. 
Rather, they are signals to guide you in learning the material. When studying 
a chapter for the first time, do not interrupt your reading to make these checks; 
but afterwards return to these points to ascertain that you have really understood 
the ideas. 

If your instructor assigns the exercises at the back of the chapters, do not wait 
until the last minute to begin the problems. The numerical exercises, although 
straightforward, can be fatiguing if attempted in a single evening session. 
Most students find that formulation problems require a “gestation period,” so
allow yourself a few days to ponder such exercises. 

Surprisingly, it is difficult to write a verbal description of an operations research 
problem that is completely unambiguous. More than once, thoughtful students 
have discovered vague wording in problems that we had assigned previously to 
other classes that experienced no difficulty in obtaining the intended solutions. 
Consequently, if you believe an exercise displays a troublesome ambiguity, try
to resolve it in a sensible way and make explicit on your paper the specific
assumption that you adopted. The purpose of the exercises (namely, to give you
practice in mathematical formulation) is well served by this procedure. 



SUGGESTIONS TO INSTRUCTORS 

Clearly, the entire book is too large for even a full year’s introductory course. 
This broad topic coverage has been purposely designed to give you considerable 
flexibility in choosing the subjects that you want to stress. The logical
organization of the chapters is shown in Exhibit A.

In a one-term introductory survey course, where the students know elementary 
probability theory and the calculus, you can include Chaps. 1 through 6, 8, 10, 
16, 17, 19 through 22, skipping all of the starred (*) sections and optional
material in smaller print. If you do not want to use the calculus, then omit Chap.
20. If the students are unfamiliar with elementary probability theory, then you 
can assign Chaps. 1 through 8, 10, and choose topics from Chaps. 9 and 11 through 
15. 

To assist you in selecting other combinations, the “Immediate Predecessors” 
of each chapter are shown in Exhibit B. In general, the chapters become
progressively more difficult. Thus, although you could cover dynamic programming
(Chaps. 8, 9, and 10) prior to linear programming, the reverse sequence ordinarily 
would be preferable, because the dynamic programming chapters assume the 
student already is somewhat familiar with multivariate constrained optimization 
models and solution techniques. 
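
As a supplementary sketch (not part of the original text), the predecessor relationships of Exhibit B can be encoded as a small table, and the complete set of chapters that must precede a given chapter can then be traced mechanically. The Python names below (`IMMEDIATE_PREDECESSORS`, `all_prerequisites`) are hypothetical. The sketch assumes that only the essential predecessors are followed, that the parenthesized "desirable" ones are ignored, and that the appendices are left out.

```python
# Immediate predecessors of Chaps. 1-22, transcribed from Exhibit B.
# Assumption: parenthesized (desirable but not essential) entries are omitted.
IMMEDIATE_PREDECESSORS = {
    1: [], 2: [1], 3: [2], 4: [2], 5: [3, 4], 6: [2], 7: [5, 6],
    8: [1], 9: [8], 10: [8], 11: [7, 10], 12: [11], 13: [7],
    14: [5], 15: [14], 16: [7], 17: [10, 16], 18: [12, 17],
    19: [16], 20: [16], 21: [16], 22: [1],
}


def all_prerequisites(chapter):
    """Return every chapter that must be covered before `chapter`."""
    required = set()
    pending = list(IMMEDIATE_PREDECESSORS[chapter])
    while pending:
        current = pending.pop()
        if current not in required:
            required.add(current)
            # Follow the chain back through earlier chapters.
            pending.extend(IMMEDIATE_PREDECESSORS[current])
    return required


if __name__ == "__main__":
    for chapter in (5, 10, 17):
        print(chapter, sorted(all_prerequisites(chapter)))
```

Tracing Chap. 10 in this way yields only Chaps. 1 and 8, which agrees with the remark that dynamic programming could be covered prior to linear programming.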

The book contains more than 1000 exercises, many of which have multiple 
parts. The problems are grouped into four categories. The Review Exercises 
are keyed closely to the text and determine whether the student has understood 
the conceptual developments. The Formulation Exercises consist of “word
problems” and test whether the student can translate a verbal statement into a
precise mathematical model. The Computational Exercises provide practice
problems for applying algorithms. The Mind-Expanding Exercises are primarily
for advanced students; you also may want to pick some of these problems as
topics for your lectures. There is “redundancy” in the multiple parts of many 
exercises, and consequently, you should examine the parts to ascertain which 
ones you want to assign. 

In comparing this book with other elementary texts in operations research, 
you will notice a few differences in the standard list of chapter headings. We 
give extended treatment at an introductory level to several topics (such as
dynamic programming over an unbounded horizon, integer programming,
nonlinear programming, stochastic programming, and dynamic programming in
Markov chains) that are covered only briefly, if at all, elsewhere. In contrast, we
have not devoted entire chapters to the subjects of replacement theory and
game theory. Important replacement models are included in the dynamic
programming chapters. Game theory has been omitted, with the exception of the
Fundamental Theorem of Two-Person, Zero-Sum Matrix Games given in
exercise 35 of Chap. 5. In practicing operations research, we have found that game
theory does not contribute any managerial insights to real competitive and
cooperative decision-making behavior that are not already familiar to church-going
Exhibit B

CHAPTER                                          IMMEDIATE PREDECESSORS

  2    Linear Models                             1
  3    Algebra and Geometry of Linear Models     2
  4    Simplex Method                            2
  5    Sensitivity Testing and Duality           3, 4
  6    Network Models                            2
  7    Network Algorithms                        5, 6
  8    Dynamic Models                            1, (7)
  9    Deterministic Inventory Scheduling        8
 10    Examples of Dynamic Programming           8
 11    Unbounded Horizon Models                  7, 10
 12    Optimization for Unbounded Horizon        11
 13    Integer Programming                       7
 14    Nonlinear Objective Function              5
*15    Advanced Nonlinear Programming            14
 16    Stochastic Models                         7
 17    Probabilistic Dynamic Programming         10, 16
*18    Programming in Markov Chains              12, 17
 19    Probabilistic Inventory Models            (9), 16
 20    Waiting Line Models                       16
 21    Computer Simulation                       16
 22    Implementation                            1
*I     Advanced Network Algorithms               7
*II    Periodic Review Model                     18, 19
*III   Advanced Waiting Line Techniques          20

NOTE: Chaps. 16 through 21 require probability theory.
      Chaps. 14, 15, and 20 require differential calculus.
      Chap. 20 and App. III require integral calculus.
      ( ) indicates desirable but not essential.
poker players who regularly read the Wall Street Journal. (This is not to say,
however, that the intricacies of real competitive economic behavior, such as 
price wars, advertising campaigns, mergers, and acquisitions, have yet become 
phenomena fully understood by scientists.) 
CHAPTER 1


The Art and Science 
of Executive Decisions 


1.1 BEFORE GETTING STARTED... 

You may well be curious to know how a subject with so abstruse a name as 
operations research could beget a several hundred page introductory text purportedly 
dealing only with principles. The ambiguous term operations research was coined 
during World War II. At that time, it was an apt description of the subject matter. 
Unfortunately, the name stuck, even though present-day applications of operations 
research are considerably more diverse than they used to be. 

Now there is a worldwide confederation of professional societies named
Operations Research. The staffs of many industrial organizations bear the title. So do
departments in leading universities, which have gone on to sanctify the term by 
granting advanced degrees bearing its name. These vested interests are so well 
entrenched that the name operations research is unlikely to be supplanted in our 
lifetime. 

Disgruntled though we may be, saddled as we are with a title that is
undescriptive if not downright misleading, we nevertheless must show our respect. After
all, the scientists who originated the term were on the winning side of the war. 
(Who knows what might have happened if the other side had invented the 
approach first?) 

1.2 BY ANY OTHER NAME 

Numerous synonyms for operations research are in common use. The British 
like operational research. A frequent American substitute is management science. (The
popularity of this name is fostered by yet another international professional society
called the Institute of Management Science. The Operations Research Society and
the Institute of Management Science regularly hold joint meetings, and their
membership overlaps to a large extent.) As a beginning student, fortunately, you 
can afford to assume a lofty indifference to the whole matter, leaving this 
semantic bone of contention for your seniors to wrangle over. 

For convenience, and with reasonable accuracy, you can simply define
operations research as a scientific approach to problem-solving for executive
management. An application of operations research involves:

• Constructing mathematical, economic, and statistical descriptions or models
of decision and control problems to treat situations of complexity and
uncertainty.

• Analyzing the relationships that determine the probable future consequences 
of decision choices, and devising appropriate measures of effectiveness in order 
to evaluate the relative merit of alternative actions. 

It is sometimes believed that operations research refers to the constant
monitoring of an organization’s ongoing activities, and in fact decision and control
problems often do concern certain daily “operations” of the organization.
Examples of this sort include production scheduling and inventory control, facility
maintenance and repair, and staffing of service facilities, to name a few
applications.

But many operations research studies treat other kinds of decisions that bear on 
daily operations only indirectly. These studies usually have a planning orientation. 
Illustrations include determining the breadth of a firm’s product line, developing
a long-term program for plant expansion, designing a network of warehouses for
a wholesale distribution system, and entering a new business by merger or
acquisition.

It is bad enough that the word “operations” inadequately describes the
diversity of present-day applications. To make matters worse, the word “research”
creates the false impression that the method is a “blue-sky approach.” On the 
contrary, in the past decade operations research has proved time and again to be a 
powerful and effective approach for solving critically real management problems. 
You will learn most of the reasons in this chapter, and you will know the full 
story after reading the main chapters of this book. 

►Of course, fundamental research in the methods of operations research continues, 
mainly at universities and at governmental and industrial research laboratories. Unlike 
the situation with basic research in other sciences, however, relatively little time elapses 
between an important discovery in operations research and its implementation by 
experienced practitioners in industrial groups. ◄

Better decisions in a complex and uncertain environment. 

A preferable term to describe the subject of this book is decision analysis. An 
emphasis on making decisions or taking actions is central to all operations research 

separates a large-scale problem into its subparts, each of which 
is simpler to manipulate and diagnose. After the separate elements are carefully
examined, the results are synthesized to give insights into the original problem. 
You may wonder why such complex decision-making problems arise in the first 
place. 

One reason is that in today’s economy, technological, environmental, and
competitive factors typically interact in a complicated fashion. For example, a factory
production schedule has to take account of customer demand (tempered by the
likelihood of a price-cut by competitors), requirements for raw materials and
intermediate inventories, the capacities of equipment, the possibility of equipment
failures, and manufacturing process restrictions. It is not easy to make up a
schedule that is both realistic and economical.

Other reasons for complexity in real decision-making situations are that the 
organization (perhaps only half-knowingly) may be pursuing inconsistent goals, 
the responsibility and authority for making the required decisions may be greatly 
diffused within the organization, and the economic environment in which the 
company operates may be uncertain. 

To be successful, an operations research approach must improve the managerial 
decision-making process—the improvement being measured by the net cost of 
obtaining it. You should keep in mind the distinction between improved
decision-making and improved performance, or more succinctly, between a good decision
and a good outcome. For example, by all prior analysis, your betting on the Irish 
Sweepstakes may not appear to be a good decision (economically or morally); 
but after betting, the outcome will be good if you win. Improving decision analysis 
is important because the only thing you control is your decision prior to the
uncertain outcome.
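
The distinction can be made concrete with a small calculation. The sketch below is illustrative only: the ticket price, prize, and chance of winning are made-up numbers, not data about any actual sweepstakes, and the function names are hypothetical. It evaluates the decision by its expected net value before the draw and, separately, the outcome realized after the draw.

```python
def expected_net_value(price, prize, win_probability):
    """Expected gain from buying one ticket, judged before the draw."""
    return win_probability * prize - price


def realized_net_value(price, prize, won):
    """Actual gain after the draw, once the outcome is known."""
    return (prize if won else 0.0) - price


if __name__ == "__main__":
    # Hypothetical figures chosen only for illustration.
    price, prize, win_probability = 1.0, 100.0, 0.001

    print("expected net value:", expected_net_value(price, prize, win_probability))
    print("net value if you win:", realized_net_value(price, prize, won=True))
    print("net value if you lose:", realized_net_value(price, prize, won=False))
```

With these assumed figures the expected net value is negative, so buying the ticket is a poor decision by this criterion, even though the realized outcome is favorable in the rare event of a win.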

Distinguishing characteristics. There are many ways to approach 
management problems, and most of these ways are related. Certainly, there is 
no clear boundary line isolating the solutions derived by professional operations 
researchers from those derived by such people as industrial engineers, or
economists specializing in economic planning, or accountants or financial analysts
oriented toward management information systems. But most operations research
applications possess certain distinguishing characteristics. Specifically, a suggested 
approach to a particular problem must contain all the following qualities before 
we would call it an operations research approach: 

i. A Primary Focus on Decision-Making. The principal results of the analysis 
must have direct and unambiguous implications for executive action. 

ii. An Appraisal Resting on Economic Effectiveness Criteria. A comparison of the 
various feasible actions must be based on measurable values that unequivocally 
reflect the future well-being of the organization. In a commercial firm, these
measured quantities typically include variable costs, revenues, cash flow, and rate
of return on incremental investment. A recommended solution must have
evaluated the tradeoffs and have struck an optimum balance among these sometimes
conflicting factors. 

iii. Reliance on a Formal Mathematical Model. The procedures for manipulating
the data should be so explicit that they can be described to another analyst, who 
in turn would derive the same results from the same data. 

iv. Dependence on an Electronic Computer. This characteristic is not really a
desideratum but rather a requirement necessitated by either the complexity of the
mathematical model, the volume of data to be manipulated, or the magnitude of 
computations needed to implement the associated management operating and 
control systems. 

In science we trust. To embrace operations research, a company must 
believe that applying the scientific method is relevant to the analysis of managerial 
decisions. This statement is not the platitude it may seem to be at first reading. 
The adoption of operations research calls for an act of faith in the benefits of a 
systematic approach to decision-making, and not all corporation executives are 
ready to make that act as yet. 

It may sound strange, at this late date, to hear a plea for faith in science—and 
operations research is a science. After all, the legitimacy of the scientific method in 
the study of other subjects, such as physical phenomena, is hardly open to
question. After hundreds of years of experience, chemists and physicists have developed
efficacious laboratory techniques. But the virtue of applying scientific procedures
to decision-making problems of significance is not so well-established; its
recognition still calls for what the poet Coleridge described, in another context, as “the
willing suspension of disbelief.” Here is why.

Rarely, if ever, can a company perform what most people would regard as 
a bona fide “scientific” experiment to test the merit of an operations research 
solution. Consider a company that is contemplating using a mathematical model 
to arrive at its annual operating plan. Since the company’s economic environment 
differs from year to year, it can never exactly repeat history, and therefore can 
never prove indisputably that the model solution will produce a realized
improvement over the company’s current planning approach.

Consider a second illustration. Suppose that an operations research model has 
been suggested for controlling a company’s inventories. Again, testing whether 
the new system will definitely yield an improvement over the present approach is 
inherently limited. Although you could use historical data to compare how the 
suggested rule would have operated in the past, the comparison does not represent 
a truly scientific experiment with controlled variables. For one thing, you can only 
assume that historical data are indicative of what will happen in the future. For 
another, if the suggested rules improve service and customers recognize the 
improvement, then there may be an increase in customer demand. In other words, 
the very operation of the suggested policy can alter the environment. 
Thus, the historical data may not be typical of the future. And because the
decision system itself influences the environment, it is not really possible to operate both
the present and new systems “in parallel.” (Occasionally, you can run part of the 
system under the new set of rules and the other part under the old set of rules. 
Explain why this test, also, is not a truly controlled experiment.) 

Of course, before a manager accepts a specific operations research solution, he 
should perform various tests of reasonableness, including historical comparisons. 
But at some point after making such tests, even in an ideal situation, the manager 
will have to accept as axiomatic that a scientific approach has intrinsic merit. We 
make three amplifying observations before leaving this conclusion. 

First, even though a company may be convinced about the worthiness of the 
scientific method to aid decision-making, it need not accept the results of a
particular operations research study as being valid. After all, the specific project may
have been ill conceived or poorly executed. 

Second, a trust in science does not imply the abandonment of hunch and 
intuition. On the contrary, the history of science itself is studded with cases of 
important discoveries made through chance, hunch, serendipity—even dreams. 
Behavioral scientists have not yet developed ways to induce such flashes of
brilliance consistently. But most executives who use their hunches well also seem to
possess a high level of knowledge and understanding about their activities. So the 
question is not when to apply science and when to rely on intuition, but rather 
how to combine the two effectively. 

Third, the inherent difficulty of demonstrating that a suggested solution is a 
sure-fire improvement is not unique to operations research. Because of the inability 
to duplicate history, an act of faith is also required to accept any other proposed 
solution—including maintaining the status quo. 

Past, present, and future. Although the term “operations research” 
was coined during World War II, the scientific origins of the subject date much 
further back. Primitive mathematical programming models were advanced by 
economists Quesnay in 1759 and Walras in 1874; more sophisticated economic 
models of a similar genre were proposed by Von Neumann in 1937 and
Kantorovich in 1939. The mathematical underpinnings of linear models were established
near the turn of the 19th century by Jordan in 1873, Minkowski in 1896, and 
Farkas in 1903. Another example of early development is the seminal work on 
dynamic models accomplished by Markov, who lived from 1856 to 1922. Two 
further illustrations are the innovative suggestions for economical inventory 
control, published in business and industrial engineering journals during the 
1920’s, and the pioneering studies of waiting line phenomena completed by 
Erlang, who lived from 1878 to 1929.

Even though these early starts received recognition and acclaim, only recently 
have mathematical models for decision analysis taken hold in business. Why? 
At least two factors are important. First, the competitive pressures of doing
business have increased tremendously since World War II. Executives of large
corporations now find it essential in maintaining profits to improve on the traditional
ways of collecting and analyzing data. Second, the fantastic development and 
widespread adoption of high-speed electronic computers have fostered the growth 
of more sophisticated means for assessing decision alternatives. 

There are many reasons to believe that the process of implementing
operations-research-oriented systems will quicken. For example, new technological
developments in what is called time-shared computing bring the power of an electronic
computer literally into an executive’s office. It is a pipedream to suppose that, in the
next few years, most corporation presidents will have computer consoles on their
desk tops for querying at a moment’s notice. But already, financial vice presidents
in several industrial companies do have such consoles to evaluate major
investment alternatives. The future is getting closer all the time.

1.3 BOUNDARIES OF QUANTITATIVE ANALYSIS 

As should be obvious, quantitative analysis can never provide the entire basis
for all strategic decisions. It is inconceivable that the selection of a corporation 
president by a company’s board of directors, for example, could (and should) 
ever rest solely on the manipulation of quantitative data, although some numerical 
information may be relevant. 

It is probably less apparent that even when quantitative analysis is of central 
importance for a managerial decision process, an operations-research-oriented 
system never supplies all the information required for action, no matter how
sophisticated the system’s design. Furthermore, a truly successful implementation of an
operations research system must apply behavioral as well as mathematical
science, because the resultant system must interact with human beings. And finally,
the very process of constructing an operations research system involves the exercise 
of judgment in addition to the logical manipulation of symbols and data. We 
discuss below each of these boundaries on quantitative analysis. 

Problems solved and unsolved. As you read the chapters of this 
book, you will learn the ways an executive can be aided by the different operations 
research models that are treated. We therefore limit our comments in this section 
to more generally applicable remarks. 

We have already mentioned that at the very inception of implementing an 
operations research system it is necessary for experienced executives to discern the
relevance of the model. This alone is not enough, of course. Since the corporate 
owners hold these men responsible for wisely managing the firm, executives must 
continue to exercise their judgmental duty well beyond initial acceptance of the 
model. In one way or another, they must monitor the system to ensure that the 
underlying model remains valid, and in particular that it continues to be used
properly to provide insights into the real decision-making problems of the
company. (Managers must guard against thinking of the model as being reality, and
hence of the accompanying answers as being sacrosanct.) 
A newly implemented operations research system may well bring about a
restructuring or an amplification of information. As a result, executives may act 
differently from how they might have acted without such information. There is 
no getting away from the fact, however, that an executive, not the model, takes 
the action. 

In short, an operations research model is never sufficient unto itself; it cannot 
become entirely independent of judgment supplied by knowledgeable managers. 
This boundary on quantitative analysis is always manifest, because the number of 
questions that managers can pose is boundless, whereas the kinds of answers that 
a single model can provide are inherently limited. 

Systems are for people. The above discussion also suggests that there 
is more to a successful implementation of an operations research system than 
the mere design of a mathematically correct model. Clearly, the system must 
operate in the larger context of managerial activity. The model must take account 
of the data sources, with respect both to quality of the data and to the goals and 
expertise of the people responsible for collecting the data. The system must also 
reflect the information requirements of the managers who review the analytic 
results, especially the needs for descriptive and interpretive commentary. 

Most experienced practitioners of operations research know how to solve these 
so-called problems of communication. But there is a more fundamental limitation 
on quantitative analysis: rarely, if ever, is a suggested operations research system 
in perfect harmony with previously existing managerial attitudes and
predilections. To ignore this fact is to invite internal conflict, subterfuge, and sometimes
downright sabotage of a new system. 

For example, a corporate planning model may call for the development of 
realistic sales forecasts. You would expect that marketing executives should ordi¬ 
narily be entrusted to provide these figures; but the traditional orientation of the 
marketing department may make it impossible for these personnel to articulate 
anything other than sales goals. If the motivational drive of the sales organization 
is to set up targets and then try to meet them, and if it is then called upon to
enunciate both targets and realistic forecasts, severe organizational conflicts may break
out. 

What can be done about this kind of limitation on quantitative analysis?
Presently, considerable research is under way by behavioral scientists to discover 
successful means of instituting effective organizational change. Such developments 
in administrative science will certainly have a fundamental effect on the actual 
degree of success and speed of implementation of operations research systems. 

The art of management science. The problem-solving ingenuity of 
professional operations researchers is still a limiting factor in the spread of
quantitative analysis. Despite the enormous growth in the acceptance of management
science models, there are precious few “standard” applications. Even in areas
of decision-making where the relevance of mathematical models has become well
established, designing particular applications in specific companies requires
significant skill on the part of the management scientist. Model formulations remain
tailor-made to a large degree.

Conceivably, in the next decade some of the well-developed applications of 
operations research will have become so widely adopted that procedures for 
building these models can be codified, as many of the techniques in industrial 
engineering and managerial accounting have been. The unabated expansion of 
quantitative analysis into previously untouched areas of decision-making is so 
enormous, however, that the need for imaginative and talented problem-solving
will remain undiminished for some time. 

In other words, a considerable amount of “art” is still required for the successful 
practice of management science. This in turn means that whether you are a 
managerial user or a practitioner of operations research, you must have some 
facility with both the artistic and the scientific ingredients of the subject. A
textbook, such as this one, can teach you many of the scientific aspects, and give you
a modicum of practice in the art through the study of toy examples and the
formulation and solution of small-scale problems. Unfortunately, however, it can do
no more than make you aware of the artistic elements. 

To help you understand this interplay between the art and the science of
applying operations research, we offer an analogy with the fine arts. A knowledge of
scientific principles, such as the chemistry of paint, the physiology of the eye, the 
physics of light, the psychology of color, and the laws of perspective, helps the 
artist master fully the craft of painting. Likewise, such knowledge
distinguishes the true connoisseur from the casual, albeit appreciative, Sunday
museum-goer. By the same token, an understanding of the fundamentals of operations
research is essential not only for the practitioner, but for the manager who wants 
to make truly effective use of the approach. If today’s business world continues 
to become more complex, an executive will not be able to compete successfully 
in the role of a casual onlooker, or he himself may end up as a museum exhibit. 

1.4 IMPORTANCE OF MODEL-BUILDING

If you study this text diligently, you will learn a considerable amount of 
mathematical technique. But a benefit that far transcends the mastering of specific 
algorithms is the facility you will gain in formulating, manipulating, and
analyzing mathematical models. Model-building is the essence of the operations
research approach.
It is the counterpart to laboratory experimentation in the physical sciences. 

Constructing a model helps you put the complexities and possible uncertainties
attending a decision-making problem into a logical framework amenable to
comprehensive analysis. Such a model clarifies the decision alternatives and their
anticipated effects, indicates the data that are relevant for analyzing the
alternatives, and leads to informative conclusions. In short, the model is a vehicle for
arriving at a well-structured view of reality.

A mixed bag. The word “model” has several shades of meaning, all of
which are relevant to operations research. First, a “model” may be a substitute 
representation of reality, such as a small-scale model airplane or locomotive. 
Second, “model” may imply some sort of idealization, often embodying a
simplification of details, such as a model plan for urban redevelopment. Finally, “model”
may be used as a verb, meaning to exhibit the consequential characteristics of the 
idealized representation. This notion conjures up in the mind those television 
commercials dramatizing how love and happiness will result after a single
application of the sponsor’s product.

In operations research, a model is almost always a mathematical, and
necessarily an approximate, representation of reality. It must be formulated to capture
the crux of the decision-making problem. At the same time, it must be sufficiently 
free of burdensome minor detail to lend itself to finding an improved solution that 
is capable of implementation. Striking a proper balance between reality and
manageability is no mean trick in most applications, and for this reason model-building
can be arduous. 

You will find three pervasive and interrelated themes in operations research 
model-building. The first is an emphasis on optimization. Concentrating on
decisions that are optimal according to one or more specified criteria has been the
forcing wedge for attaining improved decision-making. Typically, the optimization 
is constrained, in that the values of the decision variables maximizing the stated 
objective function are restricted so as to satisfy certain technological restraints. 
Often, the model includes restrictions that mirror the impact of dynamic
phenomena.

The second theme is derivation of the analytic properties of a mathematical 
model, including the sensitivity of an optimal solution to the model’s parameters, 
the structural form of an optimal solution, and the operating characteristics of the 
solution. To illustrate, if you have a mathematical model leading to an inventory 
replenishment policy, you will want to know how the rule depends on forecasts 
of customer demand, the specification of the rule (such as, “when down to n, 
order again”), and the long-run frequency of stockouts and the average inventory 
level. 

The third theme is explicit recognition of system interactions. One of the
difficult tasks in writing an elementary text is to convey how, in real applications, the
model-building effort is oriented toward management system considerations. The 
results of an operations research analysis must be integrated into the management 
information, decision-making, and control systems fabric of the organization. 
Operations research applications cannot be undertaken in isolation from the 
surrounding managerial environment. For these reasons, an operations research 
project should be regarded, at least in part, as a systems effort. 

In one easy lesson. Many of the scientists who pioneered present-day 
applications of operations research are still alive and carry forward their individual
banners of progress. One cannot help but be struck by the way each of these men
treats nearly all of the significant decision-making problems he encounters by
using his own specialty, such as linear programming, dynamic programming,
inventory theory, or simulation. This ability to apply a single solution technique
or mathematical construct to a diverse range of problems, and to do so
effectively, attests not only to the sheer genius of these innovators but to the flexibility
of their approaches. 

The experience of these scientists notwithstanding, most operations research
analysts, when faced with a difficult managerial decision problem, usually do not 
find it self-evident that a single solution technique or model is patently most
appropriate. For example, an analysis of what markets a company should serve, what
products it should manufacture, what investments it should undertake, or where it 
should locate its plants and warehouses rarely leads to an immediate selection of a 
linear programming, or a dynamic programming, or a simulation approach. This 
being the case, you may well wonder how you will go about building or selecting 
a model when faced with a particular decision problem. 

We know that the notion of model-building, as described in a textbook, carries 
with it an aura of mystery. Regrettably, it is virtually impossible to provide you 
with a checklist for infallibly selecting and developing a model. But rest assured, 
there is considerable evidence that most students who have been trained in either 
the sciences, engineering, mathematics, business administration, or economics have 
little trouble building models in practice, provided they are inclined to do so. 
And nowadays, rarely, if ever, will you be faced with applying operations research 
unaided by an experienced practitioner. Therefore, you can count on being 
tutored at least the first time you use operations research. 

1.5 PROCESS OF QUANTITATIVE ANALYSIS 

We outline below the stages that are standard in applying quantitative analysis. 
An experienced practitioner takes these steps almost instinctively, and frequently 
does not attach formal labels to them. Actually, the components are not entirely 
distinct, and at any point in time, several of the phases proceed in concert. As a 
beginner, however, you will find it helpful to look over the entire process seriatim, 
so that you can plan ahead accordingly. 

A prelude to a quantitative analysis of a decision problem should be a thorough 
qualitative analysis. This initial diagnostic phase aims at identifying what seem 
to be the critical factors—of course, subsequent analysis may demonstrate that some 
of these factors are not actually so significant as they first appear. In particular, it 
is important to attain a preliminary notion of what the principal decisions are, 
what the measures of effectiveness are among these choices, and what sorts of 
tradeoffs among these measures are likely to ensue in a comparison of the
alternatives. There will be trouble ahead unless you get a good “feel” for the way the
problem is viewed by the responsible decision-makers. Without this appreciation, 
you may encounter considerable difficulty in gaining acceptance and
implementing your findings. What is worse, your results could very well be erroneous or
beside the point. 

Formulating the problem. The preceding diagnostic should yield a 
statement of the problem’s elements. These include the controllable or decision 
variables, the uncontrollable variables, the restrictions or constraints on the
variables, and the objectives for defining a good or improved solution.

In the formulation process, you must establish the confines of the analysis.
Managerial decision-making problems typically have multifold impacts, some of 
them immediate and others remote (although perhaps equally significant). 
Determining the limits of a particular analysis is mostly a matter of judgment. 

Building the model. Here is where you get down to the fine detail. You 
must decide on the proper data inputs and design the appropriate information 
outputs. You have to identify both the static and dynamic structural elements, 
and devise mathematical formulas to represent the interrelationships among 
these elements. Some of these interdependencies may be posed in terms of
constraints or restrictions on the variables. Some may take the form of a probabilistic
evolutionary system. 

You also must choose a time horizon (possibly the “never-ending future”) to 
evaluate the selected measures of effectiveness for the various decisions. The 
choice of this horizon in turn influences the nature of the constraints imposed, 
since, with a long enough horizon, it is usually possible to remove any short-run 
restrictions by an expenditure of resources. 

Performing the analyses. Given the initial model, along with its 
parameters as specified by historical, technological, and judgmental data, you 
next calculate a mathematical solution. Frequently, a solution means values for 
the decision variables that optimize one of the objectives and give permissible 
levels of performance on any other of the objectives. The various mathematical 
techniques for arriving at solutions comprise much of the contents of this text. 

As pointed out previously, if the formulation of the model is too complex and 
too detailed, then the computational task may surpass the capabilities of present- 
day computers. If the formulation is too simple, the solution may be patently 
unrealistic. Therefore, you can expect to redo some of the steps in the
formulation, model-building, and analysis phases, until you obtain satisfactory results.

A major part of the analysis consists of determining the sensitivity of the
solution to the model specifications, and in particular to the accuracy of the input
data and structural assumptions. Because sensitivity testing is so essential a part of 
the validation process, you must be careful to build your model in such a way as 
to make this process computationally tractable. 

Implementing the findings and updating the model. Unfortunately,
most tyro management scientists fail to realize that implementation
begins on the very first day of an operations research project. There is no “moment
of truth” when the analyst states, “Here are my results,” and the manager replies,
“Aha! Now I fully understand. Thanks for giving me complete assurance about 
the correct decision.” 

We consider the entire process of implementation in Chap. 22. But we mention 
here the importance of having those executives who must act on the findings
participate on the team that analyzes the problem. Otherwise, the odds are heavy 
that the project will be judged only as a provocative, but inconclusive, exercise. 

It is common for an operations research model to be used repeatedly in the 
analysis of decision problems. Each time, the model must be revised to take account 
of both the specifics of the problem and current data. A good practitioner of
operations research realizes that his model may have a long life, and so documents its
details as well as plans for its updating. 

What’s it all about? Having learned the basic components of the
quantitative analysis process, you should step back to see what the entire approach
accomplishes. 

The major effort is constructing a mathematical representation of a complicated 
situation, along with gathering the required data. The model is essentially
approximate: elaborate enough to capture the essentials, yet gross enough to yield
computable solutions. The balance between detail and tractability is found by a 
trial and error process, involving considerable examination of preliminary findings 
and extensive sensitivity analysis. 

When operations research is applied in a planning context, the solution usually 
consists of a most favorable set of values for the decision variables, with some
information as to the cost of deviating from these values. When management science is
used for developing an operating system, such as a means for controlling
inventories, then the solution consists of a set of decision rules. Often, these rules are
embodied in a computer program. For an inventory system, the computer routines 
analyze historical demand data, permit judgmental adjustments if specified, signal 
when replenishment is to take place, and calculate the reorder amount. 

Only rarely does an operations research solution represent a precise forecast of 
what will happen in the future. Such an accurate prediction would be of interest; 
but the crux of the decision problem is to select among alternatives, not to
forecast. A well-built model makes a valid comparison among the alternatives. In case
this distinction between accurately predicting an outcome and legitimately 
comparing alternatives is puzzling, consider the following illustration. 

A company is about to decide whether or not to open a new plant in Europe. 
An operations research model is constructed that contains forecasts of sales, costs, 
and revenues; the resultant solution probably indicates anticipated production 
levels. If the economic advantage of opening this new plant is relatively insensitive 
to a range of reasonable values for the forecasted figures, then the company can 
make the correct expansion decision. It does not need to commit itself, at the same 
time, to production levels; they would be determined subsequently when more 
accurate demand forecasts are available. 
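
The distinction between predicting and comparing can be sketched in a few lines of code. The example below is purely illustrative: the profit formula, the function names, and every number in it are hypothetical assumptions, not data from the plant example. It only shows the kind of check implied above, namely whether the same decision comes out for every reasonable forecast.

```python
# Hypothetical illustration: the formula and all figures are invented.

def economic_advantage(annual_sales, unit_margin, annual_fixed_cost):
    """Yearly gain from opening the plant, relative to not opening it."""
    return annual_sales * unit_margin - annual_fixed_cost


def decisions_over_range(sales_forecasts, unit_margin, annual_fixed_cost):
    """Map each candidate sales forecast to the decision it would imply."""
    decisions = {}
    for sales in sales_forecasts:
        gain = economic_advantage(sales, unit_margin, annual_fixed_cost)
        decisions[sales] = "open" if gain > 0 else "do not open"
    return decisions


# A pessimistic, a middle, and an optimistic forecast of annual unit sales.
forecasts = [80_000, 100_000, 120_000]
decisions = decisions_over_range(forecasts, unit_margin=5, annual_fixed_cost=300_000)

# The expansion decision is robust if every forecast implies the same choice.
decision_is_robust = len(set(decisions.values())) == 1
print(decisions, decision_is_robust)
```

With these assumed figures every forecast leads to the same choice, so the decision can be made without knowing which forecast is right. Had the forecasts disagreed, better forecasting would be needed before deciding.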

1.6 OPERATIONS RESEARCH, LILLIPUTIAN STYLE

Before explaining how and why quantitative analyses have been valuable in 
aiding executive decision-making, we will examine a few highly simplified
illustrations of operations research models. Since our only purpose is to show what
mathematical decision models look like, we make no pretense about the realism 
of these formulations; you will find more practical versions in the subsequent 
chapters. 

One-Potato, Two-Potato Problem. A frozen-food company processes 
potatoes into packages of French fries, hash browns, and flakes (for mashed 
potatoes). At the beginning of the manufacturing process, the raw potatoes are 
sorted by length and quality, and then allocated to the separate product lines. 

The company can purchase its potatoes from two sources, which differ in their 
yields of various sizes and quality. These yield characteristics are displayed in Fig. 
1.1. Observe that from Source 1, there is a 20% yield of French fries, a 20% 
yield of hash browns, and a 30% yield of flakes; the remaining 30% is
unrecoverable waste. The figures for flakes and waste are also 30% for potatoes from Source
2, but the yield of French fries is relatively higher. 


Figure 1.1. Potato Yields for a Unit of Weight.

Product            Source 1    Source 2    Purchase Limitations
French fries          .2          .3              1.8
Hash browns           .2          .1              1.2
Flakes                .3          .3              2.4
Relative Profit        5           6

How many pounds of potatoes should the company purchase from each 
source? The answer depends, in part, on the relative profit contributions of the 
sources. These relative figures are calculated by adding the sales revenues associated 
with the yields for the separate products, and subtracting the costs of purchasing 
the potatoes, which may differ between the two sources. (We have used the term 
relative profit contribution because we are ignoring other variable expenses, such as 
sales and distribution costs. These depend only on the products and not the sources 
of the raw potatoes, and so do not affect the purchase allocation decision.) Suppose 
the relative profit contribution is 5 for Source 1 and 6 for Source 2. Even though 
Source 2 is more profitable, it does not follow that the company should purchase 
all of its potatoes from Source 2. 

At least two other factors are relevant to the purchase decision: the maximum 
amount of each product that the company can sell, and the maximum amount 
that the company can manufacture, given its production facilities. To keep the
exposition simple, suppose that the two factors, in concert, imply that total
production cannot exceed 1.8 for French fries, 1.2 for hash browns, and 2.4 for flakes,
where these constants are measured in terms of an appropriate unit of weight
(such as millions of pounds). These restrictions can be expressed mathematically as
follows.

Let $P_1$ denote the amount (in weight) of potatoes that will be purchased from
Source 1, and $P_2$ the amount from Source 2. Then the values for $P_1$ and $P_2$ are
constrained by the linear inequalities

(1)   $.2P_1 + .3P_2 \le 1.8$   for French fries
      $.2P_1 + .1P_2 \le 1.2$   for hash browns
      $.3P_1 + .3P_2 \le 2.4$   for flakes
      $P_1 \ge 0$ and $P_2 \ge 0$.

The nonnegativity restrictions $P_1 \ge 0$ and $P_2 \ge 0$ are imposed because a value
such as $P_1 = -4$ would have no physical significance.

Figure 1.2. Feasible Purchasing Policies.

All the values for $P_1$ and $P_2$ satisfying (1) are shown in the shaded region in Fig.
1.2. Notice that each line in the diagram represents a restriction in (1)
expressed as an equality. The arrow associated with each line shows the direction
indicated by the inequality signs in (1). Explain why a pair of values for $P_1$ and
$P_2$ that satisfies both the French fries and hash brown constraints will also satisfy
the flakes constraint.
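
One way to see the answer (a sketch of the reasoning, not the only route) is to add the French fries and hash brown inequalities and then scale the sum by 3/4:

```latex
\begin{aligned}
(.2P_1 + .3P_2) + (.2P_1 + .1P_2) &\le 1.8 + 1.2 \\
.4P_1 + .4P_2 &\le 3.0 \\
.3P_1 + .3P_2 &\le 2.25 < 2.4
\end{aligned}
```

Any pair of purchases meeting the first two restrictions therefore leaves slack of at least .15 in the flakes restriction, which is why that line never touches the feasible region.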

Optimal values for $P_1$ and $P_2$ are found by making the relative profit
contribution as large as possible, consistent with the constraints. Therefore, the
optimization problem is to

(2)   maximize $(5P_1 + 6P_2)$

subject to (1). In this simple problem, the solution can be exhibited graphically,
as in Fig. 1.3.


Figure 1.3. Relative Profit.



Each of the parallel straight-line segments represents different combinations of
$P_1$ and $P_2$ that give the same value for the linear objective function $5P_1 + 6P_2$.
The highest segment still having a point in the feasible constraint region gives the
optimal value of the objective function, and such a point is an optimal solution.
You can see in Fig. 1.3 that there is only one optimal solution in this example;
it occurs at the intersection of the French fries and hash brown constraints.
Consequently, you can calculate the optimal values by solving the associated
simultaneous linear equations

(3)   $.2P_1 + .3P_2 = 1.8$   for French fries
      $.2P_1 + .1P_2 = 1.2$   for hash browns.

Verify that the optimal answers are $P_1 = 4.5$ and $P_2 = 3$, as shown in Fig. 1.3,
giving an objective-function value of 40.5.

This problem illustrates what is termed a linear programming model. Real
applications of linear programming usually involve hundreds of constraints and thousands
of variables. You will learn how to formulate and solve such models in Chaps. 
2 through 7. 
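
For a problem this small, the graphical argument can be checked by brute force. The sketch below is not the solution method taught in the later chapters; it simply enumerates every corner point of the feasible region (the intersection of each pair of boundary lines), discards the infeasible ones, and keeps the most profitable. The function names are illustrative, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction as F
from itertools import combinations

# Each restriction is written as a1*P1 + a2*P2 <= b, using the data of Fig. 1.1.
CONSTRAINTS = [
    (F(2, 10), F(3, 10), F(18, 10)),   # French fries
    (F(2, 10), F(1, 10), F(12, 10)),   # hash browns
    (F(3, 10), F(3, 10), F(24, 10)),   # flakes
    (F(-1), F(0), F(0)),               # P1 >= 0, written as -P1 <= 0
    (F(0), F(-1), F(0)),               # P2 >= 0, written as -P2 <= 0
]
PROFIT = (F(5), F(6))  # relative profit contributions of Source 1 and Source 2


def intersection(c1, c2):
    """Solve the two restrictions as equalities (Cramer's rule)."""
    a1, b1, r1 = c1
    a2, b2, r2 = c2
    det = a1 * b2 - a2 * b1
    if det == 0:  # parallel boundary lines never meet
        return None
    return ((r1 * b2 - r2 * b1) / det, (a1 * r2 - a2 * r1) / det)


def is_feasible(point):
    """True when the point satisfies every restriction in CONSTRAINTS."""
    p1, p2 = point
    return all(a * p1 + b * p2 <= r for a, b, r in CONSTRAINTS)


def best_vertex():
    """Return (profit, (P1, P2)) for the most profitable feasible corner."""
    corners = []
    for c1, c2 in combinations(CONSTRAINTS, 2):
        point = intersection(c1, c2)
        if point is not None and is_feasible(point):
            corners.append(point)
    return max((PROFIT[0] * p1 + PROFIT[1] * p2, (p1, p2)) for p1, p2 in corners)


profit, (p1, p2) = best_vertex()
print(float(p1), float(p2), float(profit))  # prints: 4.5 3.0 40.5
```

Enumeration works here because a linear objective over a bounded feasible region attains its maximum at a corner point. It becomes impractical for models with hundreds of constraints.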


Secretary Problem. An executive wishes to hire a new secretary, and 
is about to ask a placement service to send qualified girls for him to interview. He 
has found from past experience that he can determine from an interview whether 
a girl, if hired, will turn out to be terrific, good, or just fair. He assigns a relative 
value of 3 to a terrific secretary, 2 to a good one, and 1 to a fair one. His previous 
experience also leads him to believe that there is a .2 chance of interviewing a 
girl who will be a terrific secretary, a .5 chance that she will be a good one, and
a .3 chance that she will be a fair one.

He wishes to see only three girls at most. Unfortunately, if he does not hire a 
girl immediately after an interview, she will take another job: hence, he has to 
decide right away. 

If the first girl he sees is terrific, he will hire her immediately, of course. And if 
she is fair, he has nothing to lose by interviewing a second girl. But if the first girl 
looks good, then he is not sure what to do. If he passes her by, he may end up with 
only a fair secretary. Yet if he hires her, he surrenders the chance of finding a 
terrific girl. Similarly, if he chooses to see a second girl, he will again face a difficult 
decision in the event that she turns out to be good. 

The selection problem can be displayed conveniently by a so-called decision tree,
shown in Fig. 1.4. The circled nodes represent the interviewed girls, and the 
branches from these nodes show the chance events and their probabilities. The 
boxes indicate where a decision must be made, and the number at the end of a 
branch gives the relative value of stopping the decision process at that point. 

The problem of finding an optimal decision strategy can be solved by what is 
termed dynamic programming, and in particular, by a process known as backward 
induction. You will study dynamic programming models and solution techniques in 
Chaps. 8 through 12, and Chaps. 17 and 18. The solution process is so simple in
this example that you can compute the optimal hiring strategy very easily, as we
have done below. 

Suppose that the executive does end up interviewing, and hence hiring, a third 
girl. Then the expected value associated with the uncertain event is 

(4)   $3(.2) + 2(.5) + 1(.3) = 1.9$.

In other words, the average value of a girl selected at random for an interview is
1.9. Assume that this expectation legitimately represents the executive’s evaluation
of the chancey event. Mark the number 1.9 above the circled Node 3 in Fig. 1.4.

Figure 1.4. Decision Tree for Secretary Problem. (Legend: T = Terrific, G = Good, F = Fair.)

Next consider what happens if the executive does interview a second girl, and 
she turns out to be good. If he decides to stop, then he obtains a value 2. But if 
he continues, then he can expect to receive only the value 1.9. So he should stop 
when the second girl looks good. Put an X on the branch indicating “Continue” 
when the second secretary is good; this signifies not to take that action. 

Now you are ready to determine the correct decision if the first girl looks good. 
By stopping, the executive would obtain the value 2. But if he continues, then the 
expected value associated with the chancey outcome of the second interview, and 
possibly the third, is 

(5)   $3(.2) + 2(.5) + 1.9(.3) = 2.17$.

The first term in (5) is for the event of seeing a terrific secretary, whom he hires; 
the second term is for the event of seeing a good secretary, whom he hires, as you 
already determined in the preceding paragraph; and the third term is for the event 
of seeing a fair secretary, and consequently continuing to the third chancey event 
that has a value of 1.9, given in (4). Since 2.17 is larger than 2, the executive 
should pass up the first girl if she turns out to be good. Mark 2.17 above the circled 
Node 2 in Fig. 1.4, and put an X on the branch indicating “Stop” when the first 
secretary is good. 

To summarize, the optimal policy is to stop after the first interview only if the 
girl is terrific, and to continue after the second interview only if the girl is fair. 
The overall expected value of the interviewing process, given that the executive
acts optimally, is 

(6)   $3(.2) + 2.17(.5) + 2.17(.3) = 2.336$.

Explain why. Mark this number above the circled Node 1 in Fig. 1.4. Since the
quantity 1.9, calculated in (4), also represents the expected value if the executive
interviews only a single secretary and hires her, the difference ($2.336 - 1.9 = .436$)
is the incremental value from interviewing as many as two more girls.
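
The backward induction in (4) through (6) can be sketched as a short routine. The function and variable names below are illustrative assumptions; the values and probabilities are those stated in the problem. Starting from the last interview, where the executive must hire, the routine compares the value of stopping with the expected value of continuing.

```python
from fractions import Fraction as F

VALUES = {"terrific": F(3), "good": F(2), "fair": F(1)}
PROBS = {"terrific": F(2, 10), "good": F(5, 10), "fair": F(3, 10)}


def backward_induction(max_interviews):
    """Work from the last interview back to the first.

    values[k] is the expected value just before interview k under an
    optimal policy; policy[k] is the set of ratings at which to stop.
    """
    values, policy = {}, {}
    continue_value = None  # no further interviews after the last one
    for k in range(max_interviews, 0, -1):
        stop_here = set()
        expected = F(0)
        for rating, value in VALUES.items():
            if continue_value is None or value >= continue_value:
                stop_here.add(rating)            # hiring now is at least as good
                expected += PROBS[rating] * value
            else:
                expected += PROBS[rating] * continue_value
        values[k], policy[k] = expected, stop_here
        continue_value = expected
    return values, policy


values, policy = backward_induction(3)
print([float(values[k]) for k in (3, 2, 1)])  # prints: [1.9, 2.17, 2.336]
print(policy[1], sorted(policy[2]))
```

The three printed values correspond to Nodes 3, 2, and 1, and the policy sets reproduce the rule summarized above: stop after the first interview only for a terrific candidate, and after the second for a terrific or good one.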

Where-or-When Production Problem. The name of this problem 
arises from the observation that the associated mathematical model has several 
interpretations. One is in terms of deciding optimal production levels at each of 
several plants in a single time period; another is in terms of choosing optimal 
production levels at a single plant in each of several time periods. (The model also 
can be interpreted as a combination of the two problems, that is, as a
where-and-when problem.)

Starting with the multiplant version, suppose a company has N plants, and 
must manufacture a total of D units of a single item during a stated time period. 
Hence, letting $x_t$ denote the amount of production at Plant $t$, the levels
$x_1, x_2, \ldots, x_N$ must satisfy the constraints

(7)   $x_1 + x_2 + \cdots + x_N = D$ and all $x_t \ge 0$.

Assume that the cost of producing $x_t$ at Plant $t$ is given by $(1/c_t)x_t^2$, where $c_t > 0$
is known from historical accounting information. Consequently, optimal values
for the $x_t$ are those that

(8)   minimize $\left( \dfrac{x_1^2}{c_1} + \dfrac{x_2^2}{c_2} + \cdots + \dfrac{x_N^2}{c_N} \right)$

subject to (7).

This optimization problem can be solved by dynamic programming methods
as well as by some simple nonlinear programming techniques, which are
discussed in Chaps. 14 and 15. The numerical answers can be easily computed from
the insightful formula

(9)   optimal $x_t = \dfrac{c_t D}{c_1 + c_2 + \cdots + c_N}$, for $t = 1, 2, \ldots, N$,

which yields the associated minimum cost

(10)  $\dfrac{D^2}{c_1 + c_2 + \cdots + c_N}$.

Turning to the multiperiod version, suppose you interpret $x_t$ as being the level
of production in a single plant during Period $t$. Notice that in this version, all the
costs are due to production, and no storage costs are incurred while the units are
inventoried from Period 1 to the end of Period $N$, when the demand requirement
$D$ must be met. Given this view, you can state what would be the optimal value
for $x_t$ if the preceding levels $x_1, x_2, \ldots, x_{t-1}$ were already determined [not
necessarily by (9)], namely,

(11)  optimal conditional $x_t = \dfrac{c_t (D - x_1 - x_2 - \cdots - x_{t-1})}{c_t + c_{t+1} + \cdots + c_N}$
      ($x_1, x_2, \ldots, x_{t-1}$ are specified).

As you can verify, calculating $x_1, x_2, \ldots, x_N$ recursively (that is, successively, one
by one, starting with $x_1$) from (11) yields the same values as computing each of
them from (9).
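
The verification can be sketched numerically. The code below assumes the quadratic cost $x_t^2/c_t$ and reads formula (9) as $x_t = c_t D/(c_1 + \cdots + c_N)$ and formula (11) as $x_t = c_t(D - x_1 - \cdots - x_{t-1})/(c_t + \cdots + c_N)$. The cost coefficients and the demand are hypothetical figures chosen only for illustration, and the function names are likewise assumptions.

```python
from fractions import Fraction as F


def direct_levels(c, demand):
    """Formula (9): split the demand in proportion to the coefficients c_t."""
    total = sum(c)
    return [c_t * demand / total for c_t in c]


def recursive_levels(c, demand):
    """Formula (11): choose each level given the levels already fixed."""
    levels = []
    remaining = demand  # demand not yet covered by earlier levels
    for t in range(len(c)):
        x_t = c[t] * remaining / sum(c[t:])
        levels.append(x_t)
        remaining -= x_t
    return levels


def total_cost(c, levels):
    """Objective (8): the sum of x_t squared divided by c_t."""
    return sum(x * x / c_t for c_t, x in zip(c, levels))


# Hypothetical data: three plants (or periods) and a requirement of 16 units.
c = [F(1), F(2), F(5)]
D = F(16)

direct = direct_levels(c, D)
recursive = recursive_levels(c, D)
print([float(x) for x in direct], float(total_cost(c, direct)))
```

For these assumed figures both routes give levels of 2, 4, and 10, and the cost of 32 equals $D^2/(c_1 + c_2 + c_3) = 256/8$, in agreement with (10).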

Economic Order Quantity Problem. In Chaps. 9 and 19, and
Appendix II you will study a variety of inventory replenishment models that have
proved successful in practice. The formulation below is perhaps the simplest such 
model. Its precise assumptions are only rarely satisfied in real life. Nevertheless, the 
resultant solution turns out to be sufficiently close to optimal for many practical 
situations as to make it a very useful approximation. 

Consider a company that consumes (or sells) an item at the rate of, say, M 
units per week. For simplicity, suppose there is no uncertainty about this
consumption. Hence, if the inventory level is kM units, then this stock is depleted in exactly
k weeks. Further, suppose the rate M is unchanging over time, so that the company 
must regularly place a replenishment order. The decision problem is to determine 
the most economical order quantity. (Assuming that the delivery time for an 
order is also known exactly, each replenishment action is initiated early enough so 
that the order arrives just when the inventory level falls to zero.) 

Let the order quantity be denoted by $Q$. Then the level of inventory can be
pictured by the sawtooth pattern shown in Fig. 1.5. Observe that each time a
replenishment arrives, the inventory level shoots up by the order quantity $Q$. Then the
level diminishes, as shown by the downward slope of the sawtooth, which equals
$-M$.

Figure 1.5. Pattern of Inventory Levels. (Legend: M = demand per week, Q = order quantity.)


An optimal order quantity strikes a balance between the costs associated with 
replenishing and with holding inventory. Specifically, assume that a fixed setup 
cost JC is incurred each time an order is placed, that a purchase cost c is paid for 
each item ordered, and that a holding cost A is assessed for each unit of inventory 
held per week. The setup cost is related to the effort expended in placing and 
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receiving the order. The holding cost is associated with storage, insurance, and the 
capital tied up in inventory. 

Let the economic criterion of effectiveness be measured as average cost per
week. Then the contribution due to setup costs is KM/Q, since there are M/Q
setups per week. The contribution due to purchase costs is cM, since M items are
consumed per week. And the contribution due to holding costs is h(Q/2), since
Q/2 is the average level of inventory, as you can see in Fig. 1.5. Adding the
components, you have

(12)    average cost per week = AC = $\frac{KM}{Q} + cM + \frac{hQ}{2}$.

The economic order quantity that minimizes AC is

(13)    optimal $Q = \sqrt{\frac{2KM}{h}}$,

which can be found by setting the derivative of AC with respect to Q equal to 0, and
solving for Q. It follows from (13) that the optimal order quantity only doubles
when the demand rate quadruples. Also note that the setup and holding costs
affect the optimal quantity only through their ratio K/h.
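
To see the economic order quantity formula at work, here is a minimal sketch in Python. It assumes that the average cost per week is KM/Q + cM + hQ/2, with setup cost K, purchase cost c, holding cost h, and demand rate M, so that setting the derivative -KM/Q^2 + h/2 equal to 0 gives Q = sqrt(2KM/h). The function names and the numerical data are illustrative assumptions, not part of the model itself.

```python
import math


def average_cost(q, m, k, h, c=0.0):
    """Average cost per week: setup + purchase + holding contributions."""
    return k * m / q + c * m + h * q / 2


def economic_order_quantity(m, k, h):
    """Order quantity that minimizes the average cost per week."""
    return math.sqrt(2 * k * m / h)


# Illustrative data (assumed): M = 5 units per week, K = 90, h = 4, c = 0.
M, K, H = 5, 90, 4
q_opt = economic_order_quantity(M, K, H)
print(q_opt, average_cost(q_opt, M, K, H))    # 15.0 60.0

# Quadrupling the demand rate only doubles the optimal order quantity.
print(economic_order_quantity(4 * M, K, H))   # 30.0
```

With these figures the optimal quantity is 15 units at an average cost of 60 per week, and a fourfold increase in demand raises the optimal quantity only to 30.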

OR Airline Problem. The One-Ride Airline Company is opening a
reservation service to be located in a suburban shopping center. A passenger
making reservations will be able to telephone the office and state his request.
The OR Airline Company wants to decide how many telephone lines to install
for answering reservation calls. It can easily compute the telephone and personnel
expenses that vary with the number of lines. But it also wishes to compare the level
of service for several different numbers of lines. In particular, suppose the company
seeks to determine the percentage of time all the lines will be busy and the average
length of such busy periods.

This sort of analysis is classified as queuing or waiting line theory, and is
explained in Chaps. 20, 21, and Appendix III. We could construct an explicit model
and subject it to rigorous mathematical analysis, but instead, we explain here how
the method of simulation can be used to determine the service figures. To keep the
explanation easy, we present only a rudimentary technique, and suggest that you read
Chap. 21 for a more detailed exposition of the simulation approach.

Suppose the company obtains data showing the statistical frequencies of minutes
between successive incoming telephone calls. As a first approximation, assume that
these successive interarrival times are completely independent (such independence
does not hold precisely if, for example, a passenger calls up, finds the lines busy,
and immediately redials the number). A convenient way to summarize this
distribution is to use a pie diagram, such as the one in Fig. 1.6, where we assume
for simplicity that the time between incoming calls never exceeds 5 minutes.

Figure 1.6. The Distribution of Telephone Interarrival Times.

Imagine a pointer, or spinner, affixed to the center of the pie diagram—the 
mechanism would look something like a wheel of chance at carnivals, or a device
that is often included in a child’s game to determine how many advances a player’s 
piece may take at each turn. You can simulate the traffic of incoming calls by giv¬ 
ing the pointer a succession of sharp spins, and jotting down the resultant sequence 
of interarrival times. 

In addition, suppose the company has a frequency distribution of the number 
of minutes that incoming calls require. Assume that these service times are inde¬ 
pendent of each other and of the interarrival times. Then another pie diagram 
and spinner mechanism can be constructed for generating the service times. 
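
The pie diagram and spinner can be imitated on a computer by drawing outcomes at random with probabilities equal to the slices of the pie. The sketch below is only an illustration: the frequencies in INTERARRIVAL_PIE are hypothetical (they are not the data of Fig. 1.6), and the helper name spin is our own.

```python
import random

# Hypothetical pie diagram: minutes between calls -> share of the pie (assumed data).
INTERARRIVAL_PIE = {1: 0.35, 2: 0.25, 3: 0.20, 4: 0.15, 5: 0.05}


def spin(pie, rng):
    """One spin of the pointer: each outcome is chosen with probability equal to its slice."""
    return rng.choices(list(pie), weights=list(pie.values()), k=1)[0]


rng = random.Random(1)  # fixed seed so that the run can be repeated
interarrival_times = [spin(INTERARRIVAL_PIE, rng) for _ in range(10)]

# Accumulate the gaps to obtain the instants at which the calls arrive.
arrival_instants = []
clock = 0
for gap in interarrival_times:
    clock += gap
    arrival_instants.append(clock)

print(interarrival_times)
print(arrival_instants)
```

A second dictionary of service-time frequencies, spun in the same way, would play the role of the second pie diagram.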


[Figure: timeline showing Line 1 alternating between busy and free, with incoming calls 1 through 11 marked on the time axis.]

Figure 1.7. Simulated History for One Telephone Line.

You are now ready to simulate the system. To begin, suppose there is only a 
single telephone line. Then a simulated history may look like that in Fig. 1.7. 
The instants of incoming calls are recorded with X’s on the time axis, and are 
determined by successive spins of the pointer mechanism for interarrival times. 
The telephone line becomes busy as soon as the first call arrives. The length of 
the busy period is determined by a spin of the pointer mechanism for service 
times. Notice that the calls arriving at the instants circled in Fig. 1.7 are not 
answered because the single telephone line is busy. You can obtain a good estimate 
of the percentage of time the line will be busy and the average length of a busy
period by calculating the corresponding statistics for a fairly long simulated run. 

Suppose next that there are two telephone lines. Then the same sequence of 
incoming calls can lead to a service history like that in Fig. 1.8. Observe that more 
incoming calls are answered. Here a busy period is defined to be a length of time 
during which both telephone lines are tied up. Note how these periods are shorter 
in Fig. 1.8 than in Fig. 1.7 for the particular history. As you can see, the same 
approach can be used to estimate service for any number of lines. 
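
The whole procedure can be sketched in Python as follows. This is a rough illustration under stated assumptions, not a definitive implementation: the two frequency tables are hypothetical, a call that finds every line busy is simply lost, a line that frees at the very instant a call arrives is treated as available, and the run ends when the last call has arrived and the last conversation is over. All function and variable names are our own.

```python
import random

# Hypothetical pie diagrams (minutes -> relative frequency); assumed data.
INTERARRIVAL_PIE = {1: 0.35, 2: 0.25, 3: 0.20, 4: 0.15, 5: 0.05}
SERVICE_PIE = {1: 0.25, 2: 0.50, 3: 0.25}


def spin(pie, rng):
    """One spin of the pointer for the given pie diagram."""
    return rng.choices(list(pie), weights=list(pie.values()), k=1)[0]


def generate_calls(n_calls, rng):
    """Return (arrival instant, service time) pairs for n_calls incoming calls."""
    calls, clock = [], 0
    for _ in range(n_calls):
        clock += spin(INTERARRIVAL_PIE, rng)
        calls.append((clock, spin(SERVICE_PIE, rng)))
    return calls


def simulate(calls, n_lines):
    """Replay the calls on n_lines lines and collect the service figures."""
    free_at = [0] * n_lines   # instant at which each line next becomes free
    busy_periods = []         # [start, end] intervals with all lines tied up
    lost = 0
    for arrival, service in calls:
        line = min(range(n_lines), key=free_at.__getitem__)
        if free_at[line] > arrival:       # every line is busy: call not answered
            lost += 1
            continue
        free_at[line] = arrival + service
        end = min(free_at)
        if end > arrival:                 # this call took the last free line
            if busy_periods and busy_periods[-1][1] == arrival:
                busy_periods[-1][1] = end  # merge back-to-back busy periods
            else:
                busy_periods.append([arrival, end])
    horizon = max(calls[-1][0], max(free_at))
    total_busy = sum(end - start for start, end in busy_periods)
    return {
        "lost_calls": lost,
        "fraction_all_busy": total_busy / horizon,
        "average_busy_period": total_busy / len(busy_periods) if busy_periods else 0.0,
    }


rng = random.Random(7)
calls = generate_calls(200, rng)
for n_lines in (1, 2, 3):
    print(n_lines, simulate(calls, n_lines))
```

Because the same list of calls is replayed for each number of lines, differences in the printed figures reflect the number of lines alone rather than the luck of the spins.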
Figure 1.8. Simulated History for Two Telephone Lines.


Commentary. As we stated at the beginning of this section, the above
examples illustrate how mathematical models are constructed to analyze decision 
problems. We have made no attempt to be “realistic” in these examples. But as 
you continue reading this book, you will discover how to build practical models, 
and learn ways to solve problems with them. 


1.7 AT THE END OF THE RAINBOW

Today, virtually every major corporation employs personnel who are responsible 
for applying operations research. Usually, these people constitute a staff group at 
headquarters level. The group often reports to the controller, the chief financial 
officer, or the head of corporate planning; but with growing frequency, companies 
are also assigning operations researchers to report directly to line managers. In 
a parallel fashion, operations research activity has enjoyed a widespread growth 
within federal and local governments, as well as in other nonprofit organizations. 
This section tells why management science has succeeded so well.

Merits of a rational process. Obviously, executives must and do
make decisions all the time. For a particular situation, an operations research 
model may yield the very same conclusion that an experienced manager would 
arrive at solely on intuitive grounds. Therefore, the benefits of using operations 
research have to be evaluated in terms of its long-run impact on the entire mana¬ 
gerial process. 

The proper comparison is well represented by the question, “If a company does 
not use operations research to guide the decision process, then what will it use, 
and will the answers be consistently as good?” Corporations that apply operations 
research—even when the approach does not meet all of the company’s initial 
expectations—find that analyzing complicated managerial problems this way is 
a sounder method than traditional means. This assertion is amply borne out by 
the steadily increasing support given to operations research in both the private 
and public sectors. 
Qualities inherent in this particular rational approach make it a valuable
method, regardless of whether a company’s operations research personnel attain 
the highest level of accomplishment for a specific project. These benefits include: 

• Emphasis on assessing the system-wide interactions and ramifications of 
decision alternatives. Intrinsic in an operations research approach is the 
construction of a model that synthesizes the segments of an enterprise that are 
affected by a decision choice. Each individual part is constructed by personnel 
who are the most knowledgeable about the relevant data. 

• Impetus to developing a full range of decision alternatives. The number of
action possibilities that can be analyzed increases tremendously by the
application of mathematics and computers.

• Focus on resolving the critical issues. The approach proceeds in the fashion 
of establishing implications of the form, “if Hypothesis H is true and Action 
A is taken, then Result R will occur.” The method fosters interdepartmental 
communication. As a consequence, clashes of opinion within an organization 
can be sorted into disagreements over the probable truth of different hypo¬ 
theses, and over the assumptions used in deriving the implications of different 
actions. 

Important ancillary benefits emerge from the direction provided by the model 
for gathering data, quantifying the value of additional information, and document¬ 
ing factual knowledge that may be required in subsequent decision analyses. 

Having listed several of the important merits stemming from the rational 
analytic process of operations research, we hasten to point out that the advantages 
will occur to a greater or lesser degree, depending on the skill used in carrying out 
the study. 

Managerial cutting edge. The preceding paragraphs dealt with why 
operations research is helpful in analyzing decision problems. Now we discuss 
how the approach is beneficial. We have classified the ways into four somewhat 
arbitrary and partially overlapping categories: 

i. Better Decisions. Frequently, operations research models yield actions that do
improve on intuitive decision-making. A situation may be so complex (because of
intricate interrelationships among decisions, voluminous data pertinent to
operations, and uncertainties of market activity) that the human mind can never hope
to assimilate all the significant factors without the aid of
operations-research-guided computer analysis.

Of course, in the past managers have made decisions in these situations without 
the aid of operations research. They had to. But the depth of their understanding 
and the quality of their decisions improve with the application of such models, 
as considerable experience has shown. Particular decisions may ultimately turn 
out to be wrong, but the improved decision-making process reduces the risk of 
making such erroneous decisions. 
ii. Better Coordination. Sometimes operations research has been instrumental
in bringing order out of chaos. The following example, drawn from an actual 
application, illustrates what can happen. 

During special campaigns, a food manufacturer runs advertising that signi¬ 
ficantly increases sales volume. But manufacturing production facilities are limited, 
the supply of the foodstuff is limited, and the sales response is often erratic. In 
the past, consequently, the marketing and manufacturing divisions have been at 
opposite poles in terms of cooperative actions. An operations-research-oriented 
planning model becomes a vehicle for coordinating marketing decisions within 
the limitations imposed on manufacturing capabilities. 

iii. Better Control. The managements of large organizations recognize that it is
extremely costly to require continuous executive supervision over routine
decisions. Operations research approaches combining historical experience with the
scientific method have resulted in standardized and reliable procedures for handling
everyday activities and for signaling dangerous trends. Executives have thereby
gained new freedom to devote their attention to more pressing matters, except for
those unusual circumstances which, when they arise, necessitate reviewing the
course of everyday action. The most frequently adopted applications in this
category deal with production scheduling and inventory replenishment.

iv. Better Systems. Often, an operations research study is initiated to analyze a
particular decision problem, such as whether to open a new warehouse.
Afterwards, the approach is further developed into a system to be employed repeatedly.
Thus the cost of undertaking the first application may produce benefits that are
longer lasting than originally envisioned.

Where the action is. By this time, applications of operations research 
are so common in industry and government, and so diverse in the functional 
areas of decision-making, that we cannot hope to provide a complete survey. To
give you some idea, however, we mention that there are numerous applications 
in industries such as aircraft, apparel, chemicals, cement, glass, computers, elec¬ 
tronics, farm and industrial machinery, food, metal manufacturing and products, 
mining, motor vehicles, paper and wood products, petroleum refining, and phar¬ 
maceuticals, as well as in commercial banks, insurance companies, merchandising 
firms, public utilities, and transportation companies. 

Depending on the industry, the applications pertain to extraction of natural 
resources, manufacturing, transportation and warehousing, plant size and loca¬ 
tion, inventory management, scheduling of men and machines, forecasting, new 
product development, marketing, advertising, cash management and finance, 
portfolio management, mergers, and both short- and long-range corporate plan¬ 
ning. 

Most companies’ early operations research projects deal with monthly or quar¬ 
terly scheduling, annual planning, inventory control, and other fairly well-defined 
areas of decision-making. After the operations research group demonstrates its
capability in these areas, a company then applies its operations research talents 
to the study of high-level strategic problems, such as selecting new plant sites, 
entering new markets, acquiring overseas affiliates, and so forth. 

Operations research has provided a significant advance in the techniques of 
long-range strategic planning. Even senior executives have difficulty piecing 
together all the important considerations involved in a well-designed long-range 
plan. What is more, the operations research approach lends itself to the formula¬ 
tion of contingency plans, that is, a complete strategy indicating which courses 
of action are appropriate for various future events. In addition, the findings may 
include directions for obtaining and then utilizing critical information about such 
future events. In this way, the operations research model suggests the actions to be 
taken immediately and the ones to be postponed, and when to undertake a 
reassessment. For these reasons, more and more frequently, boards of directors 
of large corporations find strategic proposals being justified on the basis of exten¬ 
sive operations research studies. 

Profitable applications in nonprofit organizations. The
growth of operations research in government and nonprofit corporations has been
phenomenal. A long succession of military applications began during World
War II. Now governmental applications involve health, education, and welfare;
air and highway traffic control; air and water pollution; police and fire
protection; voter and school redistricting; and annual planning and budgeting by
program, to name only a few.

Certainly, much of the credit for the great impetus in the adoption of operations
research in the public sector goes to the RAND Corporation (located in Santa
Monica). Many fundamental concepts and techniques in operations research
can be traced to the innovative ideas of RAND scientists. At present, there are 
several other research organizations similar to RAND serving the Federal Govern¬ 
ment; a number of these (RAND among them) work on state and city manage¬ 
ment problems as well as on military projects. 

How different is operations research applied in the public sector from that used 
in the private sector? The answer depends on the aspects of operations research 
you are considering. Specifically, the problem-solving characteristics, the ensuing 
benefits, the emphasis on model-building, the limitations of the scientific method, 
and the analytic process discussed above, all hold with equal validity in applica¬ 
tions to governmental and nonprofit organizations. 

There does seem to be a noticeable difference, however, in the sorts of solution 
techniques used. For example, linear programming has gained widespread 
acceptance in industry, but is employed only occasionally in governmental plan¬ 
ning. The opposite has been true of simulation models. 

Sometimes it is claimed that the lack of clear-cut objective functions to be
optimized in nonprofit organizations raises a significant difference between
applications in the private and public sectors. Industry most often measures
improvement in terms of contribution to profit, but this criterion is by no means the only
relevant one for decision-making. Businesses are always compromising among
different objectives. Therefore, we feel that the absence of a profit measure is less
important than it might seem at first glance.

Probably the most important difference between the public and private sectors 
concerns the exercise of decision-making responsibilities. The organizational struc¬ 
tures of big corporations are complex, and the authority for taking actions may 
not always be precisely defined. But these structures are simple indeed in compari¬ 
son with most governmental structures. The difference can be stated this way. 
In a commercial company, there is no one left to pass the buck to, once the neces¬ 
sity for making a decision reaches top management (or the board of directors). 
In an organization such as the Federal Government, even the President’s decisions
are subject to the review of—and thus become partly the responsibility of— 
Congressmen, who, along with the President, are publicly elected. Understand 
that diffuse responsibilities and authority only make it difficult to apply opera¬ 
tions research outside of industry. As the record shows, plenty of applications 
are being made in governmental and nonprofit organizations. 

1.8 IN THE BEGINNING...

Studying linear optimization models is an excellent way to begin learning about 
operations research. Hence, this subject is taken up in the next chapter and 
several to follow. As you read about linear and other models throughout the text, 
try to keep in mind the boundaries and the process of quantitative analysis (which 
we discussed in Secs. 1.3 and 1.4). By so doing, you will maintain a perspective 
on the strengths and limitations of operations research. 


REVIEW EXERCISES 

1 Using the stages in the process of quantitative analysis as outlined in Sec. 1.5, explain 
how you would apply such an approach to the decision-making problems below. 
Also state what you think would be the merits of applying a rational process and what 
would be the associated managerial cutting edge (as discussed in Sec. 1.7). 

(a) Exploring and drilling for crude oil. 

(b) Harvesting of timber lands. 

(c) Assigning customer orders to each of several steel rolling mills in a large steel 
corporation. 

(d) Establishing field warehouses for the distribution of canned food.

(e) Locating and deciding the capacity of an ore reduction plant.

(f) Market-testing a new packaged soap product.

(g) Locating new sites for grocery stores.

(h) Establishing an advertising budget for the promotion of a soft drink.

(i) Determining the size of a bank balance for an aircraft company. 

(j) Selecting a portfolio of securities in an insurance company. 
(k) Deciding whether to merge two railroads.

(l) Planning for future growth in a computer manufacturing company. 

(m) Deciding whether to automate a post office. 

(n) Allocating funds to urban renewal projects. 

(o) Selecting an intercontinental ballistic weapon system. 

(p) Designing an effective national welfare program. 

(q) Selecting a route for a superhighway. 

(r) Establishing a single nationwide telephone number for emergency police services. 

(s) Determining the number and location of fire stations in a city. 

(t) Deciding whether to unify two separate school districts. 

(u) Allocating funds among participating agencies in the United Fund.

One-Potato, Two-Potato Problem (Sec. 1.6). In exercises 2 through 5, draw a
diagram showing the feasible purchasing policies and the optimal solution. Also
calculate optimal values for P_1 and P_2 and the associated value of the objective
function. Assume all the data as given in Fig. 1.1 except for the specific change
indicated below in each separate part of the exercise.

2 (a) Purchase limitation on French fries of 1.7. Of 1.9. 

(b) Purchase limitation on hash browns of 1.1. Of 1.3.

(c) Purchase limitations on French fries of 1.7 and on hash browns of 1.1. 

(d) Purchase limitations on French fries of 1.9 and on hash browns of 1.3. 

(e) Purchase limitation on flakes of 2.5. Of 2.3. Of 2.1. Of 1.5. 

3 (a) Relative profit of a unit from Source 2 of 7. Of 8.

(b) Relative profit of a unit from Source 1 of 4. 

(c) Relative profit of a unit from Source 1 of 6 and from Source 2 of 2. 

(d) Relative profit of a unit from Source 1 of 6 and from Source 2 of 3. 

4 (a) Yield of .25 French fries from a unit of weight from Source 1. 

(b) Yield of .2 French fries from a unit of weight from Source 2. 

(c) Yield of .3 hash browns from a unit of weight from Source 1. 

(d) Yield of .15 hash browns from a unit of weight from Source 2. 

5 An additional restriction that 

(a) P_1 ^ 4.

(b) P_2 ^ 2.5.

(c) P_1 ^ 4 and P_2 ^ 2.5.

(d) P_1 ^ 5.

(e) P_2 ^ 4.

(f) P_1 ^ 5 and P_2 ^ 4.

Secretary Problem (Sec. 1.6). In exercises 6 through 11, draw a decision tree, and
find an optimal interviewing policy. Assume all the data as given in Fig. 1.4
except for the specific change indicated below in each separate part of the exercise.

6 The executive is willing to interview up to four girls. What is the incremental value of 
interviewing as many as three more girls instead of only one? What is the incremental 
value of interviewing as many as four girls instead of a maximum of three? 






7 The executive is willing to interview up to five girls. What is the incremental value of 
interviewing as many as four more girls instead of only one? 

8 The relative value of a terrific girl is 4. What is the incremental value of interviewing 
as many as two more girls instead of one? 

9 (a) There is a .2 chance that an interviewed girl will be a good secretary, and a .6
chance that she will be a fair one. What is the incremental value of interviewing
as many as two more girls instead of one?

(b) There is a .1 chance that an interviewed girl will be a terrific secretary, a .4 
chance that she will be good, and a .5 chance that she will be just fair. What is 
the incremental value of interviewing as many as two more girls instead of one? 

10 (a) There is a cost of .15 for each interview.

(b) There is a cost of .15 for each interview and the executive is willing to interview 
up to four girls. 

*11 (a) What is the expected number of girls the executive interviews?

(b) If the executive is willing to interview up to four girls, what is the expected number 
of girls the executive interviews? (Utilize your answer from exercise 1.6.) 

Where-or-When Production Problem (Sec. 1.6). In exercises 12 through 16, assume 
the following data except for the specific change indicated below in each separate 
part of the exercise: 

D = 100, N = 4, c_1 = 1, c_2 = 2, c_3 = 3, c_4 = 4.

Find the optimal value of each x_t and the associated minimum cost.

12 Assume the data as given above.

13 Let D = 200.

14 (a) Let c_1 = 3.

(b) Let c_4 = 2.

(c) Let c_1 = 3 and c_4 = 2.

(d) Let c_1 = c_2 = c_3 = c_4 = 2.5.

15 (a) Let N = 5 and c_5 = 5.

(b) Let N = 5 and c_1 = c_2 = c_3 = c_4 = c_5 = 2.

16 (a) Let x_2 = 36.

(b) Let x_2 = 4.

Economic Order Quantity Problem (Sec. 1.6). In exercises 17 through 20, assume
the following data except for the specific change indicated below in each separate
part of the exercise: consumption rate M = 5, setup cost K = 90, and holding
cost per unit h = 4. To simplify computations, let purchase cost c = 0.

17 Find the optimal economic order quantity, the associated average cost per week, and
the corresponding number of weeks between successive orders.

(a) Assume the data as given above. Then draw a graph of the average cost per week
as a function of the order quantity Q. Indicate the optimal value for Q on your
graph as well as the minimum average cost per week. Draw a graph of the
sawtooth pattern of inventory levels for the optimal policy, analogous to that in
Fig. 1.5.

(b) Let h = 16.

(c) Let K = 360.

(d) Let M = 20.

(e) Let h = 16 and K = 360.

(f) Let h = 16 and M = 20.

(g) Let K = 360 and M = 20.

*18 (a) What is the average cost per week if you use an order quantity that is 10% larger
than optimal?

(b) What is the average cost per week if you use an order quantity that is 10%
smaller than optimal?

*19 (a) What is the actual average cost per week if you use the order quantity that is
optimal for h = 4, but in reality h = 16?

(b) What is the actual average cost per week if you use the order quantity that is
optimal for h = 16, but in reality h = 4?

*20 (a) What is the actual average cost per week if you use the order quantity that is
optimal for M = 5, but in reality M = 20?

(b) What is the actual average cost per week if you use the order quantity that is
optimal for M = 20, but in reality M = 5?

OR Airline Problem (Sec. 1.6). Suppose you simulate the arrivals and service of
telephone calls as follows. Toss four coins and assign an interarrival time of three
minutes when there are four heads, two minutes when there are three heads, one
minute when there are two heads, four minutes when there is one head, and five
minutes when there are no heads (four tails). For each call determine an associated
service time (which occurs if the caller does not find all the telephone lines busy)
by tossing two coins and letting the service time be one minute when there are
two heads, two minutes when there is one head, and three minutes when there are
no heads (two tails).

21 In each experiment below, simulate 20 calls. Calculate the average time between
calls, the average service time of those calls that are answered, the number of calls
not answered (because all the lines are busy), the fraction of time that all the lines are
busy, and the average length of time when all the lines are busy. In performing the
calculations, assume that all the lines are free at the start, that the system shuts down
two minutes after the twentieth call (regardless of whether or not it is answered), and
that no more arrivals occur within this final two-minute interval. Also, draw a diagram
(such as in Figs. 1.7 and 1.8) to display the simulated history. Let the number of
telephone lines be

(a) One. 

(b) Two. 

(c) Three. 

*(d) Calculate the theoretical average time between successive arrivals.

*(e) Calculate the theoretical average service time, assuming that all calls are answered.
CHAPTER 2


Formulation of Linear
Optimization Models


2.1 INTRODUCTION

Unquestionably, linear optimization models are among the most commercially 
successful applications of operations research; in fact, there is considerable evidence 
that they rank highest in economic impact. This chapter initiates your study of 
linear models. 

Qualitative analysis. To set the stage, suppose we consider the
operations of a large firm in a process-oriented industry such as petroleum, chemicals,
metal working, or wood products. Specifically, let us choose a major integrated
oil company. As depicted in Fig. 2.1, the critical decisions in this type of firm
relate to the processes of:

• Exploring for oil deposits 

• Producing crude oil 

• Exchanging proprietary crude oil for other companies’ crudes at different 
locations 

• Purchasing additional crude 

• Shipping the crude to any of several refineries 

• Cracking the crude into several blending stocks 

• Combining the stocks into several dozen petroleum products 

• Shipping the manufactured products from the refineries to marketing areas. 

A decision to install a new cracking unit at a refinery, of course, affects the
refinery’s operating efficiency. But equally important, such a decision can and often
does have a decided impact on all the other operations listed. The design
characteristics of the new unit influence the selection and allocation of crudes, as well as
the relative composition and distribution of marketable products. Similarly,
expanding the share of the market for gasoline products in a particular region has
implications for setting refinery operating levels, for making crude-exchange
agreements with other oil companies, and for deciding where to concentrate
exploration activities.

Figure 2.1. An Integrated Oil Company.

For another illustration, we may look at a lumber manufacturing firm that owns
raw timber resources, processes cut logs into various wood products, and
distributes these commodities over a widespread geographical area, as shown in Fig.
2.2. Just as with the petroleum firm, all strategic decisions must reflect the inherent
constraint stemming from the nature of the raw material: any single log yields only
a limited amount of each grade of sawn timber. To earn profits, the firm must
carefully allocate these various grades to the different types of finished products.
As a practical matter, then, the company must produce a slate of products that
fully utilizes the material in the logs; but within this mode of operation there is
still considerable discretion for deciding the particular composition of the final
products slate.

[Figure labels: Fabricating Plant; Pulp and Paper Mill.]

Figure 2.2. Wood Products Firm.

Although both of these illustrations were simplified, the resource allocation 
decision problems posed have close counterparts in actual companies. The com¬ 
plexities to be found in real situations magnify the effort required to provide a 
scientific economic analysis of various decision consequences. Nevertheless, these 
very decision problems are now being solved in many major companies with the 
aid of linear optimization models, usually called linear programming 
models. 

Profit potential. Why do successful companies bother to use such
mathematical aids? Are not management experience, know-how, and intuition sufficient
to ensure sound decisions? Executives with a profit-making outlook have long 
realized that the comprehensive planning of future operations is important to 
their company’s success, and it is a rare manager who neglects such planning. In a 
large enterprise, however, merely recording the factual information required for 
analyzing significant decisions can entail a tremendous effort. Consequently, 
available manpower for testing the likely economic impact of a plan is a limiting 
factor for a company. 

As you will see, linear programming provides a breakthrough in amplifying the 
analytic abilities of a manager and his staff. The mathematical models you will 
study enable an executive to evaluate a wider range of resource allocation plans 
than he would ever have thought possible. As we pointed out in Chap. 1, it is impor¬ 
tant to realize that the results of these tests are not meant to substitute for the man¬ 
ager’s experience and intuition. To the contrary, they provide the precise and 
comprehensive data he requires to apply his knowledge effectively. 

The way it is. The purpose of this chapter is to introduce you to a variety 
of optimizing problems that lend themselves to a linear model formulation. In 
order not to detract from this purpose, the chapter will not deal with the numerical 
techniques for solving the models. That topic is treated in detail in Chap. 4. 

If you want to learn how to apply linear programming, sooner or later you have 
to face up to the fact that real situations are much more complex than the One- 
Potato, Two-Potato Problem given in Chap. 1. Several other Lilliputian examples 
are in the Formulation Exercises at the end of the chapter; these will offer you 
an opportunity to refresh your abilities in writing and graphing linear relationships.
But we think there is no point in our dillydallying with misleadingly simple
toy problems. You need to become accustomed to models having more than two 
or three unknowns and restrictions, and the quicker, the better. Consequently, 
we have chosen to explain more typical-looking linear programming problems. 

By the same token, you must learn to express decent-sized problems in algebraic 
terms, since, with plentiful data, graphical representations are out of the question. 
In a real application you will have to rely on employing an electronic computer
to find a numerical solution. Such computer routines require that you specify the 
model in an algebraic fashion. Therefore, you will need facility in translating verbal 
problem statements into mathematical symbols. In short, you are about to see 
linear programming like it is. 

The first two examples deal with product- and feed-mix selections, and are 
prototypes of many actual applications. The third example treats fluid-blending;
it includes considerations found in both mix selection problems and also imposes
additional restrictions based on the manufacturing technology.

All three examples are called static models because they deal with decisions that 
occur only within a single time period. The formulation required for multiperiod 
or dynamic models is explored in Sec. 2.6. 

An illustration of an important class of models known as transportation problems 
is given in Sec. 2.7. Many large-scale linear programming models combine
transportation and mix selection problems, and are posed in a dynamic context.

We expect that as you continue reading this chapter you will become increasingly
familiar with ways to specify a linear programming model. Hence, we present
more detailed explanations in the first sections than we do toward the end of
the chapter. Here and there, we advise you to verify certain statements, so you can 
check whether you have indeed been following the progression of ideas. 

Study guide. In reading the examples below, ask yourself 

1. What is the key decision to be made? What problem is being solved? 

2. What makes the real decision environment so complex as to require the use
of a linear optimization model? What elements of this complexity are
embodied in the model? What elements are ignored?

3. What distinguishes a practical decision from an unusable one in this environment?
What distinguishes a good decision from a poor one?

4. If you were a manager, how would you employ the results of the analysis?
What would be your interpretation of the answers? In what ways might 
you want or need to temper the results because of factors not explicitly 
considered in the models? 

After presenting these examples, we will return to a discussion of the application 
of linear programming models in today’s business firms. 

2.2 PRODUCT-MIX SELECTION 

In product-mix selection, the decision-maker wishes to determine the levels for 
a number of production activities during a specified period of time. These levels 
are constrained by technological or feasibility considerations, given in the form of 
linear equalities or inequalities. Subject to these restrictions, management seeks 
to optimize a particular objective function. In the instance below, the objective 
will be maximum profit. In the subsequent examples, you will see other objectives, 
such as minimum cost. 



Suppose the Knox Mix Company has the option of using one or more of four 
different types of production processes. The first and second processes yield items 
of Product A, and the third and fourth yield items of Product B. The inputs for 
each process are labor measured in man-weeks, pounds of Material Y, and boxes of 
Material Z. Since each process varies in its input requirements, the profitabilities of 
the processes differ, even for processes producing the same item. The manufacturer, 
deciding on a week’s production schedule, is limited in the range of possibilities by 
the available amounts of manpower and of both kinds of raw materials. The full 
technology and input restrictions are given in Fig. 2.3. 


                          One Item of Product A     One Item of Product B        Total
Item                      Process 1    Process 2    Process 3    Process 4    Availabilities

Man-weeks                     1            1            1            1           ≤ 15
Pounds of Material Y          7            5            3            2           ≤ 120
Boxes of Material Z           3            5           10           15           ≤ 100

Unit profit ($)               4            5            9           11          Maximize
Production level             x_1          x_2          x_3          x_4




Figure 2.3. Knox Mix Product Selection. 


We could jump immediately to taking the data in Fig. 2.3 and writing a linear 
formulation, just as we did for the One-Potato, Two-Potato Problem in Chap. 1.
But this time we choose to be more contemplative. We consider what assumptions
about the technology are needed to imply linearity.

Linear axioms. We will make two crucial technological and economic 
assumptions about how the production processes operate: 

i. Divisibility. For each activity, the total amounts of each input and the associated
profit are strictly proportional to the level of output, that is, to the activity
level. In other words, each activity is capable of continuous proportional expansion
or reduction. To illustrate, the direct effect of doubling the inputs of any
process is the doubling of output and profits. In particular, to manufacture 10
units by Process 1 (\(x_1 = 10\)), there must be 10 man-weeks, 70 pounds of Material
Y, and 30 boxes of Material Z allocated to the activity, and the accrued profits
are $40. The divisibility postulate further implies that the activity levels are permitted
to assume fractional values as well as integer values. For example, we admit
the technological possibility of a fractional level such as \(x_2 = 2.5\).

ii. Additivity. Given the activity levels for each of the decision variables \(x_j\),
the total amounts of each input and the associated profit are the sums of the inputs
and profit for each individual process. To illustrate, in Fig. 2.3 one unit of Process
1 and one unit of Process 3 require 2 man-weeks, 10 pounds of Material Y, and
13 boxes of Material Z, and yield $13 profit.

The assumptions of divisibility and additivity are equivalent to stating that the 
underlying mathematical model can be formulated in terms of linear relations. 
Strictly interpreted, the axioms imply constant returns to scale and preclude the 
possibility of economies or diseconomies of scale (in both the technology and profit 
aspects). As you can read in Chaps. 13, 14, and 15, suitable mathematical devices
of an advanced nature often make it possible to introduce economies and diseconomies
into a modified linear model. In real situations the above two postulates
may hold only approximately, but nevertheless well enough to permit useful
application of the linear approach. For all the examples in the remainder of this
chapter, we assume the legitimacy of the two assumptions. 
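
The two axioms can be made concrete with a few lines of Python. The sketch below is only an illustration: the coefficients are those of Fig. 2.3, while the function name `totals` and the dictionary layout are our own assumptions. Divisibility appears as "coefficient times level," and additivity as the sum over the four processes.

```python
from fractions import Fraction

# Per-unit data read from Fig. 2.3, one entry per process (1 through 4).
INPUTS = {
    "man-weeks":  [1, 1, 1, 1],
    "Material Y": [7, 5, 3, 2],
    "Material Z": [3, 5, 10, 15],
}
UNIT_PROFIT = [4, 5, 9, 11]


def totals(levels):
    """Return (input usage, profit) for activity levels x_1..x_4.

    Divisibility: each term is coefficient * level.
    Additivity: the terms are summed over the four processes.
    """
    usage = {
        name: sum(c * x for c, x in zip(coeffs, levels))
        for name, coeffs in INPUTS.items()
    }
    profit = sum(p * x for p, x in zip(UNIT_PROFIT, levels))
    return usage, profit


# Divisibility: ten units of Process 1.
print(totals([10, 0, 0, 0]))
# Additivity: one unit each of Process 1 and Process 3.
print(totals([1, 0, 1, 0]))
# Fractional activity levels are admitted as well.
print(totals([0, Fraction(5, 2), 0, 0]))
```

The first two calls reproduce the figures quoted in the text for the two axioms; the third shows that nothing in the arithmetic requires integer levels.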

In the specific example there is one linear inequality relation for each labor and 
material restriction, and one linear relation expressing profitability: 

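\[ \text{maximize profit} = \text{maximize } (4x_1 + 5x_2 + 9x_3 + 11x_4) \tag{1} \]

subject to the constraints

\[
\begin{aligned}
1x_1 + 1x_2 + 1x_3 + 1x_4 &\le 15 && \text{(man-weeks)} \\
7x_1 + 5x_2 + 3x_3 + 2x_4 &\le 120 && \text{(Material Y)} \\
3x_1 + 5x_2 + 10x_3 + 15x_4 &\le 100 && \text{(Material Z).}
\end{aligned} \tag{2}
\]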


There is no physical meaning to negative production levels, such as \(x_1 = -4.2\),
so we do not allow production to be “negative.” We constrain each unknown
production level to be either zero or positive, that is, to be nonnegative:

\[ x_1 \ge 0 \qquad x_2 \ge 0 \qquad x_3 \ge 0 \qquad x_4 \ge 0. \tag{3} \]


(A variable that is allowed also to assume negative values is said to be 
unrestricted in sign.) 

Management’s problem is to find values for all the unknowns \(x_j\) that satisfy the
relations (2), (3), and also maximize profit (1). In general, such values need not be
unique. There may exist alternative optimal solutions.

Rather than get sidetracked at this juncture by discussing a method for calculating
an optimal solution, we postpone the computational question to
Chap. 4. You will find it very helpful in reading Chaps. 4 and 5 to have a copy of
Fig. 2.3, and we urge you to make one. Also, jot down now what you think may
be an optimal solution to this example, so that you can check your guess against
the answer given in Chap. 4.

We will demonstrate in Chap. 4 the remarkable fact that in linear programming
problems having m restrictive relations (not including the nonnegativity
constraints) and possessing finite optimal solutions, there is at least one best solution
in which no more than m activity variables are employed at positive levels.
Therefore in this example, there is an optimal solution using no more than three 
of the variables. 
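
If you would like to see this property at work before reaching Chap. 4, the following sketch may help. It is an exhaustive search, not the technique of Chap. 4, and it is practical only for a model this small. We assume three slack variables (our addition) that turn the inequalities (2) into equations, try every choice of m = 3 columns, and keep the best nonnegative solution. The helper names `solve_square` and `best_basic_solution` are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

# Knox Mix data from Fig. 2.3; three slack columns turn (2) into equations.
A = [
    [1, 1, 1, 1, 1, 0, 0],     # man-weeks
    [7, 5, 3, 2, 0, 1, 0],     # Material Y
    [3, 5, 10, 15, 0, 0, 1],   # Material Z
]
B = [15, 120, 100]
PROFIT = [4, 5, 9, 11, 0, 0, 0]


def solve_square(matrix, rhs):
    """Solve a small square system by Gauss-Jordan elimination; None if singular."""
    n = len(rhs)
    rows = [[Fraction(v) for v in row] + [Fraction(b)]
            for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if rows[r][col] != 0), None)
        if pivot is None:
            return None
        rows[col], rows[pivot] = rows[pivot], rows[col]
        divisor = rows[col][col]
        rows[col] = [v / divisor for v in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[col])]
    return [row[-1] for row in rows]


def best_basic_solution():
    """Try every choice of m columns; keep the most profitable nonnegative one."""
    m, n = len(A), len(A[0])
    best = None
    for basis in combinations(range(n), m):
        sub = [[A[i][j] for j in basis] for i in range(m)]
        values = solve_square(sub, B)
        if values is None or any(v < 0 for v in values):
            continue
        x = [Fraction(0)] * n
        for j, v in zip(basis, values):
            x[j] = v
        profit = sum(p * v for p, v in zip(PROFIT, x))
        if best is None or profit > best[0]:
            best = (profit, x[:4])      # report only the four activities
    return best


profit, plan = best_basic_solution()
print("profit:", profit, "plan:", plan)
print("activities at positive level:", sum(1 for v in plan if v > 0))
```

Because every candidate sets all but three of the seven variables to zero, no plan it reports can use more than three activities at positive levels. Make your own guess first, then run the sketch.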

In addition to wanting optimal values for each \(x_j\), management may also want
to know how profit would be affected by increasing each input, by improving one
of the technological processes, by a change in the cost of raw materials (and
consequently a change in the profitability of the processes), or by some other
resource used in the processes and heretofore not considered limited becoming
scarce. In many real applications of linear programming models, these considerations
are even more important than finding exact optimal values for each \(x_j\).
Techniques for such sensitivity analysis are presented in Chap. 5.

2.3 FEED-MIX SELECTION 

The Hion Hog Farm Company may purchase and mix one or more of three 
types of grain, each containing different amounts of four nutritional elements; 
the data are given in Fig. 2.4. The production manager specifies that any feed 
mix for his livestock must meet at least minimal nutritional requirements, and he 
seeks the least costly among all such mixes. Suppose his planning horizon is
a two-week period, that is, he purchases enough to fill his needs for two weeks. 

Assuming divisibility and additivity, we write the linear formulation as 

\[ \text{minimize cost} = \text{minimize } (41x_1 + 35x_2 + 96x_3), \tag{1} \]


                               One Unit Weight of           Minimal Total Requirements
Item                        Grain 1   Grain 2   Grain 3       Over Planning Horizon

Nutritional ingredient A       2         3         7               ≥ 1250
Nutritional ingredient B       1         1         0               ≥ 250
Nutritional ingredient C       5         3         0               ≥ 900
Nutritional ingredient D      0.6       0.25       1               ≥ 232.5

Cost per unit weight ($)      41        35        96               Minimize
Weight level                  x_1       x_2       x_3

Figure 2.4. Hion Hog Company Feed-Mix Selection.
subject to the constraints

\[
\begin{aligned}
2x_1 + 3x_2 + 7x_3 &\ge 1250 \\
1x_1 + 1x_2 &\ge 250 \\
5x_1 + 3x_2 &\ge 900 \\
0.6x_1 + 0.25x_2 + 1x_3 &\ge 232.5 \\
x_j &\ge 0 \quad \text{for } j = 1, 2, 3.
\end{aligned} \tag{2}
\]


Give a verbal interpretation of (1) and (2). 

Note that in real applications of this model, as well as of the previous model, 
the number of activities can exceed the number of inequality restrictions. In the 
Product-Mix Selection Model, letting \(x_j = 0\) for each variable is feasible in that
all the constraints are satisfied; here the values \(x_1 = 625\) and \(x_2 = x_3 = 0\) provide
one feasible solution. The optimal solution turns out to be \(x_1 = 200\), \(x_2 = 50\), and
\(x_3 = 100\), so that the minimum cost is 19,550.
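
You can confirm these statements by direct substitution. The sketch below, written under the assumption that the data of Fig. 2.4 are as tabulated (the function name `evaluate` is ours), reports for any proposed mix whether it is feasible, what it costs, and by how much it exceeds each minimal requirement.

```python
from fractions import Fraction

# Data from Fig. 2.4: content per unit weight of Grains 1-3, and the minimum.
NUTRIENTS = {
    "A": ([2, 3, 7], 1250),
    "B": ([1, 1, 0], 250),
    "C": ([5, 3, 0], 900),
    "D": ([Fraction(3, 5), Fraction(1, 4), 1], Fraction(465, 2)),
}
COST = [41, 35, 96]


def evaluate(mix):
    """Return (feasible?, cost, surplus over each minimal requirement)."""
    surplus = {
        name: sum(c * x for c, x in zip(coeffs, mix)) - need
        for name, (coeffs, need) in NUTRIENTS.items()
    }
    feasible = all(x >= 0 for x in mix) and all(s >= 0 for s in surplus.values())
    cost = sum(c * x for c, x in zip(COST, mix))
    return feasible, cost, surplus


for mix in ([0, 0, 0], [625, 0, 0], [200, 50, 100]):
    print(mix, evaluate(mix))
```

Exact fractions are used for Ingredient D so that a requirement met with equality is not rejected because of rounding. The surplus figures answer one of the manager's questions raised in the next paragraph.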

The production manager may want information other than merely values for
each \(x_j\), since the model may be an oversimplified version of the real problem.
For example, he may want to know how much it will cost him to use a nonoptimal
value for a specific \(x_j\), how much excess over each minimal total nutritional requirement
an optimal feed mix provides, how much he would save by reducing the minimal
nutritional requirements, or how much the cost of a new feed component,
say Grain 4, must drop before he seriously considers using this grain in his mix.
These questions fall in the realm of sensitivity analysis.

2.4 FLUID-BLENDING SCHEDULE 

Another variant of mixture models is the economic problem associated with 
blending fluids, such as crude oils, molten metals, and other chemicals, into saleable
intermediate or finished products. Instead of using specific numerical values
as we did in the previous examples, here we make use of a generalized notation. 

Suppose the Twobridge Company, a firm that sells various chemical products,
is planning to prepare amounts of three such blends. Each of the products may be
blended from at least one of two chemical inputs, subject to certain availability
and mixture constraints. Define the unknown activity levels as

\(x_{ij}\) = number of gallons of Chemical i to be used in the blend of Product j,

and we assume \(x_{ij} \ge 0\). Verify that in the example, we have i = 1, 2, and j = 1, 2, 3.

The first pair of constraints limits the amount of input of each of the chemicals:

\[
\begin{aligned}
x_{11} + x_{12} + x_{13} &\le S_1 \\
x_{21} + x_{22} + x_{23} &\le S_2,
\end{aligned} \tag{1}
\]

where \(S_i\) is the amount (in gallons) of Chemical i available during the planning
horizon period. The top constraint in (1) indicates that the sum of the number
of gallons of Chemical 1 used in Products 1, 2, and 3 cannot exceed the amount
\(S_1\) that is available.

The second set of constraints expresses the requirement that production must
be scheduled to at least meet specific minimal levels of customer demand for each
of the products:

\[
\begin{aligned}
x_{11} + x_{21} &\ge D_1 \\
x_{12} + x_{22} &\ge D_2 \\
x_{13} + x_{23} &\ge D_3,
\end{aligned} \tag{2}
\]

where \(D_j\) is the minimal demand for Product j during the planning period. Assume
there is no volumetric loss when the two chemicals are blended. So the first
constraint in (2) requires that amounts of Product 1 to be blended from both
Chemicals 1 and 2 must be at least as large as the minimal level of demand \(D_1\).

The next restriction refers to a technological property to be satisfied by a mixture
yielding Product 1. Suppose each input chemical contains a critical constituent.
Specifically, a gallon of Chemical i has the proportion \(a_i\) of this constituent. The
constraint is that in the blend of Product 1, the proportional content of this constituent
must be at least the fraction \(r_1\):

\[ \frac{a_1 x_{11} + a_2 x_{21}}{x_{11} + x_{21}} \ge r_1. \tag{3} \]

An analogous restriction may hold for Product 2:

\[ \frac{b_1 x_{12} + b_2 x_{22}}{x_{12} + x_{22}} \le r_2; \tag{4} \]

here the fraction \(r_2\) is a maximal limit. Check that after some algebraic manipulation
both (3) and (4) may be rewritten in a standard linear form:

\[
\begin{aligned}
(a_1 - r_1)x_{11} + (a_2 - r_1)x_{21} &\ge 0 \\
(b_1 - r_2)x_{12} + (b_2 - r_2)x_{22} &\le 0.
\end{aligned} \tag{5}
\]
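
One way to carry out the check for the Product 1 restriction is sketched below. It assumes that some of Product 1 is actually blended, so that the total volume \(x_{11} + x_{21}\) is positive and both sides may be multiplied by it without reversing the inequality; a positive minimal demand for Product 1 would guarantee this.

```latex
\begin{aligned}
\frac{a_1 x_{11} + a_2 x_{21}}{x_{11} + x_{21}} \ge r_1
  &\iff a_1 x_{11} + a_2 x_{21} \ge r_1 (x_{11} + x_{21}) \\
  &\iff (a_1 - r_1)\,x_{11} + (a_2 - r_1)\,x_{21} \ge 0 .
\end{aligned}
```

The Product 2 restriction follows in the same way, with the inequality pointing in the opposite direction because the fraction there is a maximal limit.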

Finally, there may be a simple minimal ratio to be observed between the two
chemical inputs in the Product 3 blend:

\[ \frac{x_{13}}{x_{23}} \ge r_3 \quad \text{or} \quad x_{13} - r_3 x_{23} \ge 0. \tag{6} \]

Letting \(p_{ij}\) be the profit associated with a unit of activity \(x_{ij}\), we state
the objective function as

\[ \text{maximize profit} = \text{maximize } (p_{11}x_{11} + p_{12}x_{12} + p_{13}x_{13} + p_{21}x_{21} + p_{22}x_{22} + p_{23}x_{23}). \tag{7} \]

Examine how the model consisting of (1) through (7) is summarized in tabular 
form in Fig. 2.5. (From this point onward, we usually indicate zero coefficients of 
activities by blanks in the technology table.) What questions would a manager want 
answered by this model? 
                                        One Gallon of
                          Chemical 1 in Product       Chemical 2 in Product
Item                        1        2        3         1        2        3      Restriction

Chemical 1 supply           1        1        1                                    ≤ S_1
Chemical 2 supply                                       1        1        1        ≤ S_2
Product 1 demand            1                           1                          ≥ D_1
Product 2 demand                     1                           1                 ≥ D_2
Product 3 demand                              1                           1        ≥ D_3
Product 1 technology     a_1-r_1                     a_2-r_1                       ≥ 0
Product 2 technology              b_1-r_2                     b_2-r_2              ≤ 0
Product 3 technology                          1                         -r_3       ≥ 0

Unit profit ($)            p_11     p_12     p_13      p_21     p_22     p_23      Maximize
Activity level             x_11     x_12     x_13      x_21     x_22     x_23

Figure 2.5. Twobridge Chemical Company Fluid-Blending Schedule.

In reviewing the blending model, notice that the supply and demand restrictions
are similar to the restrictions you saw in the previous two examples. The
technological restrictions are of a somewhat different complexity. It is possible in
a model such as this that the entire set of restrictions does not admit a feasible solution.
In other words, \(S_i\), \(D_j\), and \(r_j\) may be such that no set of values exists for the
\(x_{ij}\) satisfying every constraint. For example, if \(S_i = 5\) and \(D_j = 10\) for all i and j,
then total minimal demand would exceed total available supply.
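
The reasoning behind this example is that adding the two supply constraints bounds the total gallons blended from above, while adding the three demand constraints bounds the same total from below. A minimal sketch of the resulting test follows; the function name is our own, and passing the test is only a necessary condition, since the technological restrictions may still rule out every plan.

```python
def supply_covers_demand(supplies, demands):
    """Necessary (not sufficient) condition for the blending model to be feasible:
    total available supply must be at least total minimal demand."""
    return sum(supplies) >= sum(demands)


# The example in the text: S_i = 5 for both chemicals, D_j = 10 for all products.
print(supply_covers_demand([5, 5], [10, 10, 10]))
```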

* 2.5 ARBITRAGE TRANSACTIONS 

In certain exchange markets such as foreign currencies, stocks and shares, and 
commodities for spot and future delivery, it is possible to transact buying and selling
agreements that yield a profit due to an imbalance in the pertinent exchange
rates. Such a transaction is known as arbitrage. Suppose an arbitrageur named Jay
Canes faces possible trades in six types of market commodities. Assume he may
make, all in all, up to nine separate kinds of transaction agreements. Figure 2.6 
illustrates the details of the situation. 

For activity \(x_1\), a sale of a unit of Market Commodity II yields \(r_{11}\) units of Market
Commodity I. For activity \(x_7\), a sale of a unit of Commodity I yields \(r_{37}\) units of
Commodity III and \(r_{67}\) units of Commodity VI. The other activities are defined
analogously. The \(r_{ij}\), of course, may be fractional-valued. Note that activities
\(x_j\), for j = 1, 2, ..., 5, imply that any commodity may be exchanged into Commodity
I. Also note the sign convention observed in Fig. 2.6. Since you
must distinguish between sales and receipts of each commodity, you can do this by
letting a negative number denote an outflow, or sale, and a positive number denote
an inflow, or receipt.

Figure 2.6. Jay Canes' Arbitrage Transactions.

In the idealization of the market, envisage Canes as making all transactions
simultaneously and instantaneously. The only constraints imposed are those ensuring
that he is not short on any commodity. In other words, for each commodity, his
sales must not exceed his purchases. Verify that in terms of linear constraints, the
restrictions are

\[
\begin{aligned}
r_{11}x_1 + r_{12}x_2 + r_{13}x_3 + r_{14}x_4 + r_{15}x_5 - x_6 - x_7 &\ge 0 \\
-x_1 + r_{26}x_6 + \cdots &\ge 0 \\
-x_2 + r_{37}x_7 + r_{38}x_8 &\ge 0 \\
-x_3 - x_8 + r_{49}x_9 &\ge 0 \\
-x_4 + r_{56}x_6 &\ge 0 \\
-x_5 + r_{67}x_7 - x_9 &\ge 0 \\
x_j &\ge 0 \quad \text{for } j = 1, 2, \ldots, 9.
\end{aligned} \tag{1}
\]

Assume his objective function is to maximize his net proceeds in terms of Market
Commodity I:

\[ \text{maximize } (r_{11}x_1 + r_{12}x_2 + r_{13}x_3 + r_{14}x_4 + r_{15}x_5 - x_6 - x_7). \tag{2} \]

When all the right-hand sides of a set of linear inequalities are zero, as is true
for (1), the system is termed linear homogeneous. Note that \(x_j = 0\) for
all j is a trivial feasible solution but yields zero net proceeds. Suppose there is some
set of nontrivial values for the \(x_j\) that yields positive net proceeds. Then, because
of the homogeneity of (1), \(kx_j\) would also be feasible for all \(k \ge 0\), and the net
proceeds would thereby be changed by a factor k. Thus in this model, if there
exists any feasible solution with positive net proceeds, the value of the objective
function can be made arbitrarily large. In such a situation, we state that there is an
unbounded optimal solution.
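
The scaling argument is easy to watch numerically. The sketch below uses a deliberately tiny market of our own invention, with two commodities and two activities; the rates `R` and `Q` are hypothetical and are not taken from Fig. 2.6. Any plan that is feasible and profitable stays feasible when multiplied by k, and its proceeds grow by the same factor.

```python
from fractions import Fraction

# Hypothetical rates, not taken from Fig. 2.6:
#   activity 1 sells one unit of Commodity II for R units of Commodity I,
#   activity 2 sells one unit of Commodity I for Q units of Commodity II.
R, Q = Fraction(2), Fraction(1)


def holdings(x1, x2):
    """Net position in Commodities I and II; neither may be negative."""
    return (R * x1 - x2, -x1 + Q * x2)


def net_proceeds(x1, x2):
    """Objective: net inflow of Commodity I."""
    return R * x1 - x2


def is_feasible(x1, x2):
    """Nonnegative activity levels and no commodity sold short."""
    return x1 >= 0 and x2 >= 0 and all(h >= 0 for h in holdings(x1, x2))


for k in (1, 10, 1000):
    x1, x2 = k * 1, k * 1          # scale the feasible plan (1, 1) by k
    print(k, is_feasible(x1, x2), net_proceeds(x1, x2))
```

Since k may be chosen as large as you please, no finite upper limit on net proceeds exists in this toy market.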

We would mislead the reader if we purported that the above linear optimization
model is a practical tool for an actual arbitrageur. In real commodity markets,
a trader must be able to spot imbalances in rates very quickly since economic 
market forces correct such situations rapidly. Consequently, our primary purpose 
for citing this illustration is to show how a problem with an unbounded optimal 
solution can arise reasonably. Such a model is of independent interest to economists 
for gaining insights into the fundamental properties of multicommodity exchange 
markets. 

2.6 DYNAMIC PLANNING: INTEGRATED
PRODUCTION PLANNING EXAMPLE 

Each of the examples you have investigated so far can be expanded into a planning
problem extending over several time periods. To illustrate, in the Product-Mix
Selection Model, if raw material and manpower availabilities as well as unit
profits change over time, and if it is possible to store excess raw materials and
finished goods from one period to the next, then the optimization problem is
truly dynamic. It does not factor completely into separate period-by-period optimization
problems. Similarly, in the Feed-Mix Selection Model, if the farmer
permits mild relaxations of the nutritional requirements in one planning period 
on condition that compensations occur in subsequent periods, and if the grain 
costs vary over time, a dynamic planning problem results. Similar generalizations 
are possible in the blending and arbitrage models. 

Common to all models in this category is that current decisions have their effects 
both presently and in subsequent periods. Consequently, the important economic 
tradeoffs are not only those between activities within a single time period, but 
also those between activities in different time periods. Typically, such tradeoffs 
become significant when the decision-maker can invest, expand capacity, or train 
personnel, with the effect of creating profit-making or cost-saving potentialities 
in future periods. 

Qualitative analysis. Suppose the Great Auto Works Company is
committed to supplying the amounts \(S_t\), for t = 1, 2, ..., T, over an incipient
planning horizon of T time periods. The activities in any period are to

• Utilize available workers to produce items for current and future demands, 

• Build inventory to meet subsequent requirements, 

• Increase or decrease the work force, or 

• Leave idle some of the work force, while awaiting increased production in 
future periods. 

Leaving some of the work force idle may be economical, provided the costs of 
firing and subsequently hiring workers exceed the cost of keeping workers idle for 
a limited amount of time. (In a real situation, the level of idleness may manifest 
itself in a slow-down or decrease in efficiency.) The linear technology of this 
problem can be summarized by showing the effects of the activities in Period t 
upon the constraining relations for Period t and subsequent periods. The time
interrelations can be exhibited in a variety of ways. Experience and ingenuity with
problems of this sort often enable one to arrive at a formulation giving the interrelations
in terms of Period t and of only a limited number of future periods, rather
than of the entire set of future periods. This idea will be made clear in the present 
example. 

We pause a moment before launching into the detail. This example is the largest 
and most involved you have read so far. Go slowly, and plan to reread the explanation
at least once or twice. It is worth the effort because many of the truly important
applications of linear programming are dynamic models and have a structure 
similar to this example; that is why we chose it. Once you have gotten the hang 
of this problem, you will find it fairly easy to formulate other dynamic models, 
really. 

Time-staged formulation. Figure 2.7 gives the essence of the entire T-period
technology in terms of the interrelations between only Periods t and t + 1.
The nonnegative decision variables in Period t are

\(e_t\) = number of workers currently engaged in production (employment level
activity)

\(s_t\) = inventory of items on hand at the end of Period t in excess of current requirements,
in units of 25 items (stock level activity)

\(x_t\) = increase in number of workers at start of Period t (hiring activity)

\(y_t\) = decrease in number of workers at start of Period t (firing activity)

\(d_t\) = number of workers currently idle, in units of 10 workers.

Within each period there are two constraints to be observed, these being, in general
terms, that (1) the sales requirement \(S_t\) is to be met, and (2) the production level is
to be limited by the level of the work force.

Figure 2.7. G.A.W. Company Two-Period Interrelations.

Getting into the detail, observe that there are two ways of meeting the sales
requirement \(S_t\), namely, through the output of workers currently engaged in
production and from inventories entering Period t. Current production and entering
inventories in excess of \(S_t\) become inventory at the end of Period t. From the \(e_t\)
column of Fig. 2.7, you see that each employed worker produces four items during
the period. The value of inventory \(s_t\) is measured in units of 25 items. Thus, if
\(s_t = 1\), 25 units remain at the end of Period t, or if \(s_t = 2/5\), only 10 units remain at
the end of Period t. The fact that the coefficient of \(s_t\) in the Period t + 1 sales
relation is 20 implies that spoilage of 5 units occurs from one period to the next
whenever 25 items are inventoried. Note the sign convention employed for
the technological coefficients. A positive number in a sales-requirement relation
implies the inflow of items to meet the requirement, and a negative number implies
an outflow of items that is in addition to the sales requirement.

The second set of restrictions applies to the work force. The inflow of work 
force entering Period t from the previous period may be augmented by hiring, and 
is then allocated as an outflow to the activities of producing, firing, and leaving 
workers idle. A sign convention is adopted to distinguish between the inflow of 
work force and the outflow or uses of it. For example, the coefficient of \(e_t\) in the
work force restriction of Period t is +1 to denote a use of labor; the coefficient is
-1 in the Period t + 1 restriction to signify an inflow of labor from Period t. The
work force entering Period t + 1 consists of those persons engaged in production 
and those left idle during Period t. But note that the technology of the idle activity 
indicates that for every 10 men left idle in Period t, only 9 are available for possible 
work in Period t + 1. Presumably the reason for this shrinkage effect is that 1 out 
of 10 men left idle decides to leave the firm. 

With the information in Fig. 2.7, you now may construct in a straightforward 
manner the full technology of the model, analogous to the technologies exhibited
for earlier models. The entire structure appears in Fig. 2.8. The constant \(W_1\)
represents the level of the work force at the start of Period 1, before any Period 1
increase or decrease in the work force takes place. Let \(W_1 = 0\) if the firm has just
started business. A positive value for \(W_1\) may be viewed as an exogenously given
supply of work force, that is, a supply determined independent of the optimization
process. If the G. A. W. Company has made other exogenous decisions to
increase the work force by the amounts \(W_t\), for t = 2, 3, ..., T, such positive
numbers will appear in the work force equations instead of the 0 constants on the
right-hand side. Exogenous decreases would be handled analogously using negative
values for \(W_t\).

For expository purposes, we list a few specific relations that are embodied in
Fig. 2.8:

Period 1 sales requirement:        \(S_1 + 25s_1 = 4e_1\)
Period 1 work force restriction:   \(e_1 + 10d_1 = W_1 + x_1 - y_1\)
Period 3 sales requirement:        \(S_3 + 25s_3 = 20s_2 + 4e_3\)
Period 3 work force restriction:   \(e_3 + 10d_3 = e_2 + 9d_2 + x_3 - y_3\)
Period T sales requirement:        \(S_T + 25s_T = 20s_{T-1} + 4e_T\)
Period T work force restriction:   \(e_T + 10d_T = e_{T-1} + 9d_{T-1} + x_T - y_T\).

Figure 2.8. G.A.W. Company Dynamic Planning Example.

The above relations express the idea that “Total uses of resources or products must
equal their total supply,” or “Total amount flowing in must equal total amount flowing out.”
For example, the Period 1 sales requirement states that the amount to be supplied
in Period 1 (\(S_1\)), plus the inventory at the end of Period 1 (\(25s_1\)), must equal total
production (\(4e_1\)) in Period 1. Similarly, the Period 3 sales requirement states
that the amount to be supplied in Period 3 (\(S_3\)), plus the inventory at the end of
Period 3 (\(25s_3\)), must equal inventory entering from Period 2 (\(20s_2\)), plus total
production (\(4e_3\)) in Period 3. As a final illustration, the Period 3 work force
restriction states that the number of workers employed in Period 3 (\(e_3\)), plus the
number left idle (\(10d_3\)), must equal the number employed in the previous period
(\(e_2\)), plus the number left idle in Period 2 that still remain with the company
(\(9d_2\)), plus the number hired in Period 3 (\(x_3\)), less the number fired in Period 3
(\(y_3\)).
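
To see the two kinds of balance equations in motion, you may trace a small plan through them period by period. The sketch below is an illustration only: the three-period data are hypothetical, and the function names are ours. The coefficients 4, 25, 20, 10, and 9 are those described in the text.

```python
from fractions import Fraction


def ending_inventories(sales, employed):
    """Solve S_t + 25 s_t = 20 s_{t-1} + 4 e_t for s_t, period by period."""
    stock, previous = [], Fraction(0)       # no inventory enters Period 1
    for S, e in zip(sales, employed):
        s = (20 * previous + 4 * e - S) / Fraction(25)
        stock.append(s)
        previous = s
    return stock


def work_force_balanced(initial, employed, hired, fired, idle):
    """Check e_t + 10 d_t = e_{t-1} + 9 d_{t-1} + x_t - y_t in every period."""
    inflow = Fraction(initial)              # W_1 for Period 1
    for e, x, y, d in zip(employed, hired, fired, idle):
        if e + 10 * d != inflow + x - y:
            return False
        inflow = e + 9 * d                  # work force entering the next period
    return True


# Hypothetical three-period plan (all numbers are illustrative only).
sales = [100, 150, 120]
employed = [30, 35, 30]
hired, fired = [10, 5, 0], [0, 0, 0]
idle = [0, 0, Fraction(1, 2)]               # d_t is in units of 10 workers

stock = ending_inventories(sales, employed)
print("ending inventories s_t:", stock)
print("sales requirements met:", all(s >= 0 for s in stock))
print("work force balanced:", work_force_balanced(20, employed, hired, fired, idle))
```

A negative inventory in any period would signal that the plan fails to meet that period's sales requirement, since the inventory variable must be nonnegative.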

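Similarly, the objective function is written as

\[
\begin{aligned}
\text{minimize } [\,&(c_1e_1 + i_1s_1 + h_1x_1 + f_1y_1 + n_1d_1) \\
+\,&(c_2e_2 + i_2s_2 + h_2x_2 + f_2y_2 + n_2d_2) \\
+\,&(c_3e_3 + i_3s_3 + h_3x_3 + f_3y_3 + n_3d_3) \\
+\,&\cdots \\
+\,&(c_Te_T + i_Ts_T + h_Tx_T + f_Ty_T + n_Td_T)\,].
\end{aligned} \tag{1}
\]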
As you can see, the above is a clumsy expression to write. In building your own
models you certainly would not want to use such a drawn-out way of expressing
the idea that the objective function consists of the sum of each period’s costs,
starting with Period 1 and ending with Period T. Therefore, the lengthy expression
can be abbreviated by employing the standard summation notation Σ (sigma)
that you often find in statistics texts. Specifically, (1) can be rewritten as

\[ \text{minimize } \sum_{t=1}^{T} (c_te_t + i_ts_t + h_tx_t + f_ty_t + n_td_t). \tag{2} \]
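
If summation notation is unfamiliar, it may help to see that it is nothing more than a loop over the periods. In the sketch below the plan and the unit costs are hypothetical numbers chosen for easy arithmetic; each period is a tuple in the order (e, s, x, y, d), matched with unit costs in the order (c, i, h, f, n).

```python
def total_cost(plan, unit_costs):
    """Sum over t of c_t e_t + i_t s_t + h_t x_t + f_t y_t + n_t d_t."""
    return sum(
        sum(cost * level for cost, level in zip(costs_t, levels_t))
        for levels_t, costs_t in zip(plan, unit_costs)
    )


# Hypothetical two-period plan: (e_t, s_t, x_t, y_t, d_t) for t = 1, 2.
plan = [(30, 1, 10, 0, 0), (35, 0, 5, 0, 0)]
# Hypothetical unit costs (c_t, i_t, h_t, f_t, n_t), the same in both periods.
unit_costs = [(100, 20, 50, 80, 900)] * 2

print(total_cost(plan, unit_costs))
```

The inner sum is the cost of one period; the outer sum plays the role of Σ.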

From Fig. 2.8, you should be able to see what complexities arise in determining 
a dynamic plan. For example, a decision to increase the number of workers engaged
in production during a period, of course, has an immediate impact on the
costs of that period. But such a decision also may influence the number of workers
required subsequently if the plan provides for inventories to be left over at the end
of the period. Similarly, an exogenous increase in the sales requirement of a period 
may bring about a revision in the work force plan for several preceding periods. 
As a third illustration, a personnel policy that decreases the attrition arising when 
workers are idle can have a complicated overall effect in reducing the need to 
hire new workers. In what ways do you think a manager would find such a planning model 
useful? 


^ At the start of this section we mentioned there are alternative, but, of course, mathematically
equivalent, ways of viewing such a model. We illustrate this point by transforming
the relations in Fig. 2.8 into another meaningful equivalent representation.
You proceed in the following manner. Add Row 1 to Row 3 to form a new Row 3'. Add 
the new Row 3' to Row 5. Add the new Row 5' to Row 7, and so forth. Similarly add 
Row 2 to Row 4. Add the new Row 4' to Row 6. Add the new Row 6' to Row 8, and 
so forth. 

The results of these operations appear in Fig. 2.9. The legitimacy of the transformations
follows immediately from the rule, “Adding equals to equals yields equals.” All you
have done is to add one equation to another to yield a composite equation.

The sales restrictions are now stated in terms of cumulative sales. In any Period t*,
total sales to date, plus spoilage losses, plus inventory at the end of Period t* equals
accumulated production:

\[ \sum_{t=1}^{t^*} S_t + 5\sum_{t=1}^{t^*-1} s_t + 25s_{t^*} = 4\sum_{t=1}^{t^*} e_t \qquad \text{Period } t^* \text{ sales requirement, Row } (2t^* - 1)'. \tag{i} \]

Similarly, the work force restriction in Period t* appears as the initial work force,
plus accumulated hirings, equals accumulated firings, plus accumulated shrinkages due
to workers left idle, plus men currently engaged in production, and men currently left
idle:

\[ W_1 + \sum_{t=1}^{t^*} x_t = \sum_{t=1}^{t^*} y_t + \sum_{t=1}^{t^*-1} d_t + e_{t^*} + 10d_{t^*} \qquad \text{Period } t^* \text{ work force restriction, Row } (2t^*)'. \tag{ii} \]


Test your facility in making transformations of this sort by revising Fig. 2.9 for the
case in which no inventory spoilage occurs (therefore the −20 in Fig. 2.7 is changed to
−25) and no shrinkage in the number of workers left idle occurs (therefore the −9
in Fig. 2.7 is changed to −10). Then give a corresponding verbal interpretation of the
revised equations (i) and (ii). ◄

Figure 2.9. G.A.W. Company Alternative Formulation.

2.7 PRODUCT ALLOCATION THROUGH 
A TRANSPORTATION NETWORK 

Many important linear programming models have what is termed a network
structure, which yields a very simple kind of technology table. We provide a
typical example below, and subsequently explain why it is equivalent to a network
problem. But first let us mention what this special technology looks like, so you
will know it when you see it.

For any activity variable in a network structure, the entire set of constraints
contains no more than two nonzero entries, these coefficients being either +1
or −1, depending on the sign convention adopted. When there are two coefficients
for an activity, then one is +1 and the other −1. When the technology of the
model has this form, it is usually possible to interpret the optimization problem
in terms of routing the flow of a single commodity through a network. Sometimes
it is necessary to manipulate the equations of the model in order to establish the
network structure of the problem. In Chap. 6 you will study network models in
detail.
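The column test just described is easy to mechanize. The sketch below is only an illustration: the function name `has_network_structure` and the small 2-plant, 2-warehouse table are assumptions made here, not part of the text.

```python
def has_network_structure(matrix):
    """Check the column rule stated above: each column (activity) has at most
    two nonzero entries, each +1 or -1, and a column with two nonzero
    entries has one +1 and one -1."""
    if not matrix:
        return True
    for col in range(len(matrix[0])):
        nonzero = [row[col] for row in matrix if row[col] != 0]
        if len(nonzero) > 2:
            return False
        if any(value not in (1, -1) for value in nonzero):
            return False
        if len(nonzero) == 2 and sum(nonzero) != 0:
            return False
    return True


# Hypothetical technology table; columns are x11, x12, x21, x22.
example = [
    [1, 1, 0, 0],    # Plant 1
    [0, 0, 1, 1],    # Plant 2
    [-1, 0, -1, 0],  # Warehouse 1
    [0, -1, 0, -1],  # Warehouse 2
]

# A table whose first column has three nonzero entries fails the rule.
not_network = [[1, 2], [1, 0], [1, 0]]

print(has_network_structure(example))      # True
print(has_network_structure(not_network))  # False
```

Passing this test only shows that the table already has the network form; as the text notes, some models need their equations manipulated first.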

In case you forgot. We mentioned in the preceding section that employing
the Σ (sigma) summation symbol considerably reduces the amount of writing
to express lengthy expressions. You will see this claim borne out again in the model
below. If you have never taken a statistics course, or did so long ago, you may
need a brush-up on the rules for using the Σ symbol. By means of examples, we
present such a review next. (If you don't need this review, skip directly to the
model.)

1. Consider the summation

$c_1 x_1 + c_2 x_2 + c_3 x_3 + c_4 x_4 + \cdots + c_n x_n$.

In Σ notation, this sum is written as

$\sum_{j=1}^{n} c_j x_j$,

where the so-called index of summation, in this case j, may be any conveniently
chosen letter. Any other index would serve as well, such as $\sum_{k=1}^{n} c_k x_k$.

2. The expression

$c_1 x_1 + d_1 y_1 + c_2 x_2 + d_2 y_2 + \cdots + c_n x_n + d_n y_n$

can be written in several ways, two of which are

$\sum_{t=1}^{n} (c_t x_t + d_t y_t)$ and $\sum_{t=1}^{n} c_t x_t + \sum_{t=1}^{n} d_t y_t$.

3. Suppose that all the $c_j$ in the above expressions are equal; for example,
suppose every $c_j = 5$. Then

$5x_1 + 5x_2 + 5x_3 + \cdots + 5x_n$

can be written simply as

$5 \sum_{j=1}^{n} x_j$.

4. Consider the expression

$a_{i1} x_1 + a_{i2} x_2 + a_{i3} x_3 + \cdots + a_{in} x_n$.

Here the coefficient of each $x_j$ is $a_{ij}$, where i is a fixed, but unspecified,
number. This expression is written as

$\sum_{j=1}^{n} a_{ij} x_j$.

5. Suppose the summation involves two subscripts:

$c_{11} x_{11} + c_{12} x_{12} + c_{21} x_{21} + c_{22} x_{22} + c_{31} x_{31} + c_{32} x_{32}$.

Notice that the first subscript takes the values 1, 2, and 3, and the second
subscript takes the values 1 and 2; also, every combination of values for the
first and second subscripts appears. This so-called double summation can be
abbreviated as

$\sum_{i=1}^{3} \sum_{j=1}^{2} c_{ij} x_{ij}$.

The above examples cover almost all the applications of the Σ notation we use in
this book.
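If you find it easier to check these rules numerically, the following sketch evaluates several of them with Python's built-in `sum`. All of the numbers are made up for illustration, and Python lists are indexed from 0 rather than 1.

```python
# Hypothetical data for n = 3.
c = [2, 4, 6]
x = [1, 3, 5]
d = [7, 8, 9]
y = [1, 0, 2]
n = len(c)

# Rule 1: a single summation of c_j * x_j; the index name is arbitrary.
single = sum(c[j] * x[j] for j in range(n))

# Rule 2: one sum of paired terms equals the sum of two separate sums.
joint = sum(c[t] * x[t] + d[t] * y[t] for t in range(n))
split = sum(c[t] * x[t] for t in range(n)) + sum(d[t] * y[t] for t in range(n))

# Rule 3: a common coefficient (here 5) factors out of the summation.
factored = 5 * sum(x)
term_by_term = sum(5 * x_j for x_j in x)

# Rule 5: a double summation over i = 1..3 and j = 1..2.
c2 = [[1, 2], [3, 4], [5, 6]]
x2 = [[1, 1], [2, 2], [3, 3]]
double = sum(c2[i][j] * x2[i][j] for i in range(3) for j in range(2))

print(single, joint, split, factored, term_by_term, double)
```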
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Example. A large dairy firm, the Saur Milk Company, has m plants located 
throughout a state. Daily milk production at Plant i can supply at most $S_i$ gallons,
for i = 1, 2, ..., m. Each day the firm must furnish its n distributing warehouses
with at least $D_j$ fresh gallons, for j = 1, 2, ..., n, to meet demand requirements.
The economic problem is to designate which plants are to furnish which
warehouses so that total transportation costs are a minimum.

Let $x_{ij}$ be the number of gallons shipped from Plant i to Warehouse j, and $c_{ij}$
be the associated shipping cost per gallon. A mathematical specification of the
model is

(1) minimize $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to the restrictions

(2) $\sum_{j=1}^{n} x_{ij} \le S_i$ for i = 1, 2, ..., m (supply restrictions)

(3) $\sum_{i=1}^{m} x_{ij} \ge D_j$ for j = 1, 2, ..., n (demand requirements)

(4) each $x_{ij} \ge 0$.

Give verbal interpretations of (2) and (3), and write out in full (1), (2), and (3)
for m = 3 and n = 4.

Figure 2.10 is a convenient tabular summary of the situation. If
$\sum_{i=1}^{m} S_i \ge \sum_{j=1}^{n} D_j$, so that total supply is at least as large as total demand, it is always
possible to find a feasible transportation schedule in which no more than (m + n − 1)
routes are utilized. As shown in Chap. 6, an optimal solution also exists in which no
more than (m + n − 1) routes are employed.



►To construct one such feasible schedule with at most (m + n − 1) routes, start at
the upper-left corner, or, as it sometimes is called, the northwest corner, and allocate the
available production $S_1$ to each demand requirement, beginning with $D_1$, until the
production is exhausted. Then continue the process with $S_2$, and so forth. Figure 2.11
illustrates the procedure with an example. Needless to say, this trial routing may be,
and often is, far from optimal. ◄

Figure 2.11. Initial Feasible Solution (warehouse demands and plant supplies).
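The northwest-corner procedure can be sketched in a few lines. The function name `northwest_corner` and the supply and demand figures below are assumptions chosen for illustration; they are not the data of Fig. 2.11.

```python
def northwest_corner(supplies, demands):
    """Build a feasible (not necessarily optimal) shipping schedule.

    Returns a dict mapping (plant, warehouse), numbered from 1, to the
    amount shipped. Assumes total supply is at least total demand.
    """
    if sum(supplies) < sum(demands):
        raise ValueError("total supply must be at least total demand")
    supply = list(supplies)   # remaining production at each plant
    demand = list(demands)    # unmet requirement at each warehouse
    shipments = {}
    i = j = 0
    while i < len(supply) and j < len(demand):
        quantity = min(supply[i], demand[j])
        if quantity > 0:
            shipments[(i + 1, j + 1)] = quantity
        supply[i] -= quantity
        demand[j] -= quantity
        if demand[j] == 0:
            j += 1            # warehouse satisfied: move right
        else:
            i += 1            # plant exhausted: move down
    return shipments


# Hypothetical example with m = 3 plants and n = 4 warehouses.
schedule = northwest_corner([30, 40, 20], [20, 25, 15, 25])
for route, gallons in schedule.items():
    print(route, gallons)
```

Each pass through the loop moves either one plant down or one warehouse to the right, so at most (m + n − 1) routes can receive a positive shipment. The procedure ignores the costs $c_{ij}$ entirely, which is why the resulting schedule can be far from optimal.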

Network equivalence. Suppose we multiply by −1 each warehouse
demand-requirement relation in (3). Multiplication by a negative number reverses
the sense of the inequalities. Then the technology can be written as in Fig. 2.12.

Figure 2.12. Transportation Network Example.

Observe that in the column for each $x_{ij}$ there are only two nonzero coefficients,
+1 and −1, the former indicating an output at Plant i and the latter an input
at Warehouse j. You can visualize the plants and warehouses as a set of points in
space or, using network terminology, as a set of nodes. Each $x_{ij}$ variable then
corresponds to a flow along a directed link or arc between the Nodes i and j, and
$c_{ij}$ corresponds to the associated cost of a unit of flow. We draw the network for this
example as Fig. 2.13. To complete the network specification, we show each plant's
supply and each warehouse's demand requirement. As a network flow problem,
the task is to allocate the amounts at the positive nodes along the various arcs so
as to meet the negative nodes' requirements at a minimum cost. In management
science literature, this model is often referred to as the Hitchcock-Koopmans
Transportation Problem.

Figure 2.13. Transportation Network Schematic (plants and warehouses).

It is a remarkable property of the network problem, described by (1) through
(4), that if the model has a feasible solution, an optimal solution exists for which
all the $x_{ij}$ have integer values (provided $S_i$ and $D_j$ are integers). We elaborate on
this result in Chap. 6. Network models involving the occurrence of gains and losses
along the arcs generally do not possess this property of integer optimality.

* 2.8 ROUTE SELECTION THROUGH A NETWORK 

Here we consider a significant network problem, discussed in greater detail in
Chaps. 6 and 10. Examine the network depicted in Fig. 2.14. Observe that it is
possible to go from any Node i, for i = 1, 2, 3, 4, to any other Node j, provided
that j > i. The route can be either direct or indirect by way of intermediate nodes
if j − i > 1. Thus a route from Node 1 to Node 5 can be direct, across the arc
(1, 5); or indirect, by way of (1, 3) and then (3, 5); or by (1, 3), then (3, 4), and
finally (4, 5); and so forth.

Figure 2.14. Shortest-route Model.

Assume each arc (i, j) has an associated cost $c_{ij}$ for traversing it. There is no
particular relationship between the value of $c_{ij}$ and the pictorial representation of
the length of the arc (i, j) in Fig. 2.14. In a particular application, for example,
$c_{15}$ could be either larger or smaller than $c_{13} + c_{35}$. Ordinarily $c_{ij} \ge 0$, but this
assumption need not be made here.

One illustration of how such a network can arise is the following. Suppose a
trucking company needs to lease a particular amount of transportation equipment
over a four-year period. The company can meet its requirements by leasing a
new piece of equipment at the beginning of Year 1 and keeping it until the
beginning of Year 5. The associated cost $c_{15}$ consists of the rental fee plus the expected
running and maintenance costs of this equipment for the four years. Another
alternative is to lease a new piece of equipment at the beginning of Year 1 and keep
it until the beginning of Year 3, then lease another new piece and keep it until the
beginning of Year 5. The associated cost of the latter policy is $c_{13} + c_{35}$, where
again these costs include rental fees plus running and maintenance costs. In most
instances $c_{15} \ne c_{13} + c_{35}$. Obviously, there are other alternatives to be investigated
in pursuit of the minimum-cost leasing policy. In this particular context, the
network model of Fig. 2.14 is an example of a dynamic planning problem.

The specific questions to be answered for the network in Fig. 2.14 pertain to 
finding a best route from Node 1 to Node 5. We pose the questions in two distinct, 
albeit closely related, versions: 

1. What is the cost of a least-cost route from Node 1 to Node 5? 

2. What is the route of a least-cost route from Node 1 to Node 5?

It is intuitive that in answering one of these questions you concomitantly provide
the answer for the other question. However, we will proceed by setting up a linear
optimization model to solve each question separately and then analyze the
connection between these two models. We start with question 1.

Cost of a least-cost route. Define for i = 1, 2, 3, 4,

$y_i$ = cost of a least-cost route from Node i to Node 5.

Then you can find $y_i$ by calculating for each Node j (where j > i) the cost of
proceeding directly to Node j, and then continuing optimally from Node j to Node 5.
The smallest of all these costs is the value for $y_i$. This approach can be stated
mathematically as

(1) $y_i = \min_{j > i} \, [c_{ij} + y_j]$ for i = 1, 2, 3, 4, and $y_5 \equiv 0$.

In particular, for i = 1 you have

(2) $y_1 = \min \, [c_{12} + y_2, \; c_{13} + y_3, \; c_{14} + y_4, \; c_{15}]$.

Thus you can find the cost of a least-cost route from Node 1 to Node 5 if you know
the cost of a least-cost route from Node 2 to Node 5, Node 3 to Node 5, and Node
4 to Node 5; for then it is simply a matter of comparing the cost of going directly
from Node 1 to Node 5 with the cost of getting from Node 1 to each intermediate
node and taking the best route from that node to Node 5.
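Relation (1) can be evaluated directly by working backward from Node 5. The sketch below is one way to do so; the function names and the arc costs are hypothetical and do not come from Fig. 2.14, and the code assumes every arc (i, j) with j > i is present.

```python
def least_cost_to_end(costs, last=5):
    """Apply y_i = min over j > i of (c_ij + y_j), with y_last = 0.

    costs maps each arc (i, j), j > i, to its cost c_ij.
    Returns (y, successor), where successor[i] is a best next node from i.
    """
    y = {last: 0}
    successor = {}
    for i in range(last - 1, 0, -1):          # Nodes 4, 3, 2, 1
        best_j = min(range(i + 1, last + 1),
                     key=lambda j: costs[(i, j)] + y[j])
        y[i] = costs[(i, best_j)] + y[best_j]
        successor[i] = best_j
    return y, successor


def route_from(successor, start, last=5):
    """Follow the recorded best successors from start to the last node."""
    route = [start]
    while route[-1] != last:
        route.append(successor[route[-1]])
    return route


# Hypothetical arc costs for the five-node network.
costs = {
    (1, 2): 4, (1, 3): 6, (1, 4): 10, (1, 5): 15,
    (2, 3): 10, (2, 4): 5, (2, 5): 10,
    (3, 4): 2, (3, 5): 7,
    (4, 5): 4,
}

y, successor = least_cost_to_end(costs)
print(y[1], route_from(successor, 1))
```

With these assumed costs the recursion gives $y_4 = 4$, $y_3 = 6$, $y_2 = 9$, and $y_1 = 12$, attained along the route 1, 3, 4, 5. Recording the minimizing j at each node answers question 2 at the same time as question 1.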

The calculation in (2) can be expressed in terms of a linear optimization model as

(3) maximize $1y_1$

subject to

(4) $1y_1 \le c_{12} + 1y_2$
    $1y_1 \le c_{13} + 1y_3$
    $1y_1 \le c_{14} + 1y_4$
    $1y_1 \le c_{15}$.

The equivalence of (2) to (3) and (4) can be argued as follows. The value of
$y_1$ can be no larger than the value of each $c_{1j} + y_j$ for j = 2, 3, 4, 5, by the
definition of a minimum in (2). The constraints (4) express this fact. However, $y_1$
actually must be equal to at least one of the values $c_{1j} + y_j$, for j = 2, 3, 4, 5, and
this property is ensured by the maximization in (3). Explain why.

Of course, you do not know the values of $y_i$, for i = 2, 3, 4, and if you limit
yourself to questions 1 and 2 above, you have no particular interest in these values.
But you can treat each $y_i$, for i = 2, 3, 4, as an unknown in (4) provided you apply
relation (1) to yield the appropriate constraints:

(5) $1y_2 \le c_{23} + 1y_3$
    $1y_2 \le c_{24} + 1y_4$
    $1y_2 \le c_{25}$          for i = 2

(6) $1y_3 \le c_{34} + 1y_4$
    $1y_3 \le c_{35}$          for i = 3

(7) $1y_4 \le c_{45}$          for i = 4.

It is permissible but not necessary to strengthen (7) to an equality. For the sake
of symmetry, we leave (7) as an inequality. Question 1 is answered by solving the
model (3) through (7). Note that you do not constrain the $y_i$ to be nonnegative. If
all $c_{ij} \ge 0$, you could add nonnegativity conditions without loss, but even then
you need not state them explicitly. The linear optimization model is summarized
in Fig. 2.15.

► Observe that although the optimal value of $y_1$ in the solution of (3) through (7) does
indeed provide the optimal cost from Node 1 to Node 5, the other values of $y_i$, for i =
2, 3, 4, may not turn out to be the optimal cost from Node i to Node 5. This arises because
(5), (6), and (7) do not fully embody the minimization operation indicated by (1).
For example, suppose $c_{15} = 1$ and all other $c_{ij} > 1$. Then $y_1 = 1$ and $y_i = 0$, i = 2, 3, 4,
solves (3) through (7). But $y_i = 0$, for i = 2, 3, 4, is not the cost of a least-cost route
from Node i to Node 5. However, you always obtain all optimal $y_i$ if you maximize
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Does solving this model answer question 2 as well? Yes. To avoid a lengthy 
description of the process for finding an optimal route having the minimal cost,
we explain the approach using a simplified example. As you verified above, in an
optimal solution at least one of the constraints in (4) will be satisfied as an equality.
Assume there is a single such constraint, and that it is the one involving $c_{13}$ and $y_3$. Next
examine (6), the constraints for $y_3$. Again assume there is only one of these restrictions
that is satisfied with an equality, and that it is the one involving $c_{35}$. Then the unique
optimal route is along arcs (1, 3) and (3, 5).

It may puzzle you that although the problem to be analyzed involves a network,
the technology exhibited in Fig. 2.15 seems inconsistent with the network problem as
described at the start of Sec. 2.7. In particular, there are more than two nonzero entries
in each of the columns. Look at the table more carefully. Do you observe any
indication that the analysis above seems to have dealt with a transposed network technology?
In pursuit of this idea, we turn now to a direct solution of question 2.

Route of a least-cost route. Solving the route problem directly
requires only a straightforward extension of the ideas used in the transportation
network model of Sec. 2.7. As before, let

$x_{ij}$ = amount of flow from Node i to Node j for i = 1, 2, 3, 4 and j > i.

Impose the restriction $x_{ij} \ge 0$. To find the best route, make a unit of flow
available at Node 1 and force it to flow out to the other nodes:

(8) $1x_{12} + 1x_{13} + 1x_{14} + 1x_{15} = 1$.

Then ensure that any amount of flow entering Nodes 2, 3, and 4 must flow out to
a higher numbered node:

(9) $-1x_{12} + 1x_{23} + 1x_{24} + 1x_{25} = 0$ (Node 2)
    $-1x_{13} - 1x_{23} + 1x_{34} + 1x_{35} = 0$ (Node 3)
    $-1x_{14} - 1x_{24} - 1x_{34} + 1x_{45} = 0$ (Node 4).

Give a verbal interpretation of the Node 3 constraint.

You also could add a constraint stating that all the flow entering Node 5 must
equal 1, but this relation actually is implied by (8) and (9), as can be seen by
summing all these constraints. Therefore such a relation is redundant.
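The flow-balance relations (8) and (9) can be checked mechanically for any proposed flow. In the sketch below, the helper `node_balance` and the particular flow (one unit along arcs (1, 3) and (3, 5)) are illustrative assumptions.

```python
# All arcs (i, j) with i = 1..4 and j > i, j <= 5.
arcs = [(i, j) for i in range(1, 5) for j in range(i + 1, 6)]


def node_balance(flow, node):
    """Flow leaving the node minus flow entering it."""
    leaving = sum(v for (i, j), v in flow.items() if i == node)
    entering = sum(v for (i, j), v in flow.items() if j == node)
    return leaving - entering


# Hypothetical flow: one unit routed 1 -> 3 -> 5.
flow = {arc: 0 for arc in arcs}
flow[(1, 3)] = 1
flow[(3, 5)] = 1

balances = {node: node_balance(flow, node) for node in range(1, 6)}
print(balances)
```

Relation (8) requires a balance of 1 at Node 1 and relations (9) require a balance of 0 at Nodes 2, 3, and 4. Because every arc leaves one node and enters another, the five balances always sum to zero, so the balance at Node 5 is forced to be −1; that is, one unit arrives there. This is the redundancy noted in the text.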


  y1    y2    y3    y4

   1    -1                 ≤ c12
   1          -1           ≤ c13
   1                -1     ≤ c14
   1                       ≤ c15
         1    -1           ≤ c23
         1          -1     ≤ c24
         1                 ≤ c25
               1    -1     ≤ c34
               1           ≤ c35
                     1     ≤ c45

   1     0     0     0     Maximize

Figure 2.15. Least-cost Analysis.
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The objective function is 

minimize $c_{12} x_{12} + c_{13} x_{13} + c_{14} x_{14} + c_{15} x_{15}$
         $+ \; c_{23} x_{23} + c_{24} x_{24} + c_{25} x_{25}$
         $+ \; c_{34} x_{34} + c_{35} x_{35}$
         $+ \; c_{45} x_{45}$,

which can be succinctly written with the Σ notation as a double summation:

(10) minimize $\sum_{i=1}^{4} \sum_{j=i+1}^{5} c_{ij} x_{ij}$.

Thus the linear optimization model involves finding nonnegative values for $x_{ij}$
satisfying (8), (9), and (10). The technology of the problem is shown in Fig. 2.16.
Now it is evident that this model satisfies the conditions for a network: for any
variable, the entire set of constraints contains no more than two nonzero entries,
these being either +1 or −1; when there are two coefficients for a variable, one is
+1 and the other is −1. Consequently, among the optimal solutions to (8), (9),
and (10) there is at least one that is integer-valued. In this instance, the integer
property results in each $x_{ij}$ being either 0 or 1. Explain why. It follows that if
(8), (9), and (10) have a unique solution, then that solution must be
integer-valued. For ease in exposition assume this uniqueness condition. Suppose that the
optimal solution is $x_{13} = x_{35} = 1$ and all other $x_{ij} = 0$. Then the optimal route
is along arcs (1, 3) and (3, 5).

Does solving this model yield the answer to question 1? Of course. In the
illustration, the cost of the least-cost route is simply $c_{13} + c_{35}$.

Duality. By this point you will have discovered the interconnection between 
the two models exhibited in Figs. 2.15 and 2.16. The rows of Fig. 2.15 are the 
columns of Fig. 2.16, but the sense of optimization is opposite in the two models. 


Nodes   x12  x13  x14  x15  x23  x24  x25  x34  x35  x45

  1      1    1    1    1                                  = 1
  2     -1                   1    1    1                   = 0
  3          -1             -1              1    1         = 0
  4               -1             -1        -1         1    = 0

        c12  c13  c14  c15  c23  c24  c25  c34  c35  c45   Minimize

Figure 2.16. Best-route Analysis.


*The inquisitive reader should try to show that if all $c_{ij} \ge 0$, then (8) and (9) can be
written as (≥) relations without loss, in which case the sense of the inequalities also is
opposite in the two models. As we remarked previously, in this instance you can add
without loss the restriction $y_i \ge 0$ in the first model.
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The transposition phenomenon in Figs. 2.15 and 2.16 is referred to as duality. 
You will study it again in Chap. 5. It suffices to say here that every linear 
optimization model has a corresponding dual problem and, as you will verify 
later, solving one of the problems automatically furnishes a solution to its dual. 
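The transposition can be confirmed by building the two tables independently and comparing them. The sketch below constructs the rows of Fig. 2.15 from the cost constraints and the rows of Fig. 2.16 from the flow-balance relations; the variable names are choices made here for illustration.

```python
# Arcs (i, j) in the column order of Fig. 2.16: x12, x13, ..., x45.
arcs = [(i, j) for i in range(1, 5) for j in range(i + 1, 6)]
nodes = [1, 2, 3, 4]

# Fig. 2.15: one row per constraint y_i - y_j <= c_ij (y_5 = 0 has no column).
cost_rows = []
for (i, j) in arcs:
    row = [0, 0, 0, 0]
    row[i - 1] = 1
    if j <= 4:
        row[j - 1] = -1
    cost_rows.append(row)

# Fig. 2.16: one row per node; +1 for arcs leaving it, -1 for arcs entering it.
flow_rows = [
    [1 if i == k else -1 if j == k else 0 for (i, j) in arcs]
    for k in nodes
]

# Transposing the flow table should reproduce the cost table.
transposed = [list(column) for column in zip(*flow_rows)]
print(transposed == cost_rows)

# The right-hand side of (8) and (9) is the objective row of Fig. 2.15.
flow_rhs = [1, 0, 0, 0]
cost_objective = [1, 0, 0, 0]
print(flow_rhs == cost_objective)
```

The arc costs play the mirror-image role: they are the right-hand sides in Fig. 2.15 and the objective coefficients in Fig. 2.16.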

2.9 INDUSTRIAL IMPORTANCE OF LINEAR 
OPTIMIZATION MODELS 

The beginning of this chapter briefly discussed actual industrial applications 
of linear programming. Now that you have studied several small-scale models, 
you are ready to proceed with this topic in more detail. 

We start again with the example of a major integrated oil company. This
industry's use of linear optimization models deserves special attention for two
important reasons. First, oil companies throughout the world have the best overall
record of success in early and continued application of linear programming. Their
experience amply demonstrates that it is practical and profitable to use
mathematical models for planning purposes. Second, the oil companies, encouraged by
their initial success, have pioneered the application of linear optimization methods
to a wide variety of decision areas, and thus have demonstrated techniques for
making this scientific approach workable in a competitive business environment.

Look back at Fig. 2.1 (p. 34) to review how the flow of material starts with
the activity of pumping crude from the earth and ends with the activity of
marketing gasoline at service stations. Linear programming models have been formulated
and tested to aid in decision-making at every major point in this stream.
Specifically, models have been developed to:

• Schedule production from a series of underground oil reservoirs so as to
maximize profit, subject to equipment-capacity limitations and constraints
imposed by physical pumping phenomena;

• Determine the net profitability of exchanging a proprietary crude for another
company's crude, given the configuration of refineries and the associated
economics of processing the exchanged crude;

• Calculate the incremental cost of increasing the amount of a product for a
spot sale (for example, manufacturing a specified amount of jet fuel for a
government contract), given the targeted quantities of other products to be
manufactured;

• Plan weekly minimum-cost schedules for refinery unit operations and
product-blending, taking into account crude availabilities, throughput constraints on
the refining units, performance characteristics of each product (such as octane
rating), and the pre-established shipping requirements for the products;

• Establish the return on investment of a proposed new refinery unit, realizing
its full impact on existing units;

• Route products from several refineries to a number of marketing areas along
least-cost transportation paths, recognizing factors of differential costs of
manufactured products at the separate refineries, relative shipping charges, and
seasonal variations in customer demand;

• Construct an annual plan to integrate the major decisions of the entire
company.

By no means have we exhausted all the linear programming applications, or 
even all the important ones within an oil company. But the list is long enough to 
convey the wide applicability of the approach. 

Since each oil company has its peculiar characteristics, such as refinery location
and age configuration, marketing districts, crude-oil reserves, and so forth, firms
differ in their use of linear programming models. Some find it convenient to have
one or two comprehensive models that can be repeatedly employed to make
several of the analyses described above. Other companies have constructed separate
models for each specific purpose, with differing degrees of complexity and detail.

Certain patterns of use are well established, however, and these can guide you
to an understanding of how linear programming is presently being employed in the
oil industry. Many refineries calculate optimal operating schedules on a weekly
or a monthly basis. A number of oil companies apply a linear programming
analysis whenever they are considering a major agreement for the exchange of crude
or other products. More and more, oil firms are periodically analyzing their
distribution patterns to discover transportation cost savings and profit potentials
for new or expanded markets. The leading firms are using linear optimization
models to test different strategies for long-term growth (e.g., five years). In the
dozen largest oil companies in the world, you will typically find 25 to 35 people
(and often two or three times that many) whose prime responsibility is to apply
linear programming to the analysis of important decisions.

Similar trends are developing in other major industries, including chemicals,
iron and steel, aluminum, wood products, food processing, and banking. The
applications include production planning, allocating customer orders to different
manufacturing plants, establishing cash flow requirements, scheduling work
force levels and composition, purchasing raw materials, and deciding whether
to make or buy a product. Some firms use linear programming as a
performance-control device to establish variable cost budgets for a plant with significantly
fluctuating output levels, and to compare actual cost performance with
scientifically determined standards. Frequently, a linear optimization analysis is
undertaken as a special study. For example, companies have used linear programming to
establish the number and location of warehouses and new plants, the profitability
of a particular line of business, and the appropriate capacity dimensions of a new
piece of heavy equipment.

Improving profit. It is no easy task, of course, to apply a linear
optimization model to a real business problem; use of the model must be justified on the
basis of its potential, direct or indirect, contribution to the company's profits.
The principal costs incurred by utilizing this approach are associated with the
tasks below.

1. Establishing the Appropriate Framework of Analysis. It is a rare manager who
immediately recognizes that solving his decision-making problem is tantamount
to solving a linear programming model. Even the executive thoroughly trained in
the techniques of linear optimization models finds it difficult to conclude at first
glance that such methods will pay off. The reason is simply that a certain amount
of skillful trial and error is usually required to determine whether the essence of
a complex decision problem can be captured in a linear model. To illustrate, the
analyst must consider thoroughly the proper selection of a criterion function to be
optimized. In some instances he may test multiple criteria, that is, apply
several different (perhaps incommensurable) objective functions, to see if they
point toward significantly different strategies. He must also exercise care in
depicting the relevant constraints to be imposed on the optimization. The dangers here
are twofold: he must not overlook any significant constraining factor, but on the
other hand, he must not impose so many restrictions that optimization becomes
impossible. The latter danger has proved the more difficult to avoid. In fact,
model formulation entails about a third of the effort required in applying linear
programming.

2. Obtaining the Required Data. Applications of linear optimization to actual
planning problems typically involve 100 to 200 constraints, with two to three
times as many unknowns. And models with 500 to 1000 constraints are steadily
becoming more common. These large-scale applications arise when a plan is to
integrate several manufacturing plants, each represented by a medium-scale model,
and when a firm wants to develop a long-term program of growth. Obtaining the
necessary data for a model is by far the largest of all the tasks listed here. It can
easily represent half the work required.

3. Calculating and Analyzing the Trial Answer. Once the model has been
constructed with actual data, it is necessary to calculate an optimal result. Without
high-speed electronic computers, it would be virtually impossible to solve realistic
industrial linear programming models. However, in minutes, these devices perform
operations that would consume the equivalent of decades or even centuries of
manual computing. The cost of computing a linear programming solution is now
the smallest portion of the entire expense involved.

4. Testing the Validity and Sensitivity of the Answer, Modifying the Trial Model as
Necessary. Any mathematical model of a management decision problem is
inherently an abstraction. Consequently, there is no guarantee that the first trial model
will produce an answer that is practical, given the myriad considerations not
explicitly taken into account in the analysis. The solution must be examined
carefully for its soundness. Further, the answer must undergo sensitivity tests to
determine whether it depends critically on the exact values of particular data. If it does,
the analyst may need to improve the accuracy of his data before he can rely on the
validity of the model's result. Chapter 5 discusses sensitivity analysis in detail.
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By studying the examples in this chapter and answering the exercises below, 
you will gain considerable experience in translating the essence of a complex 
situation into the form of a linear optimization model. In the chapters to follow, 
you will be introduced to additional examples that will highlight certain problems 
with special structures. 

From here on. In the next three chapters you will learn the fundamentals
of linear optimization models. Chapter 3 summarizes the various algebraic
techniques that assist in translating a verbal formulation of a linear program to an
equivalent mathematical representation. The chapter also illustrates geometrical
interpretations of a linear optimization model. Chapter 4 explains the most widely
employed technique for solving a linear program. This material will not only
deepen your knowledge of linear programming per se, but will also help you better
understand many notions that are central to optimization methods. In Chap.
5 the discussion extends to an analysis of solution sensitivity to input data.

Chapters 6 and 7 focus on models of the network type. They discuss how the
special structure of a network problem can be exploited both for deriving an
understanding of the nature of these applications and for computing optimal solutions
efficiently. The reason for an interest in special computational methods for network
problems is plain. Recall the transportation network example in Sec. 2.7. In a
real application the number of plants could easily be 25 and the number of
warehouses 100. As a result there would be 2500 variables in the model. It would
consume an excessive amount of computer time to handle such a problem by one of
the methods in Chap. 4 instead of by a network approach.

Chapters 8 through 12 turn to optimization models that encompass dynamic
phenomena and discuss fruitful points of view for analyzing these problems.
In certain simple yet important situations, you can exploit the dynamic structure
so as to be able to relax the requirements for linearity without seriously increasing
the difficulty of finding a solution.

Chapter 13 deals with models in which the divisibility axiom of Sec. 2.2 is dropped.
Only integer values for certain activity levels are permitted. The most general
nonlinear optimization models studied in this book are treated in Chaps. 14 and
15.

Starting in Chap. 16, we introduce probabilistic uncertainty. Frequently,
random variation in the structural specification of a model entails only an obvious
extension of the deterministic analysis given in the preceding chapters. But in other
situations, probabilistic elements present new considerations requiring careful
analysis.


REVIEW EXERCISES


1 Four study-guide questions are posed at the end of Sec. 2.1. Discuss the answers to
these questions in the context of each example treated in the chapter:
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(a) Knox Mix Company (Sec. 2.2). 

(b) Hion Hog Farm Company (Sec. 2.3). 

(c) Twobridge Chemical Company (Sec. 2.4).

*(d) Jay Canes' Arbitrage Problem (Sec. 2.5).

(e) Great Auto Works Company (Sec. 2.6). 

(f) Saur Milk Company (Sec. 2.7). 

2 One reason that the models treated in this chapter are approximations to reality is 
that the axioms of divisibility and additivity rarely hold exactly. How good do you 
think these approximations are when applied to situations such as the 

(a) Knox Mix Company (Sec. 2.2). 

(b) Hion Hog Farm Company (Sec. 2.3). 

(c) Twobridge Chemical Company (Sec. 2.4). 

*(d) Jay Canes' Arbitrage Problem (Sec. 2.5).

(e) Great Auto Works Company (Sec. 2.6). 

(f) Saur Milk Company (Sec. 2.7). 

3 Knox Mix Company (Sec. 2.2). Using a diagram like Fig. 1.3 for the One-Potato, 
Two-Potato Problem discussed in Chap. 1, draw the feasible production policies, 
indicate an optimal policy, and calculate the associated value of the objective function 
assuming you can utilize only 

(a) Process 1 and Process 3 (so that x_2 = x_4 = 0).

(b) Process 1 and Process 4 (so that x_2 = x_3 = 0).

(c) Process 2 and Process 3 (so that x_1 = x_4 = 0).

(d) Process 2 and Process 4 (so that x_1 = x_3 = 0).

4 Knox Mix Company (Sec. 2.2).

(a) Suppose you let x_2 = x_3 = x_4 = 3. What is the best value for x_1?

(b) Suppose you let x_1 = x_2 = x_3 = 3. What is the best value for x_4?

(c) Suppose you let x_1 = x_3 and x_2 = x_4 = 0. What is the best value for x_1 (and x_3)?

(d) Suppose you let x_2 = x_4 and x_1 = x_3 = 0. What is the best value for x_2 (and x_4)?

(e) Suppose you let x_1 = x_2 = x_3 = x_4. What is the best value for x_1 (and each other
x_j)?

5 Hion Hog Company (Sec. 2.3). Using a diagram like Fig. 1.3 for the One-Potato,
Two-Potato Problem discussed in Chap. 1, draw the feasible purchase policies, indicate
an optimal policy, and calculate the associated value of the objective function
assuming you can utilize

(a) Grain 1 and Grain 2 but not Grain 3 (so that x_3 = 0).

(b) Grain 1 and Grain 3 but not Grain 2 (so that x_2 = 0).

(c) Grain 2 and Grain 3 but not Grain 1 (so that x_1 = 0).

(d) Grain 1 and Grain 2, after setting x_3 = 100 for Grain 3.

(e) Grain 1 and Grain 3, after setting x_2 = 50 for Grain 2.

(f) Grain 2 and Grain 3, after setting x_1 = 200 for Grain 1.

6 Hion Hog Company (Sec. 2.3).

(a) Suppose you let x_2 = x_3 = 50. What is the best value for x_1?

(b) Suppose you let x_1 = x_2 = 200. What is the best value for x_3?

(c) Suppose you let x_1 = x_2 and x_3 = 0. What is the best value for x_1 (and x_2)?

(d) Suppose you let x_1 = x_3 and x_2 = 0. What is the best value for x_1 (and x_3)?

(e) Suppose you let x_2 = x_3 and x_1 = 0. What is the best value for x_2 (and x_3)?

(f) Suppose you let x_1 = x_2 = x_3. What is the best value for x_1 (and x_2 and x_3)?

7 Hion Hog Company (Sec. 2.3). Given that the optimal solution is x_1 = 200, x_2 = 50, and
x_3 = 100, can the minimal total requirement for any of the ingredients be raised
without also raising total cost? Justify your answer.

8 Twobridge Chemical Company (Sec. 2.4). Let S_1 = S_2 = 100 and D_1 = D_2 = D_3 = 50.
Assume r_3 = 1. Either indicate a feasible solution or demonstrate that no feasible
solution exists if

(a) r_1 = .1, r_2 = .6, a_1 = .3, a_2 = .4, b_1 = .2, and b_2 = .5,

(b) r_1 = .5, r_2 = .6, a_1 and a_2 [illegible in source], b_1 = .2, and b_2 = .5,

(c) r_1 = .1, r_2 = .1, a_1 = .3, a_2 = .4, b_1 = .2, and b_2 = .5,

(d) r_1 = .1, r_2 = .6, a_1 = .5, a_2 [illegible in source], b_1 = .2, and b_2 = .5,

(e) r_1 = .1, r_2 = .6, a_1 = .3, a_2 = .4, b_1 = .8, and b_2 = .4.

*9 Jay Canes' Arbitrage Problem (Sec. 2.5). Demonstrate that there exists an unbounded
optimal solution if 

(a) Til = .75 and r 26 = 2. 

(b) rii = 2 and r 2 ^ = .75. 

(c) rnT 26 > 1. 

10 Great Auto Works Company (Sec. 2.6). In what ways do you think a manager would 
find such a planning model useful? 

11 Great Auto Works Company (Sec. 2.6). Write the sales requirement and the work force 
restrictions for 

(a) Period 2. 

(b) Period 4. 

(c) Period T — 1. 

12 Great Auto Works Company (Sec. 2.6). Assume the planning horizon length is T = 4. 
Let W_1 = 0, and S_1 = 100, S_2 = 200, S_3 = 100, and S_4 = 160. To keep the computations
simple, assume there is no spoilage of items inventoried from one period to the
next; hence, the amount 25 should appear instead of 20 in Fig. 2.8. Also assume that 
there is no shrinkage effect in workers left idle; hence, the amount —10 should appear 
instead of —9 in Fig. 2.8. Devise a plan so that 

(a) Every period keeps inventory at a minimum. 

(b) The company hires as few workers as possible. Is there more than one such plan ? 
Explain. 

(c) Answer (a) and (b) for = 160 and ^4 = 240. 

13 Transportation Network (Sec. 2.7). Write all the linear restrictions (2) and (3) and the 
complete objective function (1) for a transportation problem in which 

(a) m = 2 and n = 3.

(b) m = 3 and n = 2. 

(c) m = n = 3. 

(d) m = 3 and n = 4. 
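
As a rough aid for checking answers to exercise 13, the sketch below writes out the rows of a transportation problem of a given size. It assumes a common textbook form: supply restrictions of the "no more than S_i" type, demand restrictions of the "at least D_j" type, and unit costs c_ij. The exact statements of (1), (2), and (3) in Sec. 2.7 are not reproduced here and may differ (for instance, by using equalities), and the function name is hypothetical.

```python
def transportation_model(m, n):
    """Return (objective, supply_rows, demand_rows) as strings.

    Assumed form (not copied from Sec. 2.7): minimize the sum of
    c_ij * x_ij, subject to sum over j of x_ij <= S_i for each source i
    and sum over i of x_ij >= D_j for each destination j, all x_ij >= 0.
    """
    objective = "minimize " + " + ".join(
        f"c{i}{j}*x{i}{j}" for i in range(1, m + 1) for j in range(1, n + 1)
    )
    supply_rows = [
        " + ".join(f"x{i}{j}" for j in range(1, n + 1)) + f" <= S{i}"
        for i in range(1, m + 1)
    ]
    demand_rows = [
        " + ".join(f"x{i}{j}" for i in range(1, m + 1)) + f" >= D{j}"
        for j in range(1, n + 1)
    ]
    return objective, supply_rows, demand_rows


if __name__ == "__main__":
    obj, supply, demand = transportation_model(2, 3)
    print(obj)
    for row in supply + demand:
        print(row)
```

A model with m sources and n destinations has m*n variables, m supply rows, and n demand rows, which is a quick count to compare against a hand-written answer.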

*14 Route Selection Through a Network (Sec. 2.8) 

(a) Find the cost and route of a least-cost route from Node 1 to Node 5 in Fig. 2.14, 
where 


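[Arc-cost table for Fig. 2.14, partly illegible in the source. Legible entries:
c_12 = 2, c_14 = 16, c_15 = 20, c_35 = 9. Two further entries have the values 17 and -5,
but their subscripts, and the remaining costs, cannot be recovered.]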


(b) Given your answer to part (a), show feasible solutions to the linear programming 
problems in Figs. 2.15 and 2.16. 

*15 Route Selection Through a Network (Sec. 2.8). Consider a network like that in Fig. 2.14
augmented by Node 6, which can be reached from each other Node i directly by an 
arc from that node. Draw the network showing all the arcs. State in full a linear 
programming model that obtains the cost of a least-cost route from Node 1 to Node 6. 


16 Explain your understanding of the following terms:

linear programming model
divisibility
additivity
nonnegative variable
variable unrestricted in sign
feasible solution
objective function
optimal solution
*unbounded optimal solution
alternative optimal solutions
multiple criteria
sensitivity analysis
sign convention
time-staged problem
planning horizon
network
*redundant relation
*linear homogeneous system.


FORMULATION EXERCISES 

17 The Tim Burr Company wants to best utilize the wood resources in one of its forest 
regions. Within this region, there is a sawmill and a plywood mill; thus timber can 
be converted to lumber or plywood. 

Producing a marketable mix of 1000 board feet of lumber products requires 1000 
board feet of spruce and 3000 board feet of Douglas fir. Producing 1000 square feet
of plywood requires 2000 board feet of spruce and 4000 board feet of Douglas fir. 
This region has available 32,000 board feet of spruce and 72,000 board feet of 
Douglas fir. 

Sales commitments require that at least 4000 board feet of lumber and 12,000 
square feet of plywood be produced during the planning period. The profit contributions
are $40 per 1000 board feet of lumber products and $60 per 1000 square feet
of plywood. Let L be the amount (in 1000 board feet) of lumber produced and P be
the amount (in 1000 square feet) of plywood produced. 

(a) Express the problem as a linear programming model. 

(b) Using a diagram like Fig. 1.3 for the One-Potato, Two-Potato Problem in Chap. 1,
exhibit the feasible product mixes and indicate an optimal solution.
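
A diagram for a two-variable problem such as exercise 17 can be checked numerically by intersecting the constraint lines in pairs and keeping the intersections that satisfy every restriction. The sketch below does this under one reading of the exercise, writing L for lumber and P for plywood, both in thousands: spruce L + 2P <= 32, Douglas fir 3L + 4P <= 72, L >= 4, and P >= 12. That reading, and all names in the code, are assumptions made for illustration rather than the book's own formulation.

```python
from itertools import combinations

# Each constraint is (a, b, rhs), meaning a*L + b*P <= rhs.
# "At least" requirements are multiplied by -1 to fit this form.
CONSTRAINTS = [
    (1, 2, 32),    # spruce, thousands of board feet (assumed reading)
    (3, 4, 72),    # Douglas fir, thousands of board feet (assumed reading)
    (-1, 0, -4),   # lumber commitment: L >= 4
    (0, -1, -12),  # plywood commitment: P >= 12
]
PROFIT = (40, 60)  # dollars per unit of L and per unit of P


def feasible(point, tol=1e-9):
    """True when the point satisfies every constraint."""
    L, P = point
    return all(a * L + b * P <= rhs + tol for a, b, rhs in CONSTRAINTS)


def vertices():
    """Intersect every pair of constraint lines; keep the feasible points."""
    points = set()
    for (a1, b1, r1), (a2, b2, r2) in combinations(CONSTRAINTS, 2):
        det = a1 * b2 - a2 * b1
        if det == 0:
            continue  # parallel lines never meet
        L = (r1 * b2 - r2 * b1) / det  # Cramer's rule
        P = (a1 * r2 - a2 * r1) / det
        if feasible((L, P)):
            points.add((round(L, 6), round(P, 6)))
    return sorted(points)


def best_vertex():
    """Corner of the feasible region with the largest profit."""
    return max(vertices(), key=lambda v: PROFIT[0] * v[0] + PROFIT[1] * v[1])


if __name__ == "__main__":
    for v in vertices():
        print(v, PROFIT[0] * v[0] + PROFIT[1] * v[1])
```

Enumerating corners in this way is practical only for very small problems, but it mirrors what the diagram shows: when an optimal solution exists for a bounded feasible region, one can be found at a corner.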

18 The Jericho Steel Company must decide how many pounds of pure steel x_1 and how
many pounds of scrap metal x_2 to use in manufacturing an alloy casting for one of its
customers. Assume that the cost per pound of pure steel is 3 and the cost per pound 
of scrap metal is 5 (which is larger because the impurities must be skimmed off). The 
customer’s order is expressed as a demand for at least five pounds and the customer is 
willing to accept a greater amount if Jericho requires a larger production run. 

Assume that the supply of pure steel is limited to four pounds and of scrap metal to 
six pounds. The ratio of scrap to pure steel cannot exceed 7/8. The manufacturing 
facility has only 18 hours of melting and casting time available; a pound of pure steel
requires three hours whereas a pound of scrap requires only two hours to process 
through the facility. 

(a) Express the entire problem as a linear programming model. 

(b) Using a diagram like Fig. 1.3 for the One-Potato, Two-Potato Problem in Chap. 1,
exhibit the feasible mixes of pure steel and scrap metal, and indicate an optimal
solution. 

19 The Flagg-Poole Advertising Company has announced that it can optimally allocate 
its client’s advertising dollars by means of linear programming. The company’s 
approach is to identify the various audiences the client wants addressed—such as 
teenagers, young married couples, the geriatric group, etc. Let the index i represent 
the ith audience. The client must specify a desired level of exposure E_i for each audience
i. Then each advertising vehicle (such as Life Magazine, a prime-time television spot 
commercial, a color ad in a Sunday newspaper, etc.) is scored for its effectiveness 
in each of the identified audience categories. Let the index j represent the jth advertising
vehicle, and the scored effectiveness for the ith audience of allocating a
dollar to the jth vehicle.

Each decision variable is designated as x_j, which represents the total amount of
dollars allocated to the jth advertising vehicle during a promotion campaign. The
client’s objective is to minimize its total advertising expenditure while still meeting 
its desired levels of product exposure. 

(a) Assume that there are three audiences and five advertising vehicles. Write the
linear programming model implied by the above description. 

(b) Discuss whether you think this linear programming model is an appropriate 
approach for choosing an optimal allocation of advertising dollars. 

20 The Best Tasties Corporation makes four different kinds of breakfast cereals: Noisies, 
Soggies, Bursties, and Reposies. Each of these is a composite of ingredients (grains, 
vitamins, sugar, and preservatives). Let the index i represent Ingredient i, where 
i = 1, 2, ..., I. Let a_Ni be the amount of Ingredient i in a pound of Noisies, and
similarly a_Si, a_Bi, and a_Ri be the amounts for the three other cereals. Assume that M_i
is the maximum amount of Ingredient i available during the next month for making
all of these breakfast cereals.

The profit contribution of a pound of Noisies is represented by p_N, and similarly,
the profit contributions for the other cereals are represented by p_S, p_B, and p_R. Let
x_N, x_S, x_B, and x_R represent the number of pounds of each breakfast cereal manufactured
during the next month. At least 100,000 pounds of Noisies must be manufactured,
as well as at least 125,000 pounds of Soggies, 30,000 pounds of Bursties, and
500,000 pounds of Reposies.

Show how an optimal production schedule can be obtained by a linear programming
formulation.

21 The Turned-On Transistor Radio Company manufactures Models A, B, and C, which
have profit contributions of 8, 15, and 25, respectively. The weekly minimum production
requirements are 100 for Model A, 150 for Model B, and 75 for Model C.

Each type of radio requires a certain amount of time for the manufacturing of 
component parts, for assembling, and for packaging. Specifically, a dozen units of 
Model A require three hours for manufacturing, four hours for assembling, and one 
hour for packaging. The corresponding figures for a dozen units of Model B are 
3.5, 5, and 1.5, and for a dozen units of Model C are 5, 8, and 3. During the forthcoming
week, the company has available 150 hours of manufacturing, 200 hours of
assembling, and 60 hours of packaging time. 

Formulate the production scheduling problem as a linear programming model. 

22 The manager of the Boilen Oil Company wishes to find the optimal mix of two
possible blending processes. For Process 1, an input of one unit of Crude Oil A and 
three units of Crude Oil B produces an output of five units of Gasoline X and two 
units of Gasoline Y. For Process 2, an input of four units of Crude Oil A and two units 
of Crude Oil B produces an output of three units of Gasoline X and eight units of 
Gasoline Y. Let x_1 and x_2 be the number of units the company decides to use of
Process 1 and Process 2, respectively. 

The maximum amount of Crude Oil A available is 100 units and of Crude Oil B, 
150 units. Sales commitments require that at least 200 units of Gasoline X and 75 
units of Gasoline Y are produced. The unit profits of Process 1 and Process 2 are p_1
and p_2, respectively.

Formulate the blending problem as a linear programming model. 

23 The High Tail Airfreight Company, which operates out of a central terminal, has 8 
aircraft of Type 1, 15 aircraft of Type 2, and 12 aircraft of Type 3 available for 
today’s flights. The tonnage capacities (in thousands of tons) are 45 for Type 1, 7 
for Type 2, and 4 for Type 3. 

The company dispatches its planes to Cities A and B. Tonnage requirements (in 
thousands of tons) are 20 at City A and 30 at City B; excess tonnage capacity supplied 
to a city has no value. A plane can fly only once during the day. 

The cost of sending a plane from the terminal to each city is given by the following 
table. 


          Type 1   Type 2   Type 3
City A      23        5       1.4
City B      58       10       3.8

Let x_1, x_2, and x_3 denote the number of planes of each type sent to City A, and
similarly, y_1, y_2, and y_3 the number sent to City B.

(a) Formulate a linear programming model of this routing problem. 

(b) Discuss whether an answer so derived would represent an optimal solution to the 
actual routing problem. 

24 The Glassey-Staire Television Network wants to establish competitive but profitable 
prices for advertising time. The following is a simplified version of its pricing problem. 
Assume there are three classifications of network advertising time: prime-evening, 
weekday, and Saturday/Sunday afternoon (before 6 p.m.). Let p_1, p_2, and p_3 be the
price per minute for each of these time slots, respectively. 

The network sells time to K major advertisers, who accordingly have a significant 
effect on the determination of the prices. Glassey-Staire estimates that it can sell in toto
M_1, M_2, and M_3 minutes of prime, weekday, and weekend time, respectively, to these
major advertisers and other smaller advertisers provided its prices are competitive.
Let the index k denote a major advertiser. Glassey-Staire knows that Advertiser k
wants to purchase a_1k, a_2k, and a_3k minutes in the three different time slots, and
believes that Advertiser k is unwilling to pay more than dollars for all of this time. 
Formulate the pricing problem in terms of a linear programming model. 

25 Trim Problem. A mill of the Fine-Webb Paper Company produces so-called liner
board in jumbo reels having a standard width of 70 inches. (Each reel has a fixed 
length.) The company’s customers, however, order reels having smaller widths (and 
the same fixed length as the larger reel). Today’s orders are for 100 reels of 22-inch 
width, 125 reels of 20-inch width, and 80 reels of 12-inch width. These smaller widths 
are to be cut from the larger standard size reel. 

For example, the company can decide to slit a jumbo into two reels each 22 inches
wide, and one reel 20 inches wide; this leaves 6 inches of trim waste from the 70-inch
jumbo. The company wants to manufacture today’s orders so as to minimize total 
trim waste. 

(a) Find every possible way to slit a 70-inch jumbo reel into combinations of 22-inch, 
20-inch, and 12-inch width reels. One such combination was already illustrated 
in the example. Calculate the trim waste for each combination. Label the combinations
1, 2, 3, ..., and let x_i be the number of jumbo reels cut into Combination i.

(b) Formulate the problem as a linear programming model. (Note that if any smaller 
reels in excess of the customer requirements are produced, these too must be 
counted as waste.) 

(c) Show that the objective function also can be written as 

minimize 70(x_1 + x_2 + ...).
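
The enumeration asked for in part (a) of the trim problem is mechanical enough to sketch in a few lines. The function below lists the ways to slit one jumbo reel into the ordered widths. By default it keeps only combinations whose leftover trim is narrower than the smallest ordered width; whether the exercise intends only such combinations or every feasible one is an interpretation, so the `maximal_only` switch (a hypothetical name, like the function itself) allows both.

```python
def cutting_patterns(jumbo, widths, maximal_only=True):
    """List (counts, trim) pairs for slitting one jumbo reel.

    counts[k] is the number of reels of widths[k] cut from the jumbo, and
    trim is the unused width. With maximal_only=True a pattern is kept
    only when the trim is too narrow for another reel of the smallest width.
    """
    smallest = min(widths)
    patterns = []

    def extend(k, remaining, counts):
        if k == len(widths):
            if any(counts) and (not maximal_only or remaining < smallest):
                patterns.append((tuple(counts), remaining))
            return
        # Try the largest possible number of reels of this width first.
        for c in range(remaining // widths[k], -1, -1):
            extend(k + 1, remaining - c * widths[k], counts + [c])

    extend(0, jumbo, [])
    return patterns


if __name__ == "__main__":
    for counts, trim in cutting_patterns(70, [22, 20, 12]):
        print(counts, "trim =", trim)
```

The same call with the widths and jumbo size of exercises 26 and 27 produces the corresponding lists; each pattern then becomes one column of the linear programming model.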

26 Trim Problem. Answer the questions in exercise 25 assuming that the width of the
jumbo roll is 105 inches, and that today’s orders are for 100 reels of 25-inch width,
125 reels of 30-inch width, and 80 reels of 35-inch width. [Note that the coefficient in 
part (c) is no longer 70.] 

27 Trim Problem. Answer the questions posed in exercise 25 assuming that the width of
the jumbo roll is 200 inches, and that today’s orders are for 100 reels of 30-inch width,
125 reels of 55-inch width, and 80 reels of 65-inch width. [Note that the coefficient 
in part (c) is no longer 70.] 

28 The Fly-by-Night Airline must decide on the amounts of jet fuel to purchase from 
three possible vendors. The airline refuels its aircraft regularly at the four airports it 
serves. 

The oil companies have said that they can furnish up to the following amounts of 
fuel during the coming month: 250,000 gallons for Oil Company 1; 500,000 gallons
for Oil Company 2; and 600,000 gallons for Oil Company 3. The required amount
of jet fuel is 100,000 gallons at Airport 1; 200,000 gallons at Airport 2; 300,000 
gallons at Airport 3; and 400,000 gallons at Airport 4. 

When transportation costs are added to the bid price per gallon supplied, the 
combined cost per gallon for jet fuel from each vendor furnishing a specific airport 
is shown in the table. 


             Airport 1   Airport 2   Airport 3   Airport 4
Company 1       12          10           8          11
Company 2        9          11          11          13
Company 3       10          14          13           9


(a) Formulate the decision problem as a linear programming model. 

(b) Is the model equivalent to a transportation or a network problem? Explain. 


29 The Hayes Manufacturing Company produces a small component for an industrial
product and distributes it to five wholesalers at a fixed delivered price of $2.50 per unit.
Sales forecasts indicate that monthly deliveries will be 3000 units to Wholesaler 1; 
3000 units to Wholesaler 2; 10,000 units to Wholesaler 3; 5000 units to Wholesaler 4; 
and 4000 units to Wholesaler 5. 

The monthly production capacities are 5000 at Plant 1; 10,000 at Plant 2; and 
12,500 at Plant 3. The direct costs of producing each unit are $1.00 at Plant 1, $.90
at Plant 2, and $.80 at Plant 3. 

The transportation costs of shipping a unit from a plant to a wholesaler are given 
below. 



           Whlslr. 1   Whlslr. 2   Whlslr. 3   Whlslr. 4   Whlslr. 5
Plant 1      $.05        $.07        $.10        $.15        $.15
Plant 2       .08         .06         .09         .12         .14
Plant 3       .10         .09         .08         .10         .15


Formulate a linear programming model to indicate optimal production amounts at 
each plant, and to show how many components each plant supplies each wholesaler. 

30 The Kleen City Police Department has the following minimal daily requirements for 
policemen: 


Time of Day                  Minimal Number of Policemen
(24-Hour Clock)    Period    Required During Period
2-6                  1              20
6-10                 2              50
10-14                3              80
14-18                4             100
18-22                5              40
22-2                 6              30

Note, you are to consider Period 1 as following immediately after Period 6. 

Each policeman works eight consecutive hours. Let x_t denote the number of men
starting work in Period t every day. The Police Department seeks a daily manpower
schedule that employs the least number of policemen, provided that each of the
above requirements is met.

Formulate a linear programming model to find an optimal schedule. 
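
Because each period in the table lasts four hours, an eight-hour shift covers the period in which it starts and the one that follows. The sketch below uses that observation to test whether a proposed set of starting numbers meets every requirement. It checks feasibility only and makes no claim that a given schedule is optimal; the function names and the sample schedule are illustrative assumptions.

```python
REQUIRED = [20, 50, 80, 100, 40, 30]  # Periods 1 through 6, from the table


def on_duty(starts):
    """Men on duty in each period, given the number starting in each period.

    The men on duty in Period t are those who started in Period t plus
    those who started one period earlier (Period 6 precedes Period 1).
    """
    n = len(starts)
    # starts[t - 1] wraps around to the last period when t == 0.
    return [starts[t] + starts[t - 1] for t in range(n)]


def is_feasible(starts):
    """True when no start is negative and every requirement is covered."""
    return all(s >= 0 for s in starts) and all(
        duty >= need for duty, need in zip(on_duty(starts), REQUIRED)
    )


if __name__ == "__main__":
    candidate = [20, 30, 50, 50, 0, 30]  # an arbitrary trial schedule
    print(on_duty(candidate), is_feasible(candidate), sum(candidate))
```

Each entry returned by `on_duty` corresponds to the left-hand side of one restriction in the linear programming model, and `sum(starts)` to its objective function.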

31 Around 435 B.C., Sparta decided to draft reserve troops to supplement its regular
army. New warriors could be enlisted for 1, 2, 3, or 4 years. Let x_1t, x_2t, x_3t, and x_4t
be the number of warriors enlisted in Year t for 1, 2, 3, and 4 years, respectively.
The associated unit costs are c_1t, c_2t, c_3t, and c_4t. In each Year t, the minimal total
reserve warrior strength was set at R_t; R_t varied from year to year.

As a Spartan general, you could find an optimal enlistment policy for the ensuing 
20 years by solving the problem as a linear programming model. For simplicity, let 
t = 1 denote the year 435 B.C.

(a) Construct a table like that in Fig. 2.7 to exhibit the time interrelations for
x_1t, x_2t, x_3t, and x_4t over Years t, t + 1, t + 2, and t + 3.

(b) Exhibit the manpower constraints for Years 1, 2, 3, 4, and 5. 

(c) Exhibit the manpower constraints for Years 16, 17, 18, 19, and 20. 

(d) Exhibit the general form of a manpower constraint for Year t, where 6 ≤ t ≤ 15.

32 The Feedem-Speedem Airline Company must decide how many new stewardesses to
hire and train over the next six months. The requirements expressed as the number of 
stewardess-flight-hours needed are 8000 in January; 9000 in February; 8000 in March; 
10,000 in April; 9000 in May; and 12,000 in June. 

It takes one month of training before a stewardess can be put on a regular flight; 
so a girl must be hired at least a month before she is actually needed. Also, a trainee 
requires 100 hours of actual in-flight experience during the month of training. Hence, 
for each trainee, 100 less hours are available for flight service by regular stewardesses. 

Each experienced stewardess can work up to 150 hours in a month, and Feedem- 
Speedem has 60 regular stewardesses available at the beginning of January. If the 
maximum time available from experienced stewardesses exceeds a month’s flying and 
training requirements, the regular girls work fewer than 150 hours, and none are laid 
off. Each month, approximately 10% of the experienced stewardesses quit their jobs 
to be married or for other reasons. 

An experienced stewardess costs the company $800 and a trainee $400 a month in
salary and other benefits. 

(a) Formulate the hiring-and-training problem as a linear programming model.
Let x_t be the number of stewardesses that begin training in Month t, where
x_0 = 60. Define any additional symbols that you need to express the decision
variables. 

(b) The above statement of the problem assumes a six-month planning horizon. 
Suppose you add July’s requirements to the model. Would the previous solution
necessarily change? Explain. 

33 The Holme Appliance Company produces refrigerators, stoves, and dishwashers. 
During the coming year, sales are expected to be the following. 

                         Quarter
Product            1       2       3       4
Refrigerators    2000    1500    3000    1000
Stoves           1500    1500    1000    1500
Dishwashers      1000    3000    1500    3000


The company wants a production schedule that meets each of these quarterly demand 
requirements. Management also has decided that the inventory level for each product 
must be at least 1000 units at the end of each quarter. There is no inventory of any 
product at the start of the first quarter. 

During a quarter only 8000 hours of production time are available. A refrigerator 
requires .5 hour, a stove 2 hours, and a dishwasher 1.5 hours of production time. 
Refrigerators cannot be manufactured in the fourth quarter because the company 
plans to modify tooling for a new product line. 

Assume that each item left in inventory at the end of a quarter incurs a holding or 
storage cost of 5. The company wants a production schedule that does not exceed 
the production-time limitation of each quarter, that meets the quarterly demand and 
inventory requirements, and that keeps the total cost of carrying inventory to a 
minimum. 

Let R_t, S_t, and D_t be the number of refrigerators, stoves, and dishwashers manufactured
in Period t. Define any other symbols that you need to represent the variables.
Show how the problem can be solved by a linear programming model. 

34 Knox Mix Company (Sec. 2.2). Assume that the technological data shown in Fig. 2.3 
are valid for each of four consecutive time periods. Suppose you can store excess (or 
unused) Material Y and Material Z from one period to the next at a holding cost per 
unit per period of h_Y and h_Z, respectively. Formulate the product-mix selection
problem as a dynamic planning problem. Be sure to define new symbols that indicate 
the time period for each production variable, and to specify the storage activities. 

35 Twobridge Chemical Company (Sec. 2.4). Assume that the data in Fig. 2.5 are valid in
each of T consecutive periods, except that the available supplies of chemicals, the 
demands for the products, and the profits of each activity vary from period to period. 
State how you would modify the model to permit excess supplies of Chemicals 1 and 2 
in one period to be inventoried for possible use in succeeding periods, and to allow for 
Products 1, 2, and 3 made in one period to be stored for meeting demands in subsequent
periods. Be sure to define new symbols that indicate the time period for each
production variable and to specify the storage activities. 

36 The Expando Manufacturing Company, which produces a single type of item, wishes 
to enlarge its capacity over the next six periods (a period is three months). The 
company’s objective is to have its production capacity as large as possible by the end 
of the sixth period. 

Each item produced in a period requires d dollars (for the purchase of raw materials 
and the payment of wages), and one unit of plant capacity; it yields r dollars of sales 
revenue at the beginning of the next period. 

In each period, the company can use either or both of two construction techniques 
to expand its plant. Each requires cash in the period the expansion is initiated, and 
one takes more time than the other. Specifically, building a unit of capacity by 
Expansion Method 1 requires b dollars when the construction activity is started and 
yields the added capacity by the beginning of the following period. Building a unit 
of capacity by Expansion Method 2 requires c dollars when the construction activity is 
started and yields the added capacity by the beginning of the period after the following 
one. 

The company has D dollars to finance its production and expansion in Period 1, 
but requires that the subsequent production and expansion activities be self-funding 
(that is, no supply of cash will be forthcoming from outside sources after the first
period). The plant capacity at the start of Period 1 is K.

For each period, /, let denote the production level; Wf and Vi the levels of capacity 
expansion using Methods 1 and 2, respectively; Wt unused cash at the end of the 
period; and Z( unused plant capacity. 

(a) Construct a table like that in Fig. 2.7 to exhibit the time interrelations. 

(b) Show how the problem can be solved by a linear programming formulation. 


MIND-EXPANDING EXERCISES 

37 The Uptight Faucet Company has two factories, one on the East Coast and one on 
the West. Let the index i designate Factory i. To keep the explanation simple, assume 
that both factories produce the same type of faucet. The company wants to plan its 
production schedule for the next two quarters, which are designated as Periods 1 and 2. 
In each period, the production capacity at Factory i is L_i.

The company ships its faucets to two distributing warehouses, designated by the 
index j. The minimal demand requirement to be met by the end of Period t at Warehouse
j is where t = 1 and 2, and j = 1 and 2.

For each Factory i, let 

Xi = amount produced and shipped by the end of the first quarter. 

= amount produced and shipped by the end of the second quarter. 

Si = amount produced and stored at Factory i at the end of the first quarter. 

For each Factory i and Warehouse j, let

Zij = amount shipped from Factory i to Warehouse j by the end of the first quarter.

Vij = amount shipped from Factory i to Warehouse j by the end of the second
quarter. 

For each Warehouse j, let 

wj = amount stored at Warehouse j at the end of the first quarter. 

Define any other symbols you need to represent the variables in the problem. 

The company requires that the sum total of production for Factory 1 in both 
quarters combined be at least 80% of the like quantity for Factory 2. Because of 
limited storage space, no more than Si faucets can be stored at Factory i at the end of 
the first quarter. 

The production cost of a faucet at Factory i is pi. The cost of a faucet stored at the 
end of the first quarter at Factory i is Atj; the cost of a faucet stored at the end of the 
first quarter at Warehouse j is hj. Shipping an item from Factory i to Warehouse j 
incurs a transportation cost of Cij per faucet. Assume that transportation time is 
virtually zero (as compared to the length of the quarter), so that an item shipped 
during a quarter arrives before the quarter is over. 

Show how an optimal production and shipping schedule can be found from a linear 
programming formulation. 

38 The Mighty Steel Corporation has two iron ore reduction plants, which are designated
by the index r. Each of these plants processes iron ore into two different ingot stocks,
which are designated by the index s. The ingot stocks are shipped to any of three
different fabricating plants, which are designated by the index f. Each fabricating
plant makes two products, which are designated by the index p. Over the forthcoming
planning horizon, the company wants to minimize the total tonnage of iron ore 
processed in its reduction plants, subject to production and demand-requirement 
constraints. 

Let the decision variables be 

x = total tonnage of iron ore processed by both reduction plants.

y_sfr = total tonnage of iron ore processed into Ingot Stock s for shipment to Fabricating
Plant f by Reduction Plant r.

z_spf = total tonnage of Ingot Stock s manufactured into Product p at Fabricating
Plant f.

The technological data are

a_sr = tonnage yield of Ingot Stock s from one ton of iron ore processed at Reduction
Plant r.

b_spf = total yield from one ton of Ingot Stock s manufactured into Product p at
Fabricating Plant f.

c_r = maximum tonnage of iron ore that can be processed by Reduction Plant r.

k_f = maximum tonnage of all stocks that can be manufactured into products at
Fabricating Plant f.

D_p = tonnage demand requirement for Product p.

The production and demand requirement constraints include 

(i) The total tonnage of iron ore processed by both reduction plants must equal 
the total tonnage processed into ingot stocks for shipment to the fabricating 
plants by the reduction plants. 


(ii) Total tonnage of iron ore processed by each reduction plant cannot exceed
the plant’s available capacity. 

(iii) Total tonnage of ingot stock manufactured into products at each fabricating 
plant must equal the tonnage of ingot stock shipped to it by the reduction 
plants. 

(iv) Total tonnage of ingot stock manufactured into products at each fabricating 
plant cannot exceed the plant’s available capacity. 

(v) Total tonnage of each product manufactured must equal its demand requirement.

(a) Formulate the problem as a linear programming model. 

(b) Comment on the appropriateness of the objective function. Suggest an alternative 
objective function that might be more appropriate. 

39 The Babylon Corporation finds its internal telephone communication network is 
failing to meet peak-load requirements because of limited transmission capacities. Two 
stations originate calls, and three stations receive the calls; more than one call can be 
sent or received by any station. The calls are transmitted through links and two 
switching nodes. Figure 2.17 illustrates the situation. 



The decision problem is to increase the transmission capacity of the network in an 
economic manner. Increasing the capacities to handle the entire peak-load requirements
may be too expensive, but some improvement in service is required.

Let be the daily peak-load requirements of messages originating at Station i and 
destined for Station j, where i = 1 and 2 andj = 5, 6, and 7. Let Ck) be the present 
link-transmission capacity between Switching Node k and Destination j, where k = 3 
and 4, and j = 5, 6, and 7, Let •S’* be the present transmission capacity for messages 
flowing through Switching Node k. 

Define 

^ik) = the number of calls originating at Station i, destined for.Station j, and passing 
through Switching Node k. 

Ckj = new transmission capacity along the link between Switching Node k and 
Destination j. 

Sk = new transmission capacity through Switching Node k. 

The constraints to be satisfied are 

(i) The total number of calls from each originating station to each destination
cannot exceed the peak-load requirement for such calls. 

(ii) The total number of calls passing along the link between a switching node and 
a destination cannot exceed the corresponding link-transmission capacity. 

(iii) The total number of calls passing through any switching node cannot exceed 
the corresponding transmission capacity. 

(iv) The new link and node capacities must be at least as large as at present. 

The service-level requirement is specified as follows. The routing of calls must
satisfy the restriction that the ratio of the total number of calls that are actually
transmitted to the sum of all the peak-load requirements must be at least f (where f < 1).

The cost of increasing link-transmission capacity by one unit between Switching
Node k and Destination j is $d_{kj}$, and of increasing node-transmission capacity at
Switching Node k by one unit is $d_k$.

Show how a linear programming formulation can be used to find a minimum-cost
plan for increasing link and node capacities, subject to the above-mentioned
constraints.

CHAPTER 3


Algebraic and Geometric 
Representations of Linear 
Optimization Models 


3.1 INTRODUCTION 

Chapter 2 scanned a wide assortment of linear optimization models. For the 
most part, the constraints were written as immediate translations of a verbal 
description. The chapter gave scant notice to the possible connections among differ¬ 
ent formulations, nor did it elaborate on how much flexibility you have in specify¬ 
ing a model. We provide this information here. 

In the initial sections below, you will study how to manipulate linear constraints 
into different-appearing but mathematically equivalent statements. The techniques 
presented are helpful for two reasons. First, they enable you to prepare any 
particular problem you may have for numerical solution by the techniques dis¬ 
cussed in Chaps. 4 and 5. Second, they permit you to formulate the model in a 
way that will subsequently enable you to see how to apply powerful mathematical 
analysis. 

The remainder of the chapter deals with geometric representations of a linear 
optimization model. Of course, two- and three-dimensional geometric construc¬ 
tions are not sufficient for solving actual large-scale applications; however, these 
pictorial interpretations do aid you in developing intuition about the workings 
of a linear programming model. 

In all honesty, though, if you are impatient to learn how to find numerical 
solutions to linear programming models, go ahead and read Chap. 4. You can 
come back later to read the material in this chapter. 

3.2 GENERAL ALGEBRAIC FORMULATION 

We can summarize the mathematical representations of the several models
you studied in Chap. 2 in the following way. Letting $x_j$ be the level of Activity j,
for j = 1, 2, ..., n, you want to select values for the $x_j$ such that

    $p_1 x_1 + p_2 x_2 + \cdots + p_n x_n$

is maximized, or minimized, depending on the context of the problem. The $x_j$
are constrained by a number of relations, each of which is one of the following types:

    $a_1 x_1 + a_2 x_2 + \cdots + a_n x_n \ge a$

    $b_1 x_1 + b_2 x_2 + \cdots + b_n x_n = b$

    $c_1 x_1 + c_2 x_2 + \cdots + c_n x_n \le c$.

The first relation includes the possible restriction $x_j \ge 0$.

Such a constrained optimization problem may have 

(i) no feasible solution, that is, there may be no values of all the $x_j$, for
j = 1, 2, ..., n, that satisfy every constraint;

(ii) a unique optimal feasible solution; 

(iii) more than one optimal feasible solution; or 

(iv) a feasible solution such that the objective function is unbounded; that is, 
the value of the function can be made as large as desired in a maximiza¬ 
tion problem, or as small in a minimization problem, by selecting an 
appropriate feasible solution. (This can happen in the arbitrage model of 
Sec. 2.5.) 

In a few instances, Chap. 2 indicated how there can be more than one explicit
way to represent the linear relations characterizing a linear optimization model. 
For example, in the dairy company model of Sec. 2.7, the sense of the demand- 
requirements inequalities (3) was reversed by changing the sign of all the constants. 

Changing the sense of the optimization. Any linear maximization
model can be viewed as an equivalent linear minimization model, and vice
versa, by accompanying the change in the optimization sense with a change in the
signs of the objective function coefficients. Specifically,

    maximize $\sum_{j=1}^{n} c_j x_j$   can be treated as   minimize $\sum_{j=1}^{n} (-c_j) x_j$

and vice versa. Of course, if V is the optimal value of the right-hand expression
above, then -V is the optimal value of the left-hand expression.

Changing the sense of an inequality. All inequalities in a linear
programming model can be represented with the same directioned inequality since

(1)   $\sum_{j=1}^{n} a_j x_j \le b$   can be written as   $\sum_{j=1}^{n} (-a_j) x_j \ge -b$,

and vice versa. To illustrate, the following two inequalities are equivalent:

    $1x_1 - 1x_2 \le -4$   and   $-1x_1 + 1x_2 \ge 4$.

Converting an inequality to an equality. Any inequality in a linear
model can be represented as an equality by introducing a nonnegative variable
as follows:

(2)   $\sum_{j=1}^{n} a_j x_j \le b$   can be written as   $\sum_{j=1}^{n} a_j x_j + 1s = b$   where $s \ge 0$

(3)   $\sum_{j=1}^{n} a_j x_j \ge b$   can be written as   $\sum_{j=1}^{n} a_j x_j - 1t = b$   where $t \ge 0$.

It is common to refer to a variable such as s in (2) as a slack variable, and
t in (3) as a surplus variable.

To illustrate, in the Product-Mix Selection Example of Sec. 2.2, the constraint
on the availability of pounds of Material Y was

    $7x_1 + 5x_2 + 3x_3 + 2x_4 \le 120$.

An equivalent way of viewing this restriction is

    $7x_1 + 5x_2 + 3x_3 + 2x_4 + 1y = 120$   where $y \ge 0$,

where y is interpreted as left-over or unused pounds of Material Y. Here y is a
slack variable.

Similarly, in the Feed-Mix Selection Example of Sec. 2.3, you can write the
constraint on Nutritional Ingredient A as

    $2x_1 + 3x_2 + 7x_3 - 1a = 1250$   where $a \ge 0$,

where a is viewed as the overfulfillment of the requirement for Ingredient A.
Thus a is a surplus variable.
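
The two conversions can be sketched in a few lines of Python. This is only an illustration: the helper names and the trial activity levels are hypothetical, and the constraint data assume that the Material Y relation is a "no more than" constraint and the Ingredient A relation is an "at least" constraint, as the slack and surplus interpretations suggest.

```python
from fractions import Fraction

def to_equality(coeffs, sense, rhs):
    """Append one column: +1 (slack) for '<=', -1 (surplus) for '>='."""
    if sense == "<=":
        return coeffs + [1], rhs
    if sense == ">=":
        return coeffs + [-1], rhs
    raise ValueError("sense must be '<=' or '>='")

def extra_variable_value(coeffs, sense, rhs, x):
    """Value the slack or surplus variable takes at the activity levels x."""
    lhs = sum(Fraction(a) * Fraction(v) for a, v in zip(coeffs, x))
    return rhs - lhs if sense == "<=" else lhs - rhs

material_y = ([7, 5, 3, 2], "<=", 120)    # pounds of Material Y available
ingredient_a = ([2, 3, 7], ">=", 1250)    # requirement for Ingredient A

eq_y = to_equality(*material_y)      # coefficients gain a +1 column
eq_a = to_equality(*ingredient_a)    # coefficients gain a -1 column

# Hypothetical trial activity levels, chosen only to show the arithmetic.
y = extra_variable_value(*material_y, x=[10, 5, 0, 0])     # unused pounds
a = extra_variable_value(*ingredient_a, x=[100, 200, 100]) # overfulfillment
print(eq_y, eq_a, y, a)
```

A negative value returned by `extra_variable_value` would signal that the trial point violates the original inequality.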


Converting equalities to inequalities. Any linear equality or set
of linear equalities can be represented as a set of like-directioned linear inequalities
by imposing one additional constraint. To motivate the idea, note that x = 8
is equivalent to the combination $x \le 8$ and $x \ge 8$, which in turn can be written
as the pair $x \le 8$ and $-x \le -8$. As you should check graphically, the equations
x = 1 and y = 2 are equivalent to the combination $x \le 1$, $y \le 2$, and $x + y \ge 3$,
which in turn can be expressed as $x \le 1$, $y \le 2$, and $-x - y \le -3$. The idea
generalizes as follows:

(4)   $\sum_{j=1}^{n} a_{ij} x_j = b_i$   for i = 1, 2, ..., m   can be written as

(5)   $\sum_{j=1}^{n} a_{ij} x_j \le b_i$   for i = 1, 2, ..., m   and   $\sum_{j=1}^{n} \alpha_j x_j \le \beta$,

where

(6)   $\alpha_j = -\sum_{i=1}^{m} a_{ij}$   and   $\beta = -\sum_{i=1}^{m} b_i$.

Thus for m = 1, (5) is simply

(7)   $\sum_{j=1}^{n} a_{1j} x_j \le b_1$   and   $-\sum_{j=1}^{n} a_{1j} x_j \le -b_1$.

For an illustration, consider the system

    $1x_1 + 1x_2 = 1$

    $2x_1 - 4x_3 = -5$.

Apply (5) and (6) to verify that this system is equivalent to the system below:

    $1x_1 + 1x_2 \le 1$

    $2x_1 - 4x_3 \le -5$

    $-3x_1 - 1x_2 + 4x_3 \le 4$.
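
The construction in (5) and (6) can be checked mechanically. The sketch below assumes the two-equation illustration reads 1x1 + 1x2 = 1 and 2x1 - 4x3 = -5; the function names and the two test points are hypothetical.

```python
from fractions import Fraction

def equalities_to_inequalities(A, b):
    """Rewrite A x = b as the rows A x <= b plus one extra '<=' row, per (5) and (6)."""
    n = len(A[0])
    alpha = [-sum(row[j] for row in A) for j in range(n)]  # alpha_j = -(sum over i of a_ij)
    beta = -sum(b)                                         # beta = -(sum over i of b_i)
    return A + [alpha], b + [beta]

def satisfies(rows, rhs, x):
    """True when every row holds as a '<=' relation at the point x."""
    return all(sum(a * v for a, v in zip(row, x)) <= r for row, r in zip(rows, rhs))

A = [[1, 1, 0], [2, 0, -4]]
b = [1, -5]
rows, rhs = equalities_to_inequalities(A, b)
print(rows[-1], rhs[-1])   # the added constraint

on_system = [1, 0, Fraction(7, 4)]    # satisfies both original equations
off_system = [0, 0, Fraction(7, 4)]   # satisfies neither original equation
print(satisfies(rows, rhs, on_system), satisfies(rows, rhs, off_system))
```

The second test point meets the first two inequalities but fails the added one, which is exactly the role of the extra constraint.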


*Converting variables unconstrained in sign to nonnegative
variables. When for some j, $x_j$ is unconstrained in sign in a linear model,
two transformations are helpful in finding a numerical solution. The first is to
select a constraining equality containing $x_j$. This can always be done, possibly after
converting an inequality to an equality. Then, using this relation as a definition
for $x_j$, substitute for $x_j$ in all the other linear relations, including the objective
function, and simplify by collecting terms.

For example, if $x_1$ is unconstrained in sign and appears in relation 1, which is
stated as an equality, write

(8)   $x_1 = \frac{1}{a_{11}} \left( b_1 - \sum_{j=2}^{n} a_{1j} x_j \right)$.

Then the right-hand side of (8) is substituted wherever $x_1$ appears in the model.
As is easily seen, the resultant relations are still linear in the remaining variables.

During the optimizing computational process, the definitional relation for $x_j$
may be completely ignored. After a solution for the remaining variables is obtained,
$x_j$ can be computed from its definitional relation.

The second transformation converts $x_j$ into the difference between two
nonnegative variables and substitutes this difference for $x_j$ wherever $x_j$ appears.
Specifically,

(9)   $x_j = x_j' - x_j''$   where $x_j' \ge 0$ and $x_j'' \ge 0$.


Thus this transformation expands the number of variables in the model, and pre¬ 
serves linearity of the original relations. The legitimacy of the substitution of (9) 
requires an argument, but the idea is simple enough and you may want to justify 
it for yourself. 
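
As a small numerical check of the second transformation, the sketch below splits one sign-free column into a pair of nonnegative columns and confirms that the constraint values and the objective value are unchanged. The matrix, the objective coefficients, and the function names are hypothetical illustrations.

```python
def split_free(value):
    """One nonnegative pair (x_plus, x_minus) with value == x_plus - x_minus."""
    return (value, 0) if value >= 0 else (0, -value)

def split_columns(A, c, j):
    """Keep column j for x_j' and append its negative as the column for x_j''."""
    new_A = [row + [-row[j]] for row in A]
    new_c = c + [-c[j]]
    return new_A, new_c

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# Hypothetical data: x2 (index 1) is unconstrained in sign and takes the value -3.
A = [[1, 2], [3, -1]]
c = [4, 5]
x = [2, -3]

new_A, new_c = split_columns(A, c, j=1)
x_plus, x_minus = split_free(x[1])
new_x = [x[0], x_plus, x_minus]          # every entry is now nonnegative

before = ([dot(row, x) for row in A], dot(c, x))
after = ([dot(row, new_x) for row in new_A], dot(new_c, new_x))
print(before, after)
```

The pair returned by `split_free` is only one of many valid choices, since adding the same amount to both parts leaves their difference unchanged.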

► A third transformation, closely allied to the idea used in relations (4), (5), and (6),
is to add only a single new nonnegative variable z and make the conversion

(i)   $x_j = x_j' - z$   where $x_j' \ge 0$ and $z \ge 0$

for each $x_j$ unconstrained in sign.

Suppose $x_j$, for j = 1, 2, ..., k $\le$ n, is unconstrained in sign. Then the constraints

(ii)   $\sum_{j=1}^{n} a_{ij} x_j = b_i$   for i = 1, 2, ..., m

become

(iii)   $\sum_{j=1}^{k} a_{ij} x_j' + \sum_{j=k+1}^{n} a_{ij} x_j + \alpha_i z = b_i$   for i = 1, 2, ..., m,

where

(iv)   $\alpha_i = -\sum_{j=1}^{k} a_{ij}$.

◄


*Converting lower-bounded variables to nonnegative variables.
When an $x_j$ is bounded below by $b_j \ne 0$, it is possible to convert the model
into one with a nonnegative variable (that is, where $b_j$ is replaced with 0) by
employing the relation

(10)   $x_j = b_j + x_j'$   where $x_j' \ge 0$.

Thus the right-hand side of (10) is substituted for $x_j$ wherever $x_j$ appears.
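
The effect of substitution (10) on the rest of the model is simple bookkeeping: each right-hand side is reduced by the bound times the variable's coefficient, and the objective picks up a constant. A minimal sketch, with hypothetical data and a hypothetical function name:

```python
def shift_lower_bound(A, rhs, c, j, bound):
    """Substitute x_j = bound + x_j' and return the new right-hand sides
    together with the constant that the substitution adds to the objective."""
    new_rhs = [r - row[j] * bound for row, r in zip(A, rhs)]
    objective_constant = c[j] * bound
    return new_rhs, objective_constant

# Hypothetical model: rows of A paired with rhs, objective c, and x1 bounded below by 4.
A = [[2, 1], [1, 3]]
rhs = [10, 15]
c = [3, 4]

new_rhs, objective_constant = shift_lower_bound(A, rhs, c, j=0, bound=4)
print(new_rhs, objective_constant)
```

The coefficient matrix itself is unchanged; only the constants move, which is why the sense of each relation is preserved.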

3.3 CANONICAL FORMS FOR LINEAR 
OPTIMIZATION MODELS 

As you will see in later chapters, it is convenient to be able to write any linear 
optimization model in a compact and unambiguous form. The various trans¬ 
formations above allow you to meet this objective, although it is now apparent that 
there is considerable latitude in the selection of a particular canonical form to 
employ. We illustrate two such representations here. 

Any linear optimization model can be viewed as

(1)   maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(2)   $\sum_{j=1}^{n} a_{ij} x_j \le b_i$   for i = 1, 2, ..., m

(3)   $x_j \ge 0$   for j = 1, 2, ..., n.

Similarly, any linear optimization model can be written as

(4)   minimize $\sum_{j=1}^{n} c_j x_j$

subject to

(5)   $\sum_{j=1}^{n} a_{ij} x_j = b_i$   for i = 1, 2, ..., m   ($b_i \ge 0$)

(6)   $x_j \ge 0$   for j = 1, 2, ..., n.

It is typical, although not required, that n > m in (5).


3.4 GEOMETRIC INTERPRETATION 


In undertaking the solution and mathematical analysis of linear optimization
models in Chaps. 4 and 5, you will see that a grasp of the algebraic nature of this
type of model is required. Your comprehension of the algebraic properties may be
enhanced, however, through a geometric interpretation of the ideas. Accordingly,
in order to deepen your understanding of the properties of linear optimization
models, we turn to a description of the geometry of these problems. So that you do
not misconstrue the intent here, we emphasize that a geometric viewpoint usually
affords very little help in obtaining numerical solutions to actual linear
optimization models.

There are two geometric representations of a linear optimization model.
One is called the solution space representation and is treated here. The other is called
the requirements space representation; it is less important for a beginner to
understand and is explained in the advanced Sec. 3.6.

Solution space representation: two dimensions (variables).
You have already seen this representation in the One-Potato, Two-Potato
Problem of Chap. 1. You may want to skim over the example to refresh
your memory. Here we proceed directly to the consideration of a numerical
example:

(1)   maximize $12x_1 + 15x_2$

subject to

(2)   $4x_1 + 3x_2 \le 12$

(3)   $2x_1 + 5x_2 \le 10$

(4)   $x_1 \ge 0$ and $x_2 \ge 0$.

The problem is graphed in Fig. 3.1. Observe that both (2) and (3) are drawn as
equations. Then each inequality is indicated by an arrow on the side of the line
representing permissible values of $x_1$ and $x_2$. Since the two variables must be
nonnegative, the region of permissible values is bounded also by the two coordinate
axes.



Figure 3.1. Solution Space.

Accordingly, the polygon oabc represents the region of values for $x_1$ and $x_2$
that satisfy all the constraints. This polygon is called the solution set. The
set of points described by the polygon is convex, meaning that if any two
points of the polygon are selected arbitrarily, then a straight-line segment joining
the two points contains only points of the polygon. The vertices o, a, b, and c are
referred to as the extreme points of the polygon in that they are not on the
interior of any line segment connecting two distinct points of the polygon.

The parallel lines in the figure represent various values of the objective function.
The arrow points in the direction of increasing values of the objective function.
The optimal solution is at the extreme point b, where $x_1 = 15/7$, $x_2 = 8/7$, and
$12x_1 + 15x_2 = 300/7$.
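
For a two-variable model the extreme points can be listed by brute force: intersect every pair of boundary lines and keep the intersections that satisfy all constraints. The sketch below does this for the example, assuming relations (2) and (3) are 4x1 + 3x2 <= 12 and 2x1 + 5x2 <= 10; the function names are hypothetical, and the approach is meant only as a check on the picture, not as a practical solution method.

```python
from fractions import Fraction
from itertools import combinations

# Each row (a1, a2, b) means a1*x1 + a2*x2 <= b; the last two rows encode x1 >= 0, x2 >= 0.
constraints = [(4, 3, 12), (2, 5, 10), (-1, 0, 0), (0, -1, 0)]

def intersection(r, s):
    """Point where the boundary lines of r and s cross (Cramer's rule), or None if parallel."""
    det = r[0] * s[1] - r[1] * s[0]
    if det == 0:
        return None
    return (Fraction(r[2] * s[1] - r[1] * s[2], det),
            Fraction(r[0] * s[2] - r[2] * s[0], det))

def feasible(p):
    return all(a1 * p[0] + a2 * p[1] <= b for a1, a2, b in constraints)

def objective(p):
    return 12 * p[0] + 15 * p[1]

candidates = set()
for r, s in combinations(constraints, 2):
    p = intersection(r, s)
    if p is not None and feasible(p):
        candidates.add(p)

extreme_points = sorted(candidates)
best = max(extreme_points, key=objective)
for p in extreme_points:
    print(p, objective(p))
```

Exact fractions are used so that the vertex with sevenths as coordinates is reported without rounding.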

Alternative optimal solutions. If the coefficients of the objective
function are changed so as to alter the direction of the parallel lines in Fig. 3.1,
it is clear that the optimal solution point may change, but in any case there is
always an extreme-point optimal solution. Consider, in Fig. 3.2, a problem for one
specific rotation of the parallel lines:

(5)   maximize $4x_1 + 10x_2$

again subject to the same constraints (2), (3), and (4).

Now, all the points (an infinite number) on the segment ab are optimal. Thus
$x_1 = 15/7$ and $x_2 = 8/7$ are still optimal. But so are $x_1 = 0$ and $x_2 = 2$, as well as any
positive-weighted average of these two optimal solutions. The optimal value of the
objective function is 20.
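
The claim about weighted averages is easy to confirm numerically. The following sketch blends the two optimal extreme points with a few weights and evaluates objective (5) at each blend; it assumes the extreme points are (0, 2) and (15/7, 8/7), and the names `blend` and `feasible` are hypothetical.

```python
from fractions import Fraction

a = (Fraction(0), Fraction(2))
b = (Fraction(15, 7), Fraction(8, 7))

def objective(p):
    return 4 * p[0] + 10 * p[1]

def feasible(p):
    return (4 * p[0] + 3 * p[1] <= 12 and 2 * p[0] + 5 * p[1] <= 10
            and p[0] >= 0 and p[1] >= 0)

def blend(w):
    """Weighted average w*a + (1 - w)*b for a weight w between 0 and 1."""
    return tuple(w * u + (1 - w) * v for u, v in zip(a, b))

points = [blend(Fraction(k, 4)) for k in range(5)]   # weights 0, 1/4, 1/2, 3/4, 1
values = [objective(p) for p in points]
print(values, all(feasible(p) for p in points))
```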

Unbounded optimal solutions. The third illustration, pictured in
Fig. 3.3, is based on the model

(6)   maximize $-2x_1 + 6x_2$

subject to

(7)   $-1x_1 - 1x_2 \le -2$

(8)   $-1x_1 + 1x_2 \le 1$

(9)   $x_1 \ge 0$ and $x_2 \ge 0$.

Figure 3.3. Unbounded Optimal Solution.

The solution set for this problem is unbounded. Check that it is convex, and
verify that a and b are the only extreme points. The objective function for the
problem can be made arbitrarily large. Given any value for the objective function,
there always is a solution point having an even greater objective function value.
Such a point will satisfy (8) with equality.
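
One way to see the unboundedness concretely is to walk along the line on which (8) holds with equality. The sketch below assumes the model is maximize -2x1 + 6x2 subject to -x1 - x2 <= -2, -x1 + x2 <= 1, and nonnegativity; the ray parametrization and the function names are hypothetical choices made for illustration.

```python
from fractions import Fraction

def point_on_ray(t):
    """Point (t, t + 1), which satisfies -x1 + x2 = 1 exactly."""
    return (t, t + 1)

def is_feasible(x1, x2):
    return -x1 - x2 <= -2 and -x1 + x2 <= 1 and x1 >= 0 and x2 >= 0

def objective(x1, x2):
    return -2 * x1 + 6 * x2

def point_beating(target):
    """A feasible point whose objective value strictly exceeds the integer target.

    On the ray the objective equals 4t + 6, so any t above (target - 6)/4 works;
    t must also be at least 1/2 for the point to satisfy the first constraint.
    """
    t = max(Fraction(1, 2), Fraction(target - 6, 4) + 1)
    return point_on_ray(t)

for target in (10, 1000, 10**6):
    p = point_beating(target)
    print(target, p, objective(*p), is_feasible(*p))
```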



Infeasible problem. The fourth illustration, shown in Fig. 3.4, is based
on the problem

(10)   maximize $1x_1 + 1x_2$

subject to

(11)   $-1x_1 + 1x_2 \le -1$

(12)   $1x_1 - 1x_2 \le -1$

(13)   $x_1 \ge 0$ and $x_2 \ge 0$.

Figure 3.4. No Optimal Solution.

Figure 3.4 illustrates that the problem does not have a feasible solution. 
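
The same conclusion can be reached algebraically, assuming relations (11) and (12) are both "less than or equal to" constraints as in canonical form (2) of Sec. 3.3. Adding the two relations term by term cancels both variables and leaves a statement that can never hold:

```latex
\begin{aligned}
(-1x_1 + 1x_2) + (1x_1 - 1x_2) &\le (-1) + (-1) \\
0 &\le -2 .
\end{aligned}
```

Since no values of the variables can make this true, the two constraints cannot be satisfied together, with or without the nonnegativity conditions.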

Conclusion. We will summarize the results of the several geometric illus¬ 
trations you have studied. 

(i) If the solution set is nonempty, it is convex and may be either bounded or 
unbounded. 

(ii) If the solution set is nonempty, the optimal value of the objective function
may be finite or unbounded. If finite, then an optimal solution exists at an 
extreme point. 

* 3.5 SOLUTION SPACE REPRESENTATION IN 
HIGHER DIMENSIONS 

We record here the vocabulary of the geometric representation of the solution 
space for models having more than two variables. 

Assume that the linear optimization model is given by

(1)   maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(2)   $\sum_{j=1}^{n} a_{ij} x_j \le b_i$   for i = 1, 2, ..., m

(3)   $x_j \ge 0$   for j = 1, 2, ..., n.

Then consider Euclidean n-space, which is comprised of points with n coordinate
values $(x_1, x_2, \ldots, x_n)$. Since each $x_j \ge 0$, only the nonnegative orthant of Euclidean
n-space need be considered.

Each relation

(4)   $\sum_{j=1}^{n} a_{ij} x_j = b_i$   for i = 1, 2, ..., m

corresponds to a hyperplane in n-space, and cuts this space into two
half-spaces. When the inequality sign $\le$ is restored to (4), the specified
direction indicates which of the two half-spaces contains points satisfying that relation.
The intersection of the corresponding m half-spaces comprises the set of feasible
solutions. When nonempty, this set is convex and polyhedral, the faces being
parts of the hyperplanes (4) and the edges and vertices being intersections of these
hyperplanes. To ease exposition, assume that this convex polyhedral set
is of full dimension n.

The objective function can also be viewed by means of a hyperplane in the
solution space. More precisely, the set of all points in Euclidean n-space for which
the linear objective function has some given value is a hyperplane. Further, these
hyperplanes, one for each distinct value of the objective function, are parallel to
each other. It follows that a point strictly interior to the solution set cannot be an
optimal solution, for there exist neighboring points in the set lying on hyperplanes
having improved values for the objective function. Therefore, if the objective
function value of an optimal solution is finite, an optimal solution must exist at an
extreme point of the solution polyhedron. Further, if two or more extreme points
are optimal, then so are all the points on the edges or faces connecting them.

* 3.6 REQUIREMENTS SPACE REPRESENTATION 

We now investigate an alternative to the solution space representation of a 
linear optimization problem. You can picture a model in terms of the requirements
space, which has dimension equal to the number of constraints (excluding
nonnegativity conditions). The set of coefficients of each activity variable $x_j$, as well as
the constants on the right-hand side of the constraints, are given a vector
interpretation. The linear technology implies that each vector j is scaled by the amount
$x_j \ge 0$, and the resultant vectors are then added. Consider the entire collection of
points that can be formed from every possible sum of the vectors for the $x_j$,
including sums in which the vectors have been scaled by a nonnegative factor. This set
comprises what is called a convex polyhedral cone in the
requirements space.

A feasible solution exists, provided the vector corresponding to the right-hand 
side does not lie outside the cone. The optimization objective may be stated as 
finding a best sum of vectors that lies within the cone and equals the right-hand- 
side vector. These geometric ideas will now be illustrated with several of the 
preceding examples. 

Finite optimal solution. In Fig. 3.5 the requirements space is drawn
for the model

(1)   maximize $12x_1 + 15x_2$

subject to

(2)   $4x_1 + 3x_2 \le 12$

(3)   $2x_1 + 5x_2 \le 10$

(4)   $x_1 \ge 0$ and $x_2 \ge 0$,

which you analyzed at the start of Sec. 3.4.

Figure 3.5. Requirements Space.

To represent fully the possible linear programming activities, add the slack
variables, which become vectors lying along the coordinate axes. Thus, in
algebraic form, the constraints of the problem to be illustrated correspond to

    $4x_1 + 3x_2 + 1x_3 = 12$

(5)   $2x_1 + 5x_2 + 1x_4 = 10$

    $x_j \ge 0$   for j = 1, 2, 3, 4.

With the slack variables added, the requirements space cone consists of the
entire nonnegative quadrant. The solution, as indicated by Fig. 3.1, is to add a
multiple $x_1 = 15/7$ of the vector (4, 2) and a multiple $x_2 = 8/7$ of the vector (3, 5)
to yield (12, 10):

(6)   $\frac{15}{7} \begin{pmatrix} 4 \\ 2 \end{pmatrix} + \frac{8}{7} \begin{pmatrix} 3 \\ 5 \end{pmatrix} = \begin{pmatrix} 12 \\ 10 \end{pmatrix}$.

From Fig. 3.5 it is clear that you could also combine a positive multiple of (1, 0)
with (3, 5) or with (0, 1), and a positive multiple of (4, 2) with (0, 1) to yield
(12, 10). Check that it is not possible to add positive multiples of (1, 0) and (4, 2)
to give (12, 10).
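
The pairwise combinations mentioned above can be verified by solving a two-by-two system for each pair of vectors. The sketch below assumes the four activity vectors are (4, 2), (3, 5), (1, 0), and (0, 1) and the requirement vector is (12, 10); the labels and function name are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

vectors = {"x1": (4, 2), "x2": (3, 5), "x3": (1, 0), "x4": (0, 1)}
requirement = (12, 10)

def multipliers(u, v, target):
    """Solve s*u + t*v = target by Cramer's rule; None if u and v are parallel."""
    det = u[0] * v[1] - v[0] * u[1]
    if det == 0:
        return None
    s = Fraction(target[0] * v[1] - v[0] * target[1], det)
    t = Fraction(u[0] * target[1] - target[0] * u[1], det)
    return s, t

results = {}
for (name_u, u), (name_v, v) in combinations(vectors.items(), 2):
    results[(name_u, name_v)] = multipliers(u, v, requirement)

# A pair yields the requirement vector only if both multipliers are nonnegative.
usable = sorted(pair for pair, st in results.items()
                if st is not None and min(st) >= 0)
for pair, st in results.items():
    print(pair, st)
print(usable)
```

Pairs that need a negative multiplier place the requirement vector outside the cone spanned by those two vectors alone.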

Alternative optimal solutions. Suppose that the objective function
in the above problem is altered to

    maximize $4x_1 + 10x_2$

so that there are alternative optimal solutions. This does not change the
requirements space representation.

Unbounded optimal solutions. Consider again the model

(7)   maximize $-2x_1 + 6x_2$

subject to

(8)   $-1x_1 - 1x_2 \le -2$

(9)   $-1x_1 + 1x_2 \le 1$

(10)   $x_1 \ge 0$ and $x_2 \ge 0$.

Figure 3.6. Unbounded Optimal Solution.

Recall from Sec. 3.4 that in this model the objective function can be made
arbitrarily large by utilizing $x_1$, $x_2$, and in effect the slack variable from relation (8).
In Fig. 3.6 these variables correspond to the vectors (-1, -1), (-1, 1), and (1, 0),
respectively. If you first add the vectors (-1, -1) and (1, 0), and then add the
resultant vector (0, -1) to the multiple $x_2 = 2$ of (-1, 1), you obtain (-2, 1),
as shown:

(11)   $\begin{pmatrix} -1 \\ -1 \end{pmatrix} + \begin{pmatrix} 1 \\ 0 \end{pmatrix} + 2 \begin{pmatrix} -1 \\ 1 \end{pmatrix} = \begin{pmatrix} 0 \\ -1 \end{pmatrix} + 2 \begin{pmatrix} -1 \\ 1 \end{pmatrix} = \begin{pmatrix} -2 \\ 1 \end{pmatrix}$.

This vector addition is indicated by the points labeled a on Fig. 3.6.

Similarly, if you first add .8 of (-1, -1) to .6 of (1, 0), and then add the
resultant vector (-.2, -.8) to 1.8 of (-1, 1), you obtain (-2, 1):

(12)   $.8 \begin{pmatrix} -1 \\ -1 \end{pmatrix} + .6 \begin{pmatrix} 1 \\ 0 \end{pmatrix} + 1.8 \begin{pmatrix} -1 \\ 1 \end{pmatrix} = \begin{pmatrix} -.2 \\ -.8 \end{pmatrix} + 1.8 \begin{pmatrix} -1 \\ 1 \end{pmatrix} = \begin{pmatrix} -2 \\ 1 \end{pmatrix}$.

This vector addition is indicated by the points labeled b on Fig. 3.6. There is an
infinite number of combinations of these three vectors, which, when added, yield
(-2, 1). Such a combination having a larger multiple of (-1, 1) will give a
larger value for the maximizing form.
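
The family of combinations can be written down explicitly. Choosing the multiple of (-1, 1) fixes the other two multiples, provided the slack of relation (9) stays at zero. The sketch below is a hypothetical parametrization of that family; the function names are illustrative only.

```python
from fractions import Fraction

def combination(x2):
    """Multiples (x1, x2, slack) of (-1, -1), (-1, 1), and (1, 0) that sum to (-2, 1)."""
    x1 = x2 - 1            # second coordinate: -x1 + x2 = 1
    slack = x1 + x2 - 2    # first coordinate: -x1 - x2 + slack = -2
    return x1, x2, slack

def resultant(x1, x2, slack):
    return (-x1 - x2 + slack, -x1 + x2)

def objective(x1, x2):
    return -2 * x1 + 6 * x2

# All multiples stay nonnegative as long as x2 is at least 3/2.
for x2 in (Fraction(9, 5), Fraction(2), Fraction(10), Fraction(100)):
    x1, _, slack = combination(x2)
    print(x1, x2, slack, resultant(x1, x2, slack), objective(x1, x2))
```

The first two choices of x2 reproduce the combinations labeled b and a, respectively, and larger choices give larger objective values.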

Optimization. In the requirements space diagrams, we have not indicated
so far any information pertaining to the objective function, that is, to the problem
of what vectors should be summed to optimize the value of the linear objective
function. The information can be added by introducing one more dimension.
Consider the problem in (1) through (4), shown in Fig. 3.5. Add as a third
dimension the coefficients of the maximizing function. Thus the vectors are
(1, 0, 0), (4, 2, 12), (3, 5, 15), and (0, 1, 0), and span a polyhedral cone in 3-space.
The requirement vector is (12, 10, Z), where Z is to be maximized over all the
vectors in the polyhedral cone.

The illustration appears in Fig. 3.7. Having represented the requirements
vector parametrically as (12, 10, Z), draw it as a straight line parallel to the axis
of the added dimension and piercing the cone. For some problems, the line may
pierce the cone only once, and for others, not at all. In the latter case, there exists
either an unbounded optimal solution or no feasible solution at all.

Figure 3.7. Objective-Function Representation in Requirements Space.

Assume that the line pierces the cone twice. Then an optimal solution corre¬ 
sponds to the “higher” point of intersection of the line with the cone. This point 
typically is within a face of the polyhedral cone, that is, on a plane spanned by 
several vectors. The variables corresponding to these vectors comprise an optimal 
set. 
REVIEW AND COMPUTATIONAL EXERCISES


1 Explain your understanding of the following terms:

sense of optimization
sense of an inequality
slack variable
surplus variable
*lower-bounded variable
canonical form
solution space
solution set
convex set
extreme-point solution
*hyperplane
*half-space
*convex polyhedral set
*requirements space
*convex polyhedral cone.

In Secs. 3.2 and 3.3, you read that any linear programming model can be
transformed into certain specified canonical forms. Two such representations are

    maximize $\sum_{j=1}^{n} c_j x_j$

subject to                                                    (Form 1)

    $\sum_{j=1}^{n} a_{ij} x_j \le b_i$   for i = 1, 2, ..., m

    $x_j \ge 0$   for j = 1, 2, ..., n,

and

    minimize $\sum_{j=1}^{n} c_j x_j$

subject to                                                    (Form 2)

    $\sum_{j=1}^{n} a_{ij} x_j = b_i$   for i = 1, 2, ..., m, and where $b_i \ge 0$

    $x_j \ge 0$   for j = 1, 2, ..., n.

In exercises 2 through 5, transform each problem into Form 1 and Form 2.


2 Consider 

minimize Sxj — 4^2 4 - l;c 3 

subject to 

— Ixi -1- 5^3 = 50 
2x, - 3x2 ^ 12. 

(a) Assume every Xj > 0. 

*{b) Assume Xi ^ 4, Xa ^ 0, and X 3 > 0. 

♦(c) Assume Xj ^ 0, X 2 ^ —5, and X 3 ^ 0. 


3 Consider


maximize lx, — 2 x 2 Sx, 





subject to 


(a) Assume every xj ^ 0. 

(b) Assume Xi ^ 0, X 2 ^ 0, 

(c) Assume Xj ^ 0> X 2 ^ 2, 

(d) Assume Xj ^ 0, X 2 > 0, 

(e) Assume Xi ^ 0, X 2 > 0, 


Ixi 

■f + 

lx, 

VI 

Ixi 

- i*j + 

1*3 

^-2 

3x, 

+ 

2*3 

= 5 


u, — 

1*3 

^ 1 . 

and X 3 

r^O. 



and X 3 

S: 0 . 



and X 3 

b unconstrained in sign< 

and X 3 

^- 4 . 




4 Consider 

minimize — 2 xi + 5 x 2 — + 6 x 4 . 

subject to 

4xi — 2xj -f- ^4 = 2 

2^1 + xt + 4x3 — X4 ^ 12 

—Sxj 4“ 2x2 — 8 x 4 = —31 

+ ^4 12. 


(a) Assume every x^ 0. 

(b) Assume X 4 ^ 0 and every other Xj ^ 0. 

♦(c) Assume X 4 > 2 and every other xj ^ 0, 

"♦(d) Assume X 4 ^ — 2 and every other xj ^ 0. 

♦ (e) Assume X 4 is unconstrained in sign and every other xy ^ 0. 

♦(f) Assume Xj > 0, X 2 ^ — 6 , x, ^ 0, and X 4 ^ —2. 

♦(g) Assume Xi > 0, Xj ^ 0, and X 2 and X 4 arc unconstrained in sign. 


5 Consider , 

maximize —Xi -f- 5 x 2 + 5xs — 2 x 4 

subject to 

—5xi + 3x2 — 4xs 4 - ^4 ^ —10 
3xi 4- 6 x 2 — 2xs — X 4 = 7 
xi 4- 4x2 4- 4xj 4- 3x4 ^ 15 

2xi 4- X 2 4- 2x8 — 6 x 4 = — 11. 

(a) Assume every xj ^ 0. 

(b) Assume Xi ^ 0 and every other xy ^ 0. 

♦(c) Assume Xj > 2 and every other xy ^ 0. 

♦(d) Assume Xj ^ —2 and every other xy ^ 0. 

♦(c) Assume >fi is unconstrained in sign and every other xy ^ 0. 

♦(f) Assume xi ^ —2, X 2 ^ 0, Xj ^ 0, and X 4 ^ —3. 

♦ (g) Assume X 2 ^ 0, Xj ^ 0, and Xi and X 4 arc unconstrained in sign. 
In exercises 6 through 20, draw the solution space representation. Identify
all the extreme points. In each of the parts, indicate an optimal solution. State
whether it is unique, and if it is not, give two more optimal solutions. Be sure you
calculate optimal values for $x_1$ and $x_2$ and find the associated value of the objective
function.


6 Consider the constraints 


(a) Minimize Xi, 

(b) Minimize X 2 . 

(c) Maximize Xi. 

(d) Maximize X 2 . 

(e) Minimize Xi + X 2 . 


—Xi- + X2 ^ 1 

6xi + 4x2 ^ 24 
Xi ^ 0 and X 2 ^ 2. 

(f) Maximize Xi + X 2 . 

(g) Maximize — Xj + 2x2. 

(h) Maximize Xi — 2x2. 

(i) Maximize — 3xi — 2x2. 


7 Answer the questions in exercise 6 assuming also that Xj ^ 5. 


8 Answer the questions in exercise 6 assuming also that X 2 < 4. 


9 Answer the questions in exercise 6 assuming also that Xj ^ 5 and X 2 < 4. 


10 Consider the constraints 

-10x1-15x2^-150 


5xi + 10x2 ^ 50 


xi — X 2 ^ 0 


Xi ^ 2 and X 2 ^ 0. 

(a) Maximize Xi. 

(f) Minimize Xi + X 2 . 

(b) Minimize Xj. 

(g) Maximize Xi 3x2. 

(c) Maximize X 2 . 

(h) Maximize — 2xi + ^ 2 - 

(d) Minimize X 2 . 

(i) Maximize —Xi — 3x2< 

(e) Maximize Xj + X 2 . 

(j) Maximize — Xi — 2x2. 


11 Answer the questions in exercise 10 assuming the lower bound Xj ^ 5. 

12 Answer the questions in exercise 10 assuming the lower bound Xj ^ 15. 


13 Answer the questions in exercise 10 assuming the lower bound Xj ^ 20. 

14 Answer the questions in exercise 10 assuming the additional constraint X 2 < 5. 

15 Answer the questions in exercise 10 assuming the additional constraint X 2 ^ 6. 

16 Answer the questions in exercise 10 assuming the additional constraint X 2 ^ 8. 
17 Consider the constraints




Xi + X 2 ^ 5 
— 7xi + 8 x 2 ^ 0 



3xi + 2 x 2 18 



0 ^x 1 ^4 and 0 ^X 2 ^ 6 . 

(a) 

Maximize Xp 

(g) Maximize 3xi + 5 x 2 - 

(b) 

Minimize x^. 

(h) Minimize Sxj + 5 x 2 . 

(c) 

Maximize X 2 . 

(i) Maximize — 2xi — X 2 . 

(d) 

Minimize X 2 . 

(j) Maximize —Xi — 2 x 2 

(e) 

Maximize Xj + X 2 . 

(k) Minimize Xi — 2 x 2 . 

(f) 

Minimize Xi + X 2 . 

(1) Minimize 2xi — X 2 . 


18 Answer the questions in exercise 17 assuming the first constraint is Xj -h ^ 

19 Answer the questions in exercise 17 assuming the first constraint is Xi + <^2 ^ 

20 Answer the questions in exercise 17 assuming the new bounds 0 ^ ^ 4.5 and 

0 ^ X 2 < 3. 

*21 Draw the requirements space representation for the constraints 

^1 + 4x2 + ^ 1 

1.5xi + X 2 + 3xj ^ 1 

every xj ^ 0 . 

*22 Draw the requirements space representation for the constraints 

Xj "h 3 x 2 — 2X3 —• X4 ^ 1 

5X2 + 2X2 — Xj — 3X4 ^ 2 
every x^ ^ 0. 
CHAPTER 4


Simplex Method 
of Solution 


4.1 THEORY IN PERSPECTIVE

You will probably never have to calculate manually the solution of a linear 
programming model in a real application, since an electronic computer will do the 
work for you. You may legitimately ask, then, “Why do I need to know the under¬ 
lying theory of linear optimization models? Isn’t it enough merely to have the skill 
to formulate them?” It is difficult to answer these questions convincingly for a 
student who has not already learned the fundamentals of linear optimization 
theory; but in the light of considerable experience in applying linear programming 
to industrial problems, we are persuaded that an executive must understand the 
principles explained here in order to make truly effective and sustained use of this
management tool. 

The following analogy may help win you over to our point of view. In learning 
to drive a car, it strains the intellect only mildly to learn to speed up, slow down, 
go forward or backward, turn, and so forth; in short, to master the skill of driving.
To qualify as a topnotch driver, however, you need to know more. You should 
have an idea of how to care for the battery; otherwise you may make the mistake 
of playing the radio for a long time with the ignition off. If you drive on icy high¬ 
ways, a knowledge of the braking mechanism should help you control your car 
when it skids. If your engine overheats suddenly, an understanding of the radiator’s 
function may suggest the correct remedy. Summing up, to be a good driver you 
must know more than how to handle your auto under ideal conditions. You must 
know enough about its mechanism to be able to skirt danger when possible. On 
the other hand, you can obviously be an excellent driver without the training of an
automobile mechanic.

By the same token, the manager who resolutely avoids familiarizing himself with
the basic mechanism of his operations research application is flirting with trouble. 
If he really wants to maintain control, he must nurture his insight to the approach. 
Only a modest effort is required to reach the appropriate level of knowledge; it 
does not entail your becoming an expert theoretician. 

You may ask another serious question concerning the value of studying linear 
optimization theory: “Do I have any assurance that the methods I learn now will 
remain relevant in a few years, given the rapidity of new advances in technology?” 
Of course, we can only speculate about the future, but history offers guidance. 
Again, the automobile analogy is helpful in framing an answer. If you glance at 
a typical early-model car, you recognize at once that it is an auto. Despite the 
vast technological improvements in automobile design, much of the basic struc¬ 
ture has remained unchanged. 

The advances in methods for solving linear optimization models show a similar 
pattern. Each decade of development has brought substantial modifications in 
computer methods for solving linear programming problems. Nevertheless, 
certain fundamental ideas have withstood the test of time, and still provide the 
basic structure of all approaches. These are the principles you will learn next. 

4.2 ASSESSMENT OF THE OBJECTIVE 

In this chapter, you will examine a computational method—an algorithm 
—for deriving numerical solutions to linear programming models. The task goes 
beyond merely specifying steps that eventually lead to a solution. The goal is to 
discover a systematic method that permits you to analyze and fully understand a 
model and its complex interrelations. Such a goal is essential, for in actual applica¬ 
tions of linear programming you always want more than a mere numerical answer. 
Usually, you want to know precisely how the answer depends on the input speci¬ 
fications, that is, how sensitive the solution is to the original data. The importance 
of sensitivity analysis is apparent since often the technological specifications are 
based on estimates and the constraints included may only be approximate. Further, 
a number of real constraints may be provisionally left absent from the model. 
And the objective function may not completely exhaust the factors of relevance in 
evaluating a solution. 

An equally important objective is perceiving the basic structure of optimizing 
algorithms. By concentrating in this chapter on what is known as the simplex
technique, you will become familiar with the algorithmic method. As a result, you will
gain the insights needed for a sound understanding of approaches to other
optimization models studied in subsequent chapters.

Rather than start with a full-blown statement of the simplex algorithm, we 
begin slowly with some simplifying assumptions. Otherwise, you might easily get 
bogged down in detail, and lose sight of the central ideas, which are very intuitive.
Rest assured, by the time you finish this chapter, you will have learned all the
details necessary to apply the algorithm to any linear programming model.

Preliminary analysis. Before exploring the simplex algorithm, we
briefly examine a small linear programming model so as to appreciate the hurdles
to be overcome by a technique of solution. Consider the problem

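$$
\begin{aligned}
(1)\quad & \text{maximize } 2x_1 + 3x_2 + 7x_3 + 9x_4 \\
& \text{subject to} \\
(2)\quad & 1x_1 + 1x_2 + 1x_3 + 1x_4 + 1x_5 = 9 \\
(3)\quad & 1x_1 + 2x_2 + 4x_3 + 8x_4 + 1x_6 = 24 \\
& x_j \ge 0 \quad \text{for } j = 1, 2, \ldots, 6.
\end{aligned}
$$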

As a start toward solution, try the maximal amount of $x_4$, the variable having the
largest coefficient in the objective function. Check that this trial amount is $x_4 = 3$
and is determined by the coefficients in (3). The associated objective function value
is 9(3) = 27. If you try instead the maximal amount of $x_3$, which you can verify to
be $x_3 = 6$, then the associated objective function value is the larger amount
7(6) = 42. Thus, because of the constraining relations, it is not always best to use
only a variable with the largest payoff per unit.
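
The two trials above can be repeated mechanically for every economic variable. The following is only a minimal sketch in Python; the names `objective`, `rows`, `rhs`, and `single_variable_trial` are illustrative assumptions, not notation from the text. It uses one variable at a time at its largest feasible level, lets the slack variables absorb the remainder, and reports the resulting payoff.

```python
from fractions import Fraction

# Data of model (1)-(3); variable names are illustrative only.
objective = [2, 3, 7, 9, 0, 0]          # payoff per unit of x1..x6
rows = [[1, 1, 1, 1, 1, 0],             # constraint (2)
        [1, 2, 4, 8, 0, 1]]             # constraint (3)
rhs = [9, 24]

def single_variable_trial(j):
    """Largest feasible amount of variable j (0-based) used on its own,
    with the slacks taking up the remainder, and the resulting payoff."""
    limits = [Fraction(b, row[j]) for row, b in zip(rows, rhs) if row[j] > 0]
    amount = min(limits)
    return amount, objective[j] * amount

for j in range(4):
    amount, value = single_variable_trial(j)
    print(f"x{j + 1}: amount {amount}, objective {value}")
```

The variable with the largest per-unit payoff, $x_4$, is limited to 3 units by constraint (3), which is why $x_3$ gives the better single-variable trial.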

Check that letting $x_3 = 6$ implies that $x_5 = 3$. Since the latter variable does
not contribute to the value of the objective function, we ask next whether we can
improve the solution further by combining several other economic variables. Even
in a problem as small as this, it would be wearisome to test various combinations
unless some simple rules were available that allowed us to skip over clearly inferior
solutions.

Suppose you guess next that an optimal solution contains $x_2$ and $x_3$ at positive
levels, and all the other variables at zero levels. It is easy to verify whether this
conjecture is valid. You need only be able to manipulate (2) and (3) by the usual
method in elementary algebra for solving two simultaneous linear equations. In
mathematics literature, the approach often is called Gaussian elimination.

As a preliminary, bring $x_1$, $x_4$, $x_5$, and $x_6$ to the right-hand side of the equality
signs:

$$
\begin{aligned}
(4)\quad & 1x_2 + 1x_3 = 9 - 1x_1 - 1x_4 - 1x_5 \\
(5)\quad & 2x_2 + 4x_3 = 24 - 1x_1 - 8x_4 - 1x_6.
\end{aligned}
$$

Then eliminate $x_2$ from (5) by multiplying (4) by 2 and subtracting the result from
(5) to obtain (7):

$$
\begin{aligned}
(6)\quad & 1x_2 + 1x_3 = 9 - 1x_1 - 1x_4 - 1x_5 \\
(7)\quad & 2x_3 = 6 + 1x_1 - 6x_4 + 2x_5 - 1x_6.
\end{aligned}
$$

Next normalize the coefficient of $x_3$ in (7) by dividing by 2, yielding (9):

$$
\begin{aligned}
(8)\quad & 1x_2 + 1x_3 = 9 - 1x_1 - 1x_4 - 1x_5 \\
(9)\quad & 1x_3 = 3 + \tfrac{1}{2}x_1 - 3x_4 + 1x_5 - \tfrac{1}{2}x_6.
\end{aligned}
$$

Finally, you should eliminate $x_3$ from (8) by subtracting (9) from (8), giving the pair
of equations

$$
\begin{aligned}
(10)\quad & x_2 = 6 - \tfrac{3}{2}x_1 + 2x_4 - 2x_5 + \tfrac{1}{2}x_6 \\
(11)\quad & x_3 = 3 + \tfrac{1}{2}x_1 - 3x_4 + 1x_5 - \tfrac{1}{2}x_6.
\end{aligned}
$$

The combined normalizing and eliminating operation is sometimes called
pivoting.

The present hypothesis is that $x_1 = x_4 = x_5 = x_6 = 0$. Then from (10) and (11)
you can conclude that $x_2 = 6$ and $x_3 = 3$. [Check that these values satisfy (2)
and (3).] For this solution, the value of the objective function (1) is 3(6) + 7(3)
= 39. Since you previously examined a solution ($x_3 = 6$) that gave an objective
function value of 42, you know the current guess is not optimal. However, we are
seeking a simple way to discover whether the current guess is optimal without
referring to any previous trials.
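
The elimination that produced the trial values can be sketched in a few lines. This is a minimal illustration, assuming exact rational arithmetic through Python's `fractions` module; the helper name `solve_pair` is hypothetical, and the sketch assumes the first coefficient is nonzero and the two equations are independent.

```python
from fractions import Fraction

def solve_pair(a, b, c, d, p, q):
    """Solve a*u + b*v = p and c*u + d*v = q by elimination.
    Assumes a != 0 and that the two equations are independent."""
    a, b, c, d, p, q = map(Fraction, (a, b, c, d, p, q))
    # Eliminate u from the second equation, then normalize v.
    factor = c / a
    v = (q - factor * p) / (d - factor * b)
    # Back-substitute to recover u.
    u = (p - b * v) / a
    return u, v

# Coefficients of x2 and x3 in (2) and (3), all other variables at zero.
x2, x3 = solve_pair(1, 1, 2, 4, 9, 24)
value = 3 * x2 + 7 * x3
print(x2, x3, value)
```

Calling the same helper with the coefficients of another pair of variables tests any other two-variable guess in the same way.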

To proceed in this pursuit, substitute the relations for $x_2$ and $x_3$ as given in (10)
and (11) into the objective function:

$$
\begin{aligned}
(12)\quad & 2x_1 + 3\left(6 - \tfrac{3}{2}x_1 + 2x_4 - 2x_5 + \tfrac{1}{2}x_6\right) \\
& \quad + 7\left(3 + \tfrac{1}{2}x_1 - 3x_4 + 1x_5 - \tfrac{1}{2}x_6\right) + 9x_4 \\
& = 39 + 1x_1 - 6x_4 + 1x_5 - 2x_6.
\end{aligned}
$$

It is evident from (12) that raising the value of either $x_1$ or $x_5$ from zero will
increase the objective function beyond 39. In fact, according to the coefficients of
$x_1$ and $x_5$ in (12), for each unit you introduce of either of these variables, the
objective function goes up by 1.

Suppose you decide to increase $x_1$. How large can you make it? Looking at (10)
and (11), which are merely rearranged versions of (2) and (3), you can check
that if you increase $x_1$ and leave $x_4 = x_5 = x_6 = 0$, then $x_2$ will decrease by $\tfrac{3}{2}$
and $x_3$ will increase by $\tfrac{1}{2}$ for each unit of $x_1$ added. But for a feasible solution all
$x_j \ge 0$. Therefore, $x_1$ can be made only as large as 4, at which point $x_2$ becomes
0. Further, according to (12), the corresponding value of the objective function is
39 + 1(4) = 43.

You are now ready to try your newest guess at an optimal solution, namely that
$x_1$ and $x_3$ are positive and all the rest of the variables are zero. Check your
understanding of the way we progressed from the pair (2) and (3) to the pair (10) and
(11) by carrying out the analogous computations to solve for $x_1$ and $x_3$. You should
obtain

$$
\begin{aligned}
(13)\quad & x_1 = 4 - \tfrac{2}{3}x_2 + \tfrac{4}{3}x_4 - \tfrac{4}{3}x_5 + \tfrac{1}{3}x_6 \\
(14)\quad & x_3 = 5 - \tfrac{1}{3}x_2 - \tfrac{7}{3}x_4 + \tfrac{1}{3}x_5 - \tfrac{1}{3}x_6.
\end{aligned}
$$

If you set $x_2 = x_4 = x_5 = x_6 = 0$, then $x_1 = 4$ and $x_3 = 5$. The value of the
objective function is 2(4) + 7(5) = 43. Notice the objective function equals
39 + 1(4) = 43, as predicted by (12), and is an improvement on the previous
guess.

To test for optimality, you can substitute as before the relations for the solution
variables (13) and (14) into the objective function. Verify that this gives

$$
(15)\quad 43 - \tfrac{2}{3}x_2 - \tfrac{14}{3}x_4 - \tfrac{1}{3}x_5 - \tfrac{5}{3}x_6.
$$

Now you can see that if the variables $x_2$, $x_4$, $x_5$, or $x_6$ are utilized at any level
above zero, the objective function must decrease from its current value 43. Thus
the present solution is indeed optimal.
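
The substitutions that led to (12) and (15) can be checked for any pair of trial variables. The sketch below is one possible way to do it, not the procedure of the text: it inverts the 2-by-2 matrix of the chosen pair and rewrites the objective in terms of the remaining variables. The names `c`, `A`, `b`, and `reduced_objective` are assumptions made for illustration.

```python
from fractions import Fraction

c = [2, 3, 7, 9, 0, 0]                  # objective coefficients of x1..x6
A = [[1, 1, 1, 1, 1, 0],                # constraint (2)
     [1, 2, 4, 8, 0, 1]]                # constraint (3)
b = [9, 24]

def reduced_objective(pair):
    """Return (constant, coefficients) of the objective after eliminating
    the two variables in `pair` (0-based indices)."""
    p, q = pair
    det = Fraction(A[0][p] * A[1][q] - A[0][q] * A[1][p])
    # Inverse of the 2x2 matrix formed by the columns of the chosen pair.
    inv = [[A[1][q] / det, -A[0][q] / det],
           [-A[1][p] / det, A[0][p] / det]]

    def solve(col):
        # Express a column (or the right-hand side) in terms of the pair.
        return [inv[0][0] * col[0] + inv[0][1] * col[1],
                inv[1][0] * col[0] + inv[1][1] * col[1]]

    levels = solve(b)
    constant = c[p] * levels[0] + c[q] * levels[1]
    coeffs = []
    for j in range(6):
        y = solve([A[0][j], A[1][j]])
        coeffs.append(c[j] - (c[p] * y[0] + c[q] * y[1]))
    return constant, coeffs

print(reduced_objective((1, 2)))   # pair x2, x3: compare with (12)
print(reduced_objective((0, 2)))   # pair x1, x3: compare with (15)
```

A positive coefficient in the returned list marks a variable whose introduction would raise the objective; when none is positive, as for the pair $x_1$, $x_3$, the trial solution passes the optimality test described above.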

It is intuitive that the ideas in the preceding analysis could be distilled and 
formalized into a set of rules. The result would be an algorithm that would guide 
the selection of guesses to be tested and provide a criterion for ascertaining whether 
a particular trial solution was optimal. If you are eager to see this done, or if an 
optimizing algorithm is a new idea to you, then we advise that you skip the next 
section and come back to it after you have finished learning the simplex algorithm 
in Sec. 4.4. The questions raised in the next section will be more meaningful after 
you have studied an algorithm. But if you like proceeding from the general to the 
specific, then continue reading. Don’t be too disturbed when some of the material 
seems vague at this first exposure—the fuzziness will disappear by the time you 
have finished the chapter. 

4.3 THE ALGORITHMIC METHOD

You will study several types of algorithms throughout this book. By far, the 
type used most often is an iterative approach: at each trial solution, starting with 
the first, the instructions specify whether to stop the computations or to proceed 
to a new trial solution. It is essential to know when a suggested computational 
scheme is a bona fide algorithm for solving a problem.

Consider, for example, the process we used to solve the small problem above. 
Whenever we wanted to test the optimality for a specified pair of variables, we had 
no trouble in finding unique values for this pair that satisfied all the constraints. 
Was this only luck? We devised a simple procedure for testing whether a trial 
solution could be improved by introducing another variable. Does this approach 
always work? We found that bringing one variable into a trial solution caused only
one other variable to leave the solution. Is this sort of uniquely determined
substitution inevitable? We discovered an optimal solution rather rapidly. Is such speed
typical?

You will find it helpful for understanding both the simplex method and other 
algorithms in subsequent chapters to have a general framework for viewing 
computational techniques. Specifically, to evaluate such algorithms, you should 
examine four interrelated characteristics. 

1. Completeness. Are the algorithm’s rules unambiguous? Is there a
practical method for always obtaining a first trial solution to initiate the
algorithm? Is the method described in terms of rules so well specified that
they can be performed by a person or an electronic calculator having only the
ability to read and follow the rules without exercising further judgment?

2. Domain of applicability. What mathematical problems does the
algorithm purport to solve? How easy is it to determine whether a specific
problem falls within this domain of applicability? When the calculations
terminate, does the final-trial answer invariably provide an exact solution
to the problem? If not, is there an indication of the existence and cause of
failure?

3. Convergence properties. Does the algorithm always converge? If
so, does it unfailingly converge to a correct solution? Will the method
terminate in a finite number of iterations? For actual problems, how many
iterations occur until convergence? Do the trial solutions progressively
improve the objective function value? If the calculations are stopped short
of obtaining an optimal solution, is the final-trial answer usable?

4. Computational requirements. How severe is the computational 
burden needed to obtain a solution? How much numerical accuracy is 
required to ensure the method operates satisfactorily? 

Establishing the answers to many of these questions is often a routine exercise 
for an operations researcher. But the most subtle category of questions is that 
relating to determining the convergence properties of an algorithm. Although the 
topic of convergence in algorithmic methods involves a number of intricacies, the 
ideas essential for the purposes here can be demonstrated simply. For the sake of
illustration, suppose you are trying to maximize the value of an objective function,
provided the solution satisfies certain constraints. This would be so if you were
solving a linear programming model. The cases in Fig. 4.1 show how the value of
the objective function might vary from one iteration to the next for a specific
algorithm. 

Case 1. At each iteration you observe a strict improvement in the objective
function. Convergence to an optimal solution occurs after a finite number of
iterations. In the example, the optimum is reached at iteration 7.

Case 2. At each iteration you observe a strict improvement in the objective
function. However, convergence to an optimal solution does not occur in a finite
number of iterations. At each trial, the solution gets a little bit better, but the
improvements get progressively smaller.

Case 3. At each iteration you observe a strict improvement in the objective
function. And, as in the preceding case, convergence does not occur in a finite
number of iterations. But the objective function is tending to a limiting value
that is below the true optimum!

Case 4. Convergence to an optimal solution occurs in a finite number of
iterations, and each solution is at least as good as the preceding one. But at some
iterations the objective function does not show a strict improvement. In the
example, the objective function stalls at iterations 3, 7, and 8.

Case 5. Prior to convergence, the value of the objective function decreases for
several iterations. In the illustration (Case 5 of Fig. 4.1), this phenomenon
occurs at iterations 4 and 5.

Figure 4.1. Examples of Convergence.

Thus the objective function in an algorithm may or may not converge to a
limiting value in a finite number of iterations. If it does converge, the limiting
value may or may not be the truly optimal value. And, in any case, the objective
function may or may not show a strict improvement during the course of the
iterations.
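
When you have a record of objective function values from an actual run, the distinctions among strict improvement, stalling, and temporary worsening can be checked mechanically. The following sketch assumes a maximization problem; the function name `describe_progress` and the sample sequences are hypothetical and are not the data plotted in Fig. 4.1. Note that such a check cannot tell you whether a limiting value is truly optimal, which is the situation of Case 3.

```python
def describe_progress(values):
    """Summarize how the objective value of a maximization run changes.
    values[k] is the objective value at iteration k (k = 0, 1, 2, ...)."""
    steps = [later - earlier for earlier, later in zip(values, values[1:])]
    return {
        "strict_improvement": all(s > 0 for s in steps),
        "never_worse": all(s >= 0 for s in steps),
        # Iterations whose value equals (or falls below) the preceding one.
        "stalled_at": [k + 1 for k, s in enumerate(steps) if s == 0],
        "worsened_at": [k + 1 for k, s in enumerate(steps) if s < 0],
    }

# Hypothetical runs: one that stalls once, one that worsens once.
print(describe_progress([0, 2, 2, 5]))
print(describe_progress([0, 3, 1, 4]))
```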

4.4 INTRODUCTION TO THE SIMPLEX ALGORITHM

Many different algorithms have been proposed to solve linear programming
problems, but the one below has proved to be the most effective in general.
Recognize, however, that if you are able to assume more about the form of the problem
(such as that it has a network flow structure), then even better algorithms can be
applied. We discuss these methods for special models in Chap. 7.

Let us verbalize the reasoning process used in Sec. 4.2 to solve the illustrative
example. In that case, there were two equations. You examined trial solutions
having two unknowns at positive levels and the remaining variables at zero levels.
If the approach is to generalize, it is reasonable to suppose that in an m-equation
model, you would choose m variables to have positive levels, and let the remaining
variables equal zero. Assume for the time being that a feasible solution exists
and the optimal value of the objective function is finite. We remove these
restrictive assumptions later. Then the procedure appears to be the following.

Step 1. Select a set of m variables that yields a feasible starting trial solution.
Eliminate the selected m variables from the objective function.

Step 2. Check the objective function to see whether there is a variable that is
equal to zero in the trial solution but would improve the objective function if made
positive. If such a variable exists, go to Step 3. Otherwise, stop.

Step 3. Determine how large the variable found in the previous step can be
made until one of the m variables in the trial solution becomes zero. Eliminate
the latter variable and let the next trial set contain the newly found variable
instead.

Step 4. Solve for these m variables, and set the remaining variables equal to
zero in the next trial solution. Return to Step 2.

Interestingly, after tidying up the language of these steps to remove certain
ambiguities, the resulting algorithm in fact does find an optimal solution to a
general linear programming model in a finite number of iterations. Often this method
is termed Dantzig's simplex algorithm, in honor of the mathematician who devised
the approach.

We first examine a “well-behaved” problem and explain the simplex
method by means of this example. Afterwards we complete the presentation of
the details.

Example. Consider the model

$$
\begin{aligned}
& \text{maximize } 4x_1 + 5x_2 + 9x_3 + 11x_4 \\
& \text{subject to} \\
& 1x_1 + 1x_2 + 1x_3 + 1x_4 \le 15 \\
& 7x_1 + 5x_2 + 3x_3 + 2x_4 \le 120 \\
& 3x_1 + 5x_2 + 10x_3 + 15x_4 \le 100 \\
& x_1 \ge 0 \quad x_2 \ge 0 \quad x_3 \ge 0 \quad x_4 \ge 0.
\end{aligned}
$$

You may recall that this is the Product-Mix Selection Example in Sec. 2.2. Did
you jot down your guess of the optimal solution?

Let $x_0$ be the value of the objective function and add slack variables. Then
write the system as

$$
\begin{aligned}
1x_0 - 4x_1 - 5x_2 - 9x_3 - 11x_4 &= 0 && \text{Row 0} \\
1x_1 + 1x_2 + 1x_3 + 1x_4 + 1x_5 &= 15 && \text{Row 1} \\
7x_1 + 5x_2 + 3x_3 + 2x_4 + 1x_6 &= 120 && \text{Row 2} \\
3x_1 + 5x_2 + 10x_3 + 15x_4 + 1x_7 &= 100 && \text{Row 3,}
\end{aligned}
\tag{1}
$$

where all the variables must be nonnegative. Notice how the introduction of the
variable $x_0$ in Row 0 permits us to express the objective function in equation form.

The task of Step 1 is to find a starting feasible solution to (1). There are a large
number of such solutions, but it is certainly most convenient to begin with $x_0 = 0$,
$x_5 = 15$, $x_6 = 120$, $x_7 = 100$, and all other variables equal to 0. In other words,
start with an all-slack solution. We term this an initial basic solution, and $x_0$, $x_5$, $x_6$,
and $x_7$ are known as the basic variables, sometimes shortened to the
basis. The remaining variables we call nonbasic.

The word basic stems from the mathematical property that the solution is
represented by four variables, “four” being the number of linear relations, and that the
values of these four variables are uniquely determined (and dependent on the
constants on the right-hand side of the equations). As you will discover, the
simplex method employs only trial basic solutions. Despite the restriction to this class
of solutions, the simplex method nevertheless determines an optimal solution,
provided one exists.

If you interpret $x_0$ as profits, then the current solution is certainly not very
profitable. Undoubtedly you can improve it. A moment’s reflection leads to
examining the coefficients in Row 0 of those variables currently not in the basis.
Specifically, look at the coefficients of $x_1$, $x_2$, $x_3$, and $x_4$. Each negative coefficient
represents how much $x_0$ will increase if you let the corresponding variable equal 1.
This conclusion about the interpretation of the coefficients in Row 0 remains valid
throughout the trial calculations, and we record the general principle here.

Interpretation of Coefficients in Row 0. Each coefficient represents the
increase (for negative coefficients) or decrease (for positive coefficients) in $x_0$ with a unit
increase of the associated nonbasic variable.

For Step 2 the simplex method adopts the following easy-to-apply rule for
deciding the variable to enter the next trial basis.

Simplex Criterion I (Maximization). If there are nonbasic variables having a
negative coefficient in Row 0, select one such variable with the most negative coefficient,
that is, the best per-unit potential gain (say $x_j$). If all nonbasic variables have positive
or zero coefficients in Row 0, an optimal solution has been obtained.
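
Criterion I translates almost word for word into code. The sketch below is illustrative only; the function name `entering_variable` and the list layout (Row 0 coefficients indexed by variable number) are assumptions. When several variables tie for the most negative coefficient, this version simply takes the first one listed.

```python
def entering_variable(row0, nonbasic):
    """Criterion I for maximization: index of the nonbasic variable with the
    most negative Row 0 coefficient, or None when the trial solution is optimal."""
    candidates = [j for j in nonbasic if row0[j] < 0]
    if not candidates:
        return None
    # min() keeps the first of several equal values, so ties go to the
    # variable listed first in `nonbasic`.
    return min(candidates, key=lambda j: row0[j])

# Row 0 of the example: coefficients of x0, x1, ..., x7.
row0 = [1, -4, -5, -9, -11, 0, 0, 0]
print(entering_variable(row0, nonbasic=[1, 2, 3, 4]))
```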

According to Criterion I, you should introduce $x_4$. Each unit of $x_4$ added brings
about an increase of 11 in $x_0$. Clearly, the larger you make $x_4$, the more you increase
$x_0$. But examine the constraints (1). Notice that as you increase $x_4$, you must
decrease every current basic variable corresponding to a row wherein $x_4$ has a positive
coefficient. Specifically, if $x_4$ is increased to the level 1, then verify in (1) that

(i) $x_5$ must be decreased by 1 so that the Row 1 constraint remains satisfied,

(ii) $x_6$ must be decreased by 2 so that the Row 2 constraint remains satisfied,

(iii) $x_7$ must be decreased by 15 so that the Row 3 constraint remains satisfied.

How big can $x_4$ become before one of the current basic variables reaches its
lower bound 0? Check that this number is $x_4 = \frac{100}{15} = 6.67$, at which value you
find $x_7 = 0$. Such being the case, introduce $x_4$ into the basis and remove $x_7$.

We summarize the preceding discussion by the following rule for Step 3.

Simplex Criterion II. (a) Take the ratios of the current right-hand side to the
coefficients of the entering variable $x_j$ (ignore ratios with zero or negative numbers in
the denominator). (b) Select the minimum ratio; that ratio will equal the value of $x_j$
in the next trial solution. The minimum ratio occurs for a variable $x_k$ in the present
solution; set $x_k = 0$ in the new solution.
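
The ratio test of Criterion II can be sketched as follows. The function name `leaving_row` is an assumption for illustration, rows are numbered from 0 within the lists passed in, and the sketch presumes that at least one coefficient of the entering variable is positive.

```python
from fractions import Fraction

def leaving_row(column, rhs):
    """Criterion II: among rows with a positive coefficient for the entering
    variable, return (row position, ratio) for the minimum ratio."""
    ratios = [(Fraction(b) / Fraction(a), i)
              for i, (a, b) in enumerate(zip(column, rhs)) if a > 0]
    ratio, i = min(ratios)   # equal ratios are resolved by the earlier row
    return i, ratio

# Entering variable x4: its coefficients and the right-hand sides in Rows 1-3.
position, ratio = leaving_row(column=[1, 2, 15], rhs=[15, 120, 100])
print(position, ratio)
```

Position 2 in these lists corresponds to Row 3, whose basic variable $x_7$ leaves, and the ratio is the value $x_4$ takes in the next trial solution.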

The calculations indicated by Criterion II are performed in Fig. 4.2.

Figure 4.2. Iteration 1: Criterion II for $x_4$ Entering Basis.

Now that you know $x_4$ is to replace $x_7$ in a trial basis, you perform Step 4. Rewrite
the relations (1) so that $x_4$ has a coefficient of 1 in Row 3 and coefficients of 0 in
Rows 0, 1, and 2, just as $x_7$ does in (1). The process by which this is achieved is
known as a change-of-basis calculation, or a pivot operation. First
divide Row 3 by 15, the coefficient of $x_4$,

$$
\begin{aligned}
1x_0 - 4x_1 - 5x_2 - 9x_3 - 11x_4 &= 0 && \text{Row 0} \\
1x_1 + 1x_2 + 1x_3 + 1x_4 + 1x_5 &= 15 && \text{Row 1} \\
7x_1 + 5x_2 + 3x_3 + 2x_4 + 1x_6 &= 120 && \text{Row 2} \\
\tfrac{1}{5}x_1 + \tfrac{1}{3}x_2 + \tfrac{2}{3}x_3 + 1x_4 + \tfrac{1}{15}x_7 &= \tfrac{20}{3} && \text{Row 3.}
\end{aligned}
\tag{2}
$$

By pivoting on 15, you have created the coefficient of 1 for $x_4$ in Row 3. Note
that the mathematical manipulation is legitimate, for all you have done is divide 
equals by equals (both sides of Row 3 by 15). Create coefficients equal to 0 in 
Rows 0, 1, and 2 as follows: 

Row 0: multiply Row 3 by 11 and add to Row 0, 

Row 1: multiply Row 3 by —1 and add to Row 1, 

Row 2: multiply Row 3 by —2 and add to Row 2. 

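The result is

$$
\begin{aligned}
1x_0 - \tfrac{9}{5}x_1 - \tfrac{4}{3}x_2 - \tfrac{5}{3}x_3 + \tfrac{11}{15}x_7 &= \tfrac{220}{3} && \text{Row 0} \\
\tfrac{4}{5}x_1 + \tfrac{2}{3}x_2 + \tfrac{1}{3}x_3 + 1x_5 - \tfrac{1}{15}x_7 &= \tfrac{25}{3} && \text{Row 1} \\
\tfrac{33}{5}x_1 + \tfrac{13}{3}x_2 + \tfrac{5}{3}x_3 + 1x_6 - \tfrac{2}{15}x_7 &= \tfrac{320}{3} && \text{Row 2} \\
\tfrac{1}{5}x_1 + \tfrac{1}{3}x_2 + \tfrac{2}{3}x_3 + 1x_4 + \tfrac{1}{15}x_7 &= \tfrac{20}{3} && \text{Row 3.}
\end{aligned}
\tag{3}
$$

As before, the mathematical manipulations are legitimate, for all you have done
is multiply equals by equals (both sides of each row by a constant) and then add
equals to equals (one row to another row). Consequently, the relations (3),
although appearing to be in different form from (1), are equivalent to (1). The
usefulness of the form of (3) is that by letting $x_1 = x_2 = x_3 = x_7 = 0$, you see
immediately the values for the new trial-basic solution, shown in Fig. 4.3.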
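
The change-of-basis calculation that carries (1) into (3) is the same for every iteration, so it is natural to express it once as a routine. The following is a minimal sketch using exact fractions; the function name `pivot` and the tableau layout (one list per row, columns $x_0$ through $x_7$ followed by the right-hand side) are assumptions made for illustration.

```python
from fractions import Fraction

def pivot(tableau, r, s):
    """Return a new tableau in which column s has 1 in row r and 0 elsewhere."""
    p = tableau[r][s]
    new_r = [v / p for v in tableau[r]]           # normalize the pivot row
    return [new_r if i == r
            else [v - row[s] * w for v, w in zip(row, new_r)]  # eliminate
            for i, row in enumerate(tableau)]

# System (1): columns x0..x7, then the right-hand side.
T = [[Fraction(v) for v in row] for row in [
    [1, -4, -5, -9, -11, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 1, 0, 0, 15],
    [0, 7, 5, 3, 2, 0, 1, 0, 120],
    [0, 3, 5, 10, 15, 0, 0, 1, 100]]]

T = pivot(T, r=3, s=4)                  # x4 enters, Row 3 is the pivot row
print([str(row[-1]) for row in T])      # right-hand sides of (3)
```

Subtracting a multiple of the normalized pivot row is the same operation as the text's "multiply Row 3 by a constant and add," with the sign of the multiplier absorbed into the subtraction.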

As a check of your understanding, observe that the new basic solution has profit
$x_0 = \frac{220}{3}$, which can be calculated from the relation

new profit = (profit from previous solution) + (number of units of new
basic variable) × (per-unit potential gain from new basic variable,
as was given by the previous coefficient in Row 0)

$$
\tfrac{220}{3} = 0 + \tfrac{20}{3}(11).
$$

You now are better able to interpret Criterion II. If $x_5$ (or $x_6$) is eliminated instead
of $x_7$, then in finding new values for $x_4$, $x_7$, and $x_6$ (or $x_5$), you would obtain negative
numbers, contrary to the nonnegativity constraints. Try to eliminate $x_5$ instead of
$x_7$ for a second-trial solution and thereby verify these remarks.

Figure 4.3. Second Trial Basic Solution: $x_0 = \frac{220}{3}$, $x_5 = \frac{25}{3}$, $x_6 = \frac{320}{3}$, $x_4 = \frac{20}{3}$.

Iteration 2. At this point the first iteration of the simplex method has been
completed. On returning to Step 2, you are ready to determine whether an
optimal solution has been obtained or if another simplex iteration is required.
Criterion I, which examines the nonbasic variables, indicates that a still better
solution seems to exist. You might profitably enter into the basis either $x_1$, or $x_2$,
or $x_3$. Criterion I selects $x_1$, since it promises the greatest gain per unit increase.
Next perform the Step 3 calculations, using Criterion II as shown in Fig. 4.4.

Figure 4.4. Iteration 2: Criterion II for $x_1$ Entering Basis.

From Fig. 4.4, notice that $x_1$ will replace $x_5$ in the next trial solution. You must
rewrite (3) to reflect the substitution. To make the change-of-basis calculation in
Step 4, first divide by the pivot $\frac{4}{5}$ in Row 1, causing the coefficient of 1 to be
removed from $x_5$ and to appear for $x_1$:

$$
\begin{aligned}
1x_0 - \tfrac{9}{5}x_1 - \tfrac{4}{3}x_2 - \tfrac{5}{3}x_3 + \tfrac{11}{15}x_7 &= \tfrac{220}{3} && \text{Row 0} \\
1x_1 + \tfrac{5}{6}x_2 + \tfrac{5}{12}x_3 + \tfrac{5}{4}x_5 - \tfrac{1}{12}x_7 &= \tfrac{125}{12} && \text{Row 1} \\
\tfrac{33}{5}x_1 + \tfrac{13}{3}x_2 + \tfrac{5}{3}x_3 + 1x_6 - \tfrac{2}{15}x_7 &= \tfrac{320}{3} && \text{Row 2} \\
\tfrac{1}{5}x_1 + \tfrac{1}{3}x_2 + \tfrac{2}{3}x_3 + 1x_4 + \tfrac{1}{15}x_7 &= \tfrac{20}{3} && \text{Row 3.}
\end{aligned}
\tag{4}
$$

Then to complete the pivot operation, create coefficients equal to 0 for $x_1$ in
Rows 0, 2, and 3 as follows:

Row 0: multiply Row 1 by $\frac{9}{5}$ and add to Row 0,

Row 2: multiply Row 1 by $-\frac{33}{5}$ and add to Row 2,

Row 3: multiply Row 1 by $-\frac{1}{5}$ and add to Row 3.

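The result is

$$
\begin{aligned}
1x_0 + \tfrac{1}{6}x_2 - \tfrac{11}{12}x_3 + \tfrac{9}{4}x_5 + \tfrac{7}{12}x_7 &= \tfrac{1105}{12} && \text{Row 0} \\
1x_1 + \tfrac{5}{6}x_2 + \tfrac{5}{12}x_3 + \tfrac{5}{4}x_5 - \tfrac{1}{12}x_7 &= \tfrac{125}{12} && \text{Row 1} \\
-\tfrac{7}{6}x_2 - \tfrac{13}{12}x_3 - \tfrac{33}{4}x_5 + 1x_6 + \tfrac{5}{12}x_7 &= \tfrac{455}{12} && \text{Row 2} \\
\tfrac{1}{6}x_2 + \tfrac{7}{12}x_3 + 1x_4 - \tfrac{1}{4}x_5 + \tfrac{1}{12}x_7 &= \tfrac{55}{12} && \text{Row 3.}
\end{aligned}
\tag{5}
$$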

The third trial-basic solution appears in Fig. 4.5. Notice that at each iteration,
the coefficients of any variable not in the trial solution have the interpretation that
if a unit of this variable is introduced, the current basis variables will be changed
by these amounts. In (3), for example, introducing each unit of $x_1$ will decrease
$x_5$ by $\frac{4}{5}$, $x_6$ by $\frac{33}{5}$, and $x_4$ by $\frac{1}{5}$. Corroborate this statement with the values you have
derived for the third solution, in which $\frac{125}{12}$ units of $x_1$ were actually introduced.

Figure 4.5. The Third Trial Basic Solution.

Also observe that prior to introducing $x_1$ into the basis, $x_2$ looked attractive in (3).
Entering $x_1$ caused $x_2$ to change its indication in the present iteration (5).

Iteration 3. Having completed the second simplex iteration, once more examine
the coefficients in Row 0 to ascertain whether you have discovered an optimal
solution. It now appears favorable to enter $x_3$. The calculations to determine which
variable leaves the basis are given in Fig. 4.6. You will find from these calculations
that $x_4$, which entered at the first iteration, is now to be removed from the
basis. Frequently in applications of the simplex method, a variable enters the
solution at one iteration and is removed at a later iteration. It is this possibility
that prohibits stating a helpful upper bound on the number of simplex iterations
necessary to solve any linear programming problem.

Figure 4.6. Iteration 3: Criterion II for $x_3$ Entering Basis.

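As with the preceding iterations, create a coefficient of 1 for the entering
variable $x_3$ in Row 3 by dividing Row 3 by $\frac{7}{12}$. This gives

$$
\begin{aligned}
1x_0 + \tfrac{1}{6}x_2 - \tfrac{11}{12}x_3 + \tfrac{9}{4}x_5 + \tfrac{7}{12}x_7 &= \tfrac{1105}{12} && \text{Row 0} \\
1x_1 + \tfrac{5}{6}x_2 + \tfrac{5}{12}x_3 + \tfrac{5}{4}x_5 - \tfrac{1}{12}x_7 &= \tfrac{125}{12} && \text{Row 1} \\
-\tfrac{7}{6}x_2 - \tfrac{13}{12}x_3 - \tfrac{33}{4}x_5 + 1x_6 + \tfrac{5}{12}x_7 &= \tfrac{455}{12} && \text{Row 2} \\
\tfrac{2}{7}x_2 + 1x_3 + \tfrac{12}{7}x_4 - \tfrac{3}{7}x_5 + \tfrac{1}{7}x_7 &= \tfrac{55}{7} && \text{Row 3.}
\end{aligned}
\tag{6}
$$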

Now create coefficients equal to 0 for $x_3$ in the remaining rows as follows:

Row 0: multiply Row 3 by $\frac{11}{12}$ and add to Row 0,

Row 1: multiply Row 3 by $-\frac{5}{12}$ and add to Row 1,

Row 2: multiply Row 3 by $\frac{13}{12}$ and add to Row 2.

The result is

$$
\begin{aligned}
1x_0 + \tfrac{3}{7}x_2 + \tfrac{11}{7}x_4 + \tfrac{13}{7}x_5 + \tfrac{5}{7}x_7 &= \tfrac{695}{7} && \text{Row 0} \\
1x_1 + \tfrac{5}{7}x_2 - \tfrac{5}{7}x_4 + \tfrac{10}{7}x_5 - \tfrac{1}{7}x_7 &= \tfrac{50}{7} && \text{Row 1} \\
-\tfrac{6}{7}x_2 + \tfrac{13}{7}x_4 - \tfrac{61}{7}x_5 + 1x_6 + \tfrac{4}{7}x_7 &= \tfrac{325}{7} && \text{Row 2} \\
\tfrac{2}{7}x_2 + 1x_3 + \tfrac{12}{7}x_4 - \tfrac{3}{7}x_5 + \tfrac{1}{7}x_7 &= \tfrac{55}{7} && \text{Row 3.}
\end{aligned}
\tag{7}
$$

At this iteration you have just seen another aspect to the computational rule in
Criterion II. Had you attempted to eliminate a variable where a negative ratio
occurred in Fig. 4.6 (this would be $x_6$), then $x_3$ would have become negative.
Such a condition is not permitted. In fact, the negative coefficient causes the basic
variable for the associated row to increase in value as the new variable is
introduced into the solution. If you ever tried to eliminate a basic variable in a row
where 0 occurs as the coefficient of the entering variable, you would not be able to
carry out the prescribed pivoting manipulations. The very first operation would
call for a division by 0, which is not permissible.

To sum up, Criterion II ensures that each new basic solution results only in zero
or positive values for the trial values of the basis. Consequently, the solution
remains feasible at every iteration.

Iteration 4. All the coefficients in Row 0 of (7) are nonnegative, and
consequently Criterion I asserts you have found an optimal solution (Fig. 4.7).
Thus the calculations are terminated in Step 2. Check that the values in Fig. 4.7
do indeed satisfy (1).

Figure 4.7. Fourth Trial and Optimal Basic Solution.

Summary. In brief the simplex method consists of

Step 1. Select an initial basis.

Step 2. Apply Simplex Criterion I. If the current-trial solution is not optimal,
proceed to Step 3. Otherwise, stop.

Step 3. Apply Simplex Criterion II.

Step 4. Make a change of basis, and return to Step 2.

►You can easily interpret the progress of the simplex method in the geometry of the
solution space. Each basis corresponds to a vertex of the convex polyhedral set of
feasible solutions. Going from one basis to the next represents going from one extreme
point to an adjacent one. Thus the simplex method can be said to seek an optimal
solution by climbing along the edges, from one vertex of the convex polyhedral solution set to a
neighboring one. ◄

Optimality. It is a simple matter to demonstrate that you have in fact found
an optimal solution. Since you have derived the final formulation (7) from the
original equations (1) only by elementary operations, such as multiplying equals
by equals and adding equals to equals, the two formulations are merely
different-appearing views of the same problem. Suppose you had started with (7).
Rearranging Row 0 slightly, you can write the objective function as

$$
(8)\quad \text{maximize } x_0 = \tfrac{695}{7} - \tfrac{3}{7}x_2 - \tfrac{11}{7}x_4 - \tfrac{13}{7}x_5 - \tfrac{5}{7}x_7.
$$

If the nonbasic variables $x_2$, $x_4$, $x_5$, or $x_7$ have any feasible value other than zero,
then the above relation shows that $x_0$ will be less than its present value of $\frac{695}{7}$.
The final Row 0 coefficients of the original variables are sometimes referred to as
relative or shadow costs. They represent the decrease in the optimal
value of the objective function resulting from a unit increase in a nonbasic variable,
assuming the final basis remains feasible. (The final Row 0 coefficients of the slack
variables are sometimes called shadow prices, and their interpretation is
discussed at length in the next chapter.)

For example, suppose you decided to set $x_2 = 1$; then the value of $x_0$ would
decrease by $\frac{3}{7}$. The values of the basic variables would also change, and you could
examine this effect in the same way you used for $x_0$. Consider the basic variable $x_1$
in Row 1 of (7):

$$
(9)\quad 1x_1 + \tfrac{5}{7}x_2 - \tfrac{5}{7}x_4 + \tfrac{10}{7}x_5 - \tfrac{1}{7}x_7 = \tfrac{50}{7}.
$$

By letting $x_2 = 1$, you decrease the value of $x_1$ by the amount $\frac{5}{7}$.
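
To check the whole calculation of this section, the four steps of the Summary can be combined into one short routine. The code below is a sketch under stated assumptions, not a production implementation: the name `simplex_max` and the tableau layout are illustrative, a feasible starting basis is supplied by the caller, ties are resolved by taking the first candidate, and no safeguard against the degenerate cases discussed later in the chapter is included.

```python
from fractions import Fraction

def simplex_max(tableau, basis):
    """Tableau simplex for a maximization model written like system (1).

    tableau: rows 0..m; columns x0..xn followed by the right-hand side.
    basis:   basis[i] is the index of the basic variable of row i.
    Assumes the starting basis is feasible.
    """
    T = [[Fraction(v) for v in row] for row in tableau]
    basis = list(basis)
    rhs_col = len(T[0]) - 1
    while True:
        # Step 2 (Criterion I): most negative Row 0 coefficient.
        s = min(range(1, rhs_col), key=lambda j: T[0][j])
        if T[0][s] >= 0:
            return T, basis
        # Step 3 (Criterion II): minimum ratio over positive coefficients.
        ratios = [(T[i][-1] / T[i][s], i)
                  for i in range(1, len(T)) if T[i][s] > 0]
        if not ratios:
            raise ValueError("objective function is unbounded")
        _, r = min(ratios)
        # Step 4: change of basis (pivot on row r, column s).
        p = T[r][s]
        T[r] = [v / p for v in T[r]]
        for i in range(len(T)):
            if i != r:
                f = T[i][s]
                T[i] = [v - f * w for v, w in zip(T[i], T[r])]
        basis[r] = s

system_1 = [[1, -4, -5, -9, -11, 0, 0, 0, 0],
            [0, 1, 1, 1, 1, 1, 0, 0, 15],
            [0, 7, 5, 3, 2, 0, 1, 0, 120],
            [0, 3, 5, 10, 15, 0, 0, 1, 100]]
T, basis = simplex_max(system_1, basis=[0, 5, 6, 7])
for i, var in enumerate(basis):
    print(f"x{var} = {T[i][-1]}")
print("final Row 0 coefficients:", [str(v) for v in T[0][:-1]])
```

The final Row 0 printed by the sketch should agree with (7), so the coefficients of $x_2$, $x_4$, $x_5$, and $x_7$ can be read off directly as the quantities discussed under Optimality.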


Alternative optima. You saw in Chap. 3 (Fig. 3.2) that a linear
programming problem may have more than one optimal solution. The
two-dimensional diagrammatics show that if more than one optimal solution exists, then an
infinite number exist. The corresponding generalization to larger problems is
valid.

To illustrate how this situation occurs, suppose there is an additional variable
$x_8$ in the example above, and that it appears in the initial formulation (1) as

$$
\begin{aligned}
-2x_8 &\qquad \text{Row 0} \\
+1x_8 &\qquad \text{Row 1} \\
+9x_8 &\qquad \text{Row 2} \\
+.2x_8 &\qquad \text{Row 3.}
\end{aligned}
\tag{10}
$$

It is straightforward to check that at the final iteration, $x_8$ would appear in (7) as

$$
\begin{aligned}
0x_8 &\qquad \text{Row 0} \\
+\tfrac{9.8}{7}x_8 &\qquad \text{Row 1} \\
+\tfrac{2.8}{7}x_8 &\qquad \text{Row 2} \\
-\tfrac{2.8}{7}x_8 &\qquad \text{Row 3.}
\end{aligned}
\tag{11}
$$

If you introduce $x_8$, then verify that $x_1$ leaves the basis according to Criterion II,
and that the alternative basic optimal solution is

$$
\begin{aligned}
x_0 &= \tfrac{695}{7} \\
x_8 &= \tfrac{50}{9.8} = 5.10 \\
x_6 &= \tfrac{3045}{7(9.8)} = 44.39 \\
x_3 &= \tfrac{679}{7(9.8)} = 9.90,
\end{aligned}
\tag{12}
$$

where $x_1 = x_2 = x_4 = x_5 = x_7 = 0$. It is easy to show that any positive-weighted
average of these two basic optimal solutions also yields an alternative
optimal (feasible) solution:

$$
\begin{aligned}
x_0 &= \tfrac{695}{7} \\
x_1 &= w\left(\tfrac{50}{7}\right) + (1 - w)0 \\
x_3 &= w\left(\tfrac{55}{7}\right) + (1 - w)9.90 \\
x_6 &= w\left(\tfrac{325}{7}\right) + (1 - w)44.39 \\
x_8 &= w(0) + (1 - w)5.10,
\end{aligned}
\tag{13}
$$

where $w$ is the weighting factor such that $0 \le w \le 1$. [Try $w = .5$ in
(13) and calculate $x_1$, $x_3$, $x_6$, and $x_8$. Check that these values satisfy the original
constraints and yield an objective function value of $\frac{695}{7}$.]
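
The check suggested in the bracketed exercise can be carried out exactly. The sketch below is only an illustration: it assumes the column for $x_8$ given in (10), and it uses the exact fractions 485/49, 2175/49, and 250/49, which are the unrounded forms of 9.90, 44.39, and 5.10. The names `first`, `second`, `blend`, and `check` are hypothetical.

```python
from fractions import Fraction as F

# Original data of the example with the extra column for x8 from (10).
# Columns: x1, x2, ..., x8.
c = [4, 5, 9, 11, 0, 0, 0, 2]
A = [[1, 1, 1, 1, 1, 0, 0, 1],
     [7, 5, 3, 2, 0, 1, 0, 9],
     [3, 5, 10, 15, 0, 0, 1, F(1, 5)]]
b = [15, 120, 100]

# The two basic optimal solutions (values of x1..x8).
first = [F(50, 7), 0, F(55, 7), 0, 0, F(325, 7), 0, 0]
second = [0, 0, F(485, 49), 0, 0, F(2175, 49), 0, F(250, 49)]

def blend(w):
    """Weighted average w*first + (1 - w)*second, with 0 <= w <= 1."""
    return [w * u + (1 - w) * v for u, v in zip(first, second)]

def check(x):
    """Return (feasible?, objective value) for a full vector x1..x8."""
    feasible = all(v >= 0 for v in x) and all(
        sum(a * v for a, v in zip(row, x)) == rhs for row, rhs in zip(A, b))
    return feasible, sum(cj * v for cj, v in zip(c, x))

for w in (0, F(1, 2), 1):
    print(w, check(blend(w)))
```

Every weight between 0 and 1 should give a feasible point with the same objective value, which is the sense in which the optimal solutions form an infinite family.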

► Frequently, in large-scale applications there are more than two alternative optimal 
basic solutions. This condition is indicated by a zero coefficient for two or more nonbasic 
variables in the final Row 0. Advanced techniques exist for determining all basic optimal 
solutions. Once these have been found, any positive-weighted average of the alternative 
solutions is also a solution. Note that the resultant set of variables does not constitute a 
basis. The set contains more variables than the number of constraining relations; 
consequently, listing which of the variables are in the set is not sufficient to uniquely 
determine their values. ◄

Conclusion. It is fair to say that the optimal solution to even the small 
problem in this section was not obvious from the start. In larger models, the
importance of being able to approach an optimal solution in a systematic fashion is
manifest. We turn next to an evaluation of the simplex algorithm in terms of the 
four characteristics discussed in Sec. 4.3; if you skipped that section, go back and 
read it now. 

4.5 COMPLETENESS OF THE ALGORITHM 

At some iterations, the operations stated in Criteria I and II of the simplex
algorithm may be ambiguous as to the variable to introduce into or to remove from
the basis. In Criterion I, when two or more variables appear equally promising, as
indicated by the values of their coefficients in Row 0, an arbitrary rule may be
adopted for selecting one of these. For example, use the lowest-numbered variable,
or one suspected to be in the final basis.

In Criterion II, when two or more variables in the current basis are to fall simultaneously
to the level zero upon introducing the new variable, only one of these is
to be removed from the basis. The others remain in the basis at zero level. The
resultant basis is termed degenerate. As we discuss in Sec. 4.7 below, a delicate
question of theory is involved in the selection of a tie-breaking rule. Unless some
care is given to the method of deciding which variable is to be removed from the
basis, you cannot prove the method always converges. However, long experience
with simplex computations has led to the conclusion that for all practical purposes,
the selection can be arbitrary and the associated danger of nonconvergence is
negligible.

If you find at some iteration in applying Criterion II that there is no positive
coefficient in any row for the entering variable, then there exists an unbounded
optimal solution. In this event, the entering variable can be made arbitrarily
large, the value of $x_0$ thereby increases without bound, and the current basic
variables remain nonnegative. Thus we now may drop the earlier assumption that
the optimal value of the objective function is finite. The simplex algorithm provides
an indication of when an unbounded optimal solution occurs. Reword Criterion II
to cover this case.
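The three outcomes of Criterion II discussed in this section (a unique leaving variable, a tie that produces a degenerate basis, and no eligible row at all) can be sketched in a few lines. This is only an illustration under assumed conventions: the function name `criterion_two` and the choice to signal unboundedness with an empty list are hypothetical, not the book's wording of the rule.

```python
from fractions import Fraction as F

def criterion_two(column, rhs):
    """Return (rows achieving the minimum ratio, that ratio).

    column[i] is the entering variable's coefficient in Row i+1 and
    rhs[i] is the current value of that row's basic variable.
    Only rows with a positive coefficient are eligible. An empty list
    of rows signals an unbounded optimal solution.
    """
    ratios = [(F(rhs[i]) / F(column[i]), i)
              for i in range(len(column)) if column[i] > 0]
    if not ratios:
        return [], None
    smallest = min(r for r, _ in ratios)
    return [i for r, i in ratios if r == smallest], smallest

# Unique minimum ratio: one row is selected.
print(criterion_two([1, 0], [6, 10]))
# Tie at ratio zero: either row may be chosen, and the basis is degenerate.
print(criterion_two([F(1, 4), F(1, 2), 0], [0, 0, 1]))
# No positive coefficient: the entering variable can grow without bound.
print(criterion_two([-1, 0], [1, 1]))
```

Returning every tied row, rather than silently picking one, keeps the tie-breaking rule a separate and visible decision.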

Starting basis. Here we turn to the selection of an initial basis to begin
the algorithm. Because each constraint in the example of the preceding section
was of the form

(1)  $\sum_{j} a_{ij} x_j \le b_i$  where $b_i \ge 0$,

adding a slack variable to each relation and starting with an all-slack basic
solution provided a simple way of initiating the simplex algorithm. As you recall
from Chap. 2, relation (1) does not encompass all the constraints that may be
found in a linear optimization model. But the discussion in Chap. 3 enabled us to
assert that the constraints in any linear programming model can be written as

(2)  $\sum_{j=1}^{n} a_{ij} x_j = b_i$  for $i = 1, 2, \ldots, m$ where $b_i \ge 0$.

In this form, if a variable appears only in constraining relation i and has a
coefficient of 1, as would be the case for a slack variable, it can be used as part of
the initial basis. But relation i may not have such a variable. (This would occur,
for example, if the ith equation were linearly dependent on one or more of the
other equations, such as being a sum of two equations.) Then we may employ the
following approach.

Write the constraints as

(3)  $\sum_{j=1}^{n} a_{ij} x_j + y_i = b_i$  for $i = 1, 2, \ldots, m$ where $b_i \ge 0$,

and where $y_i \ge 0$; then use $y_i$ as the basic variable for relation i. (We have assumed,
for simplicity, that every constraint requires the addition of a $y_i$.) The name
artificial variable is given to $y_i$ because it is added as an artifice in order to
obtain an initial trial solution. Is this approach legitimate? The answer is yes,
provided you satisfy Condition A.

Condition A: To ensure that the final solution is meaningful, every $y_i$ must equal 0 at the terminal
iteration of the simplex method.

If the constraints admit of no feasible solution, it will be impossible to satisfy
Condition A. At the final iteration of the simplex algorithm, at least one $y_i$ will be
in the basis at a positive level, indicating the infeasibility condition. Thus you can
remove the assumption made in Sec. 4.4 that a feasible solution exists. If one does
not, the simplex algorithm will tell you so.

We can state at last that the instructions for carrying out the rules of the simplex 
method are complete. They can be, and, of course, have been, programmed for 
operation on every large-scale electronic computer. 

► We mention that in the most sophisticated computer programs of the simplex algorithm,
the model designer is not required to write each restriction in the form (2) above.
He may specify his constraints exactly as he originally formulates them. The computer
program then automatically adds either a slack, surplus, or artificial variable to each
relation and also guarantees Condition A. In some programs, the user may even preselect
a particular set of variables to be the initial basis. This procedure is known as an
advanced start, and is designed to reduce the number of iterations required. Other
similar “hints” may be supplied to the computer to indicate those variables that are the
most likely candidates for the final basis. Adroit use of such devices for speeding convergence
is part of the art of applying the simplex algorithm.

The Big M Method. There are a number of computational techniques for
guaranteeing Condition A. Since they are of interest principally to operations
research specialists, however, we do not delve into them here except to mention a
simple but somewhat inelegant approach.

Add to the maximizing objective function each $y_i$ with a large penalty-cost
coefficient:

(4)  $x_0 - \sum_{j=1}^{n} c_j x_j + \sum_{i=1}^{m} M y_i = 0,$

where M is relatively large. To initiate the algorithm, you first eliminate each
$y_i$ from (4) by using (3). This gives

(5)  $x_0 - \sum_{j=1}^{n} c_j x_j - M \sum_{i=1}^{m} \sum_{j=1}^{n} a_{ij} x_j = -M \sum_{i=1}^{m} b_i,$

which simplifies to

(6)  $x_0 - \sum_{j=1}^{n} \Bigl( c_j + M \sum_{i=1}^{m} a_{ij} \Bigr) x_j = -M \sum_{i=1}^{m} b_i.$

The very technique of optimizing drives the $y_i$ variables to zero, provided there
exists a feasible solution. (Incidentally, whenever a $y_i$ drops from a basis at some
iteration, you need never consider using it again, and can eliminate it from further
computations.)

An example will clarify the approach. Consider the problem

(7)  maximize $-3x_1 - 2x_2$

subject to

(8)  $1x_1 + 1x_2 = 10$

(9)  $1x_1 \ge 4$

(10)  $x_1 \ge 0 \quad x_2 \ge 0.$

Then, after adding a surplus variable in (9), we can write the model as

        $x_0 + 3x_1 + 2x_2 = 0$    Row 0
(11)    $1x_1 + 1x_2 = 10$    Row 1
        $1x_1 - 1x_3 = 4$    Row 2.

Next, we introduce artificial variables $y_1$ and $y_2$ and let M = 10, giving

        $x_0 + 3x_1 + 2x_2 + 10y_1 + 10y_2 = 0$    Row 0
(12)    $1x_1 + 1x_2 + 1y_1 = 10$    Row 1
        $1x_1 - 1x_3 + 1y_2 = 4$    Row 2.

To initiate the algorithm, you have to subtract (M = 10) times Row 1 and (M = 10)
times Row 2 from Row 0 to eliminate $y_1$ and $y_2$:

        $x_0 - 17x_1 - 8x_2 + 10x_3 = -140$    Row 0
(13)    $1x_1 + 1x_2 + 1y_1 = 10$    Row 1
        $1x_1 - 1x_3 + 1y_2 = 4$    Row 2.

What is the optimal solution? 
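The starting Row 0 in (13) can also be obtained directly from formula (6), without writing out the intermediate system (12). The sketch below does this for the example; the function name `big_m_row0` is a hypothetical helper, and it assumes, as the text does, that every constraint carries its own artificial variable.

```python
def big_m_row0(c, A, b, M):
    """Row 0 of (6): coefficients of each x_j, and the right-hand side.

    c[j] are the objective coefficients of the maximization problem,
    A[i][j] and b[i] describe the constraints written in form (3).
    Assumes one artificial variable per constraint.
    """
    n = len(c)
    coefficients = [-(c[j] + M * sum(row[j] for row in A)) for j in range(n)]
    right_hand_side = -M * sum(b)
    return coefficients, right_hand_side

# Example (7) through (10), after adding the surplus variable x3 in (9).
c = [-3, -2, 0]
A = [[1, 1, 0],
     [1, 0, -1]]
b = [10, 4]

print(big_m_row0(c, A, b, M=10))  # ([-17, -8, 10], -140)
```

The coefficients of $y_1$ and $y_2$ do not appear in the result because they are zero in Row 0 once the elimination has been carried out.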

4.6 DOMAIN OF APPLICABILITY 

Many linear programming models have, as their objective, minimizing a
linear function. They can be solved by changing the signs of the coefficients in
the objective function, as was pointed out in Sec. 3.2. Then Criterion I (Maximization)
is applied to the revised form. But we find it convenient to record here,
for future purposes, a restatement of Criterion I when minimization is called for
and the objective function is left in its original form.

Simplex Criterion I (Minimization). If there are nonbasic variables having a
plus coefficient in Row 0, select one such variable with the most positive coefficient (say
$x_j$). If all nonbasic variables have minus or zero coefficients in Row 0, an optimal solution
has been obtained.

An example will clarify the procedure. Consider the trivial linear programming
problem

(1)  minimize $-2x_1 + 3x_2$

subject to

(2)  $0 \le x_1 \le 6$ and $0 \le x_2 \le 10$.

We could proceed to solve the problem by first changing both the signs of the
coefficients in (1) and the sense of optimization:

(3)  maximize $2x_1 - 3x_2$.

This transformation will give the same optimal values for $x_1$ and $x_2$. Then if we use
the simplex algorithm format in the preceding section, we would start the iterations
with the model

        $x_0 - 2x_1 + 3x_2 = 0$    Row 0
(4)     $1x_1 + 1x_3 = 6$    Row 1
        $1x_2 + 1x_4 = 10$    Row 2,

and enter $x_1$ into the basis according to Criterion I (Maximization).

But we can instead employ (1) directly by defining the variable $x_0$ equal to the
objective function in (1), and setting up the initial formulation as

        $x_0 + 2x_1 - 3x_2 = 0$    Row 0
(5)     $1x_1 + 1x_3 = 6$    Row 1
        $1x_2 + 1x_4 = 10$    Row 2.

Here we would apply Criterion I (Minimization), so that at the first iteration we
select $x_1$ to enter the basis, since its coefficient is +2 in Row 0.

In a minimization problem, no revision in Criterion II is needed, because this 
second rule is aimed only at keeping each basic solution feasible. 
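The two versions of Criterion I differ only in the sign they look for in Row 0. A minimal sketch, assuming Row 0 is held as a plain list of the nonbasic coefficients; the function name `criterion_one` is hypothetical, and ties are resolved in favor of the lowest-numbered variable, one of the arbitrary rules mentioned in Sec. 4.5.

```python
def criterion_one(row0, maximize=True):
    """Index of the entering variable, or None when the solution is optimal.

    row0[j] is the Row 0 coefficient of nonbasic variable j.
    Maximization looks for the most negative coefficient,
    minimization for the most positive one.
    """
    if maximize:
        j = min(range(len(row0)), key=lambda k: row0[k])
        return j if row0[j] < 0 else None
    j = max(range(len(row0)), key=lambda k: row0[k])
    return j if row0[j] > 0 else None

# Row 0 of (4): coefficients of x1, x2 in the maximization form.
print(criterion_one([-2, 3], maximize=True))    # 0, that is, x1 enters
# Row 0 of (5): coefficients of x1, x2 in the minimization form.
print(criterion_one([2, -3], maximize=False))   # 0, that is, x1 enters
```

Both formulations select the same entering variable, as the text asserts.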

Property of an optimal solution. Let us summarize what you have 
learned about the simplex method so far: 

(i) The algorithm applies to all linear optimization models. 

(ii) At the final iteration, the terminal answer is an exact solution to the problem.

We have not yet established that the simplex algorithm terminates for all linear
programming models. We discuss this point in the next section. But assuming that
convergence can be established, you have discovered a remarkable proposition: 

Theorem of the Basis: If a linear optimization model has a finite optimal 
solution, then there exists an optimal basic solution. 

Recall that in a linear model with m constraints, a basis is a set of m variables having
unique values that satisfy the constraints when all the other variables are
assigned 0 value.

To see the full impact of this theorem, suppose you want to solve a linear programming
model with 50 (linearly independent) constraints and 300 unknowns.
If the hypothesis of the theorem holds, there is an optimal solution containing
at most 50 variables at a positive level. Adding more variables may improve the
optimal value of the objective function, but it will not increase the number of
variables required for an optimal solution beyond 50. Notice that adding more
linearly independent constraints to a model enlarges the size of the set of basic
variables. Therefore, an optimal solution to the augmented model usually requires
more variables to be at a positive level.

How do you know the Theorem of the Basis is correct? You know because
when the hypothesis is true, the simplex method actually constructs such an optimal
basic solution.
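For a very small model the Theorem of the Basis can be observed directly by brute force: list every set of m columns, solve for the corresponding basic solution, discard the infeasible ones, and keep the best. The sketch below does this with exact arithmetic. It is not the simplex method and would be hopeless for a model of realistic size; the model is assumed to be that of exercise 22(a) with slack variables added, and all names are illustrative.

```python
from fractions import Fraction as F
from itertools import combinations

def solve(B, b):
    """Solve B z = b exactly by Gauss-Jordan elimination; None if B is singular."""
    m = len(B)
    M = [[F(v) for v in row] + [F(rhs)] for row, rhs in zip(B, b)]
    for col in range(m):
        piv = next((r for r in range(col, m) if M[r][col] != 0), None)
        if piv is None:
            return None
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(m):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [row[-1] for row in M]

def best_basic_solution(A, b, c):
    """Enumerate every basis; return (best value, optimal basic solutions)."""
    m, n = len(A), len(A[0])
    best, winners = None, []
    for cols in combinations(range(n), m):
        B = [[A[i][j] for j in cols] for i in range(m)]
        z = solve(B, b)
        if z is None or any(v < 0 for v in z):
            continue  # not a basis, or the basic solution is infeasible
        x = [F(0)] * n
        for j, v in zip(cols, z):
            x[j] = v
        value = sum(cj * xj for cj, xj in zip(c, x))
        if best is None or value > best:
            best, winners = value, [x]
        elif value == best and x not in winners:
            winners.append(x)
    return best, winners

# Assumed model: maximize x1 + x2 with slacks x3, x4, x5 added.
A = [[1, 1, 1, 0, 0],
     [1, 0, 0, 1, 0],
     [0, 1, 0, 0, 1]]
b = [3, 2, 2]
c = [1, 1, 0, 0, 0]

value, solutions = best_basic_solution(A, b, c)
print(value, solutions)
```

Each optimal solution found this way has at most three variables at a positive level, matching the number of constraints.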

► Historical remark: we have used the validity of the simplex method to prove the 
Theorem of the Basis. In the early literature on linear programming, you find the 
result proved independently as a nonconstructive existence theorem. The implication in 
turn is used to motivate the simplex method, which is viewed as a sequential search 
restricted to sets of basic variables. 

4.7 CONVERGENCE PROPERTIES

Establishing convergence is usually the hardest single task in the development of 
an algorithm. You will see why. 

You know by the way Simplex Criteria I and II operate that each successive trial
solution is feasible and the value of the objective function is no worse than at the
previous solution. Suppose, for the moment, that at each iteration the objective
function to be maximized actually increases. This information alone is not enough
to establish convergence in a finite number of iterations. The objective function
may improve by smaller and smaller amounts, as you saw in Case 2 of Fig. 4.1
(p. 101). What is worse, the value of the objective function may not be approaching
the optimum, as you saw occur in Case 3 of Fig. 4.1.

But you can make use of more information about the algorithm. The simplex 
method proceeds from one basic solution to another. Since the values for any set 
of basic variables are uniquely determined, it is not possible to return to any 
previous basic solution, given the assumption that the objective function improves 
at each trial. There are only a finite number of basis sets, certainly no more than
the combination of n things taken m at a time, $\binom{n}{m} = n!/[m!\,(n - m)!]$, for an
m-equation and n-variable problem. Therefore only a finite number of iterations
can occur, and we already have argued that when termination takes place, the 
final solution is optimal. 
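The bound on the number of basis sets is finite but grows extremely fast, which is why it guarantees termination without saying anything useful about speed. A short sketch using the standard library's `math.comb`; the helper name `basis_bound` and the two sample sizes are illustrative choices.

```python
from math import comb

def basis_bound(n, m):
    """Upper bound on the number of distinct basis sets: n! / [m! (n - m)!]."""
    return comb(n, m)

# A 3-equation, 7-variable problem has at most 35 basis sets.
print(basis_bound(7, 3))

# A 50-equation, 300-variable problem: count the digits of the bound.
print(len(str(basis_bound(300, 50))))
```

In practice the simplex method visits only a tiny fraction of these sets, as the empirical iteration counts quoted later in this section indicate.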

Now has convergence been established? Unfortunately, no. The argument in 
the preceding paragraphs assumed that the objective function increases at each 
iteration. However, in many linear optimization problems, the objective function 
actually stalls for several iterations. This occurs in the following way. 

At some iteration, when Criterion II is applied the minimum ratio is zero. (Such
an event can arise when, on the previous iteration, more than one variable in the
basis is driven to zero as the new variable enters. Thus at the present iteration,
one or more basic variables are at zero level, so that the basic solution is degenerate.)
Therefore the new variable enters the basis at zero level, and all the remaining
variables as well as the objective function remain unchanged. The change-of-basis
calculation in Step 4 does cause the Row 0 coefficients to be modified, and hopefully
at the next iteration the new variable selected leads to a minimum ratio that is
strictly positive. But it is conceivable that the objective function stalls again. This
condition could persist for several iterations, until eventually a previous basis
might reappear! Then the cycle would start all over again, and convergence would
not occur.

To sum up, we almost have a finite convergence proof, and would have one 
if we could force the objective function to make a strict increase at each iteration. 
To put your mind at ease, we hasten to mention that a finite convergence proof 
can be devised. Using slightly more advanced mathematical analysis than is 
employed in this book, it is possible to demonstrate that the simplex method can 
be made to cause a strict increase in the objective function. In order to provide the
proof, Criterion II must be revised slightly to eliminate the ambiguity over which
variable to remove from the basis when a tie occurs, and the notion of a “strict
increase in the objective function” has to be suitably defined.

In practice, degeneracy has not caused the number of iterations required until
convergence to be a real problem. Considerable empirical evidence suggests that
most actual applications are solved within the range of 1.5m to 3m iterations, where
m is the number of constraints and the starting basis consists only of slack,
surplus, and artificial variables.

► In the example of Sec. 4.4, you saw that a variable, namely $x_4$, entered and later
left the basis. It is this phenomenon that causes the number of iterations to exceed the
number of equations. The following example will startle your intuition about what can
happen in the course of iterations:

(i)  maximize $1y$

subject to

        $3x - 2y + 4z + 1u = -2$
(ii)    $3x - 8z + 1v = 6$
        $15x + 6y - 12z + 1w = 222$
        $1x + 4z + 1t = 12$

        $x \ge 0 \quad y \ge 0 \quad z \ge 0 \quad u \ge 0 \quad v \ge 0 \quad w \ge 0 \quad t \ge 0.$

Figure 4.8 shows the succession of trial solutions. Observe that in proceeding from the
initial to the second solution, variable z replaces the variable t. Then in going from the
fourth to the fifth solution, t replaces z. Finally, in progressing from the sixth to the
seventh solution, z replaces t again. The variable z starts outside the basis, later enters,
then leaves, and finally enters again. The variable t starts in the basis, leaves, enters,
and then leaves again. Notice also that variable x replaces v at the third solution, and
later v replaces x at the sixth solution. All in all, the simplex method requires examining
seven solutions to obtain convergence. (Note that the objective function, $1y$, does improve
at each iteration.)

Figure 4.8. An Example of Slow Convergence.

When we tried to establish finite convergence of the simplex method, recall we only
stated that cycling is conceivable. We did not demonstrate that an example exists where
cycling does occur if an arbitrary rule is used to break ties when degeneracy arises. Here
is such an illustration. (It was discovered by sheer ingenuity.)

The example assumes the following arbitrary tie-breaking rule for Criterion II: If two
variables are candidates to leave the basis, the variable with the smaller subscript is
chosen. The model is

(iii)  maximize $\tfrac{3}{4}x_1 - 150x_2 + \tfrac{1}{50}x_3 - 6x_4$

subject to

        $\tfrac{1}{4}x_1 - 60x_2 - \tfrac{1}{25}x_3 + 9x_4 + x_5 = 0$
(iv)    $\tfrac{1}{2}x_1 - 90x_2 - \tfrac{1}{50}x_3 + 3x_4 + x_6 = 0$
        $1x_3 + x_7 = 1$

        $x_j \ge 0$ for $j = 1, 2, \ldots, 7$.

The sequence of trial solutions is shown in Fig. 4.9. At each iteration, there is no
ambiguity in the application of Criterion I. Thus, for the initial solution, the variable $x_1$
is chosen to enter. Because of degeneracy, either variable $x_5$ or $x_6$ can be dropped from
the basis. The tie-breaking rule for Criterion II as enunciated above selects $x_5$. Observe
that the seventh solution is identical with the initial solution, and consequently cycling
ensues. Also note that the objective function is stalled at the value 0, whereas the optimal
value of the objective function is $\tfrac{1}{20}$. The cycling is eliminated as soon as a mathematically
sound tie-breaking rule is used.
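The cycle can be reproduced mechanically. The following is a sketch in exact rational arithmetic, taking the model to be maximize $\tfrac{3}{4}x_1 - 150x_2 + \tfrac{1}{50}x_3 - 6x_4$ with the three constraints of (iv) as read here from a damaged original (so the coefficients are an assumption), Criterion I choosing the most negative Row 0 coefficient, and Criterion II breaking ties by the smaller subscript. The function names are hypothetical.

```python
from fractions import Fraction as F

def initial_tableau():
    """Rows 0-3 of the model; columns are x1..x7, then the right-hand side."""
    rows = [
        ["-3/4", "150", "-1/50", "6", "0", "0", "0", "0"],  # Row 0
        ["1/4", "-60", "-1/25", "9", "1", "0", "0", "0"],   # Row 1
        ["1/2", "-90", "-1/50", "3", "0", "1", "0", "0"],   # Row 2
        ["0", "0", "1", "0", "0", "0", "1", "1"],           # Row 3
    ]
    return [[F(v) for v in row] for row in rows], [4, 5, 6]  # basis x5, x6, x7

def iterate(T, basis):
    """One simplex iteration in place; returns the entering column, or None if optimal."""
    n = len(T[0]) - 1
    col = min(range(n), key=lambda j: (T[0][j], j))  # Criterion I
    if T[0][col] >= 0:
        return None
    rows = [i for i in range(1, len(T)) if T[i][col] > 0]
    if not rows:
        raise ValueError("unbounded optimal solution")
    # Criterion II; ties go to the basic variable with the smaller subscript.
    row = min(rows, key=lambda i: (T[i][-1] / T[i][col], basis[i - 1]))
    pivot = T[row][col]
    T[row] = [v / pivot for v in T[row]]
    for i in range(len(T)):
        if i != row and T[i][col] != 0:
            factor = T[i][col]
            T[i] = [a - factor * p for a, p in zip(T[i], T[row])]
    basis[row - 1] = col
    return col

def pivots_until_repeat(limit=50):
    """Count pivots until the starting tableau reappears; None if it never does."""
    T, basis = initial_tableau()
    start_T, start_basis = [r[:] for r in T], basis[:]
    entered = []
    for count in range(1, limit + 1):
        col = iterate(T, basis)
        if col is None:
            return None, entered
        entered.append(col + 1)  # report subscripts starting at 1
        if T == start_T and basis == start_basis:
            return count, entered
    return None, entered

print(pivots_until_repeat())  # (6, [1, 2, 3, 4, 5, 6])
```

Six changes of basis bring back the starting tableau, so the seventh trial solution coincides with the first, and the right-hand side of Row 0 never leaves zero along the way.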

*4.8 COMPUTATIONAL REQUIREMENTS 

Most real applications of linear programming involve at least several dozen 
and quite often several hundred constraints. Therefore the computational burden 
of the simplex method must be discussed in the context of employing a high-speed 
electronic calculator. In the foregoing sections, we explained the fundamental 
steps of the algorithm. Any computer program of the simplex method achieves 
these steps in one way or another. However, there are variations in performing the 
steps within a computer. These secondary modifications can have a decided impact 
on the number of iterations required for convergence. 

Some computer programs are coded to act like chess players: they look ahead 
several iterations and decide on an appropriate strategy for the next few basis 
changes. This device increases the amount of calculation required at any one 
step, but reduces the total number of iterations for convergence. In other computer 
codes, the search in Criterion I is restricted to only a subset of variables at any one 
iteration. When no further improvement is possible with this subset, a new subset 
is examined. Termination occurs when improvement is impossible for every 
subset. This approach reduces the computation involved at each iteration, but 
may increase the number of iterations for convergence. 

Consequently, the total number of calculations required by the simplex method
to solve a particular problem usually depends heavily on the computer program
employed. For practical applications, several million arithmetic operations are
usual. Considerable experience suggests that, as a rough approximation, the computational
burden increases as the cube of the number of constraints. Thus a
200-equation problem is likely to require $(200/100)^3 = 8$ times as many calculations as a
comparable 100-equation model.

Realize that improvements in both electronic calculators and computer programs
for linear optimization have now made it quite feasible to solve problems
with hundreds of constraints. The newest advances in technology are opening up
the realm of possibility to models with more than 1000 constraints.

If you have had any experience in manually solving small systems of simultaneous
linear equations, say five equations in five unknowns, you know that round-off
error can build rapidly. In fact, if you keep only two or three significant digits
in your computations, the answer may be too inaccurate to use. For systems of
equations as large as those to be found in linear programming applications, the
danger of accuracy loss is severe. For that reason, most computer programs for the
simplex method have built-in devices for controlling round-off error. Even so,
trouble does arise on occasion; so when you solve a linear optimization model,
watch out for this possibility and check your answer in the original constraints.

Because computer codes keep only a limited number of significant figures in
the calculations, some round-off error is inevitable. Therefore, it is necessary to
allow for such inaccuracies in performing the tests in Criteria I and II. To illustrate,
a computer code may terminate the algorithm when all Row 0 coefficients
exceed $-.00001$.
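A tolerance of this kind is a one-line change to each test. The sketch below is illustrative only: the value `1e-5` mirrors the figure in the text, while the function names and the idea of applying the same tolerance to the Criterion II coefficients are assumptions, since actual codes choose their own devices.

```python
TOLERANCE = 1e-5  # the .00001 of the text; real codes pick their own value

def is_optimal(row0, tol=TOLERANCE):
    """Criterion I (Maximization) stopping test with a round-off allowance."""
    return all(coefficient > -tol for coefficient in row0)

def eligible_rows(column, tol=TOLERANCE):
    """Rows usable in Criterion II: the coefficient must be clearly positive."""
    return [i for i, a in enumerate(column) if a > tol]

# A coefficient that should be exactly zero but carries round-off residue.
residue = -1e-12  # stands in for accumulated round-off error
row0 = [residue, 2.0, 0.5]

print(all(v >= 0 for v in row0))  # False: an exact test would keep iterating
print(is_optimal(row0))           # True: the tolerant test stops
print(eligible_rows([1e-13, 0.25, -3.0]))  # [1]
```

Without the allowance, the code would try to bring in a variable whose Row 0 coefficient is negative only because of arithmetic noise.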

*4.9 TABULAR REPRESENTATION 

In proceeding from one trial solution to the next, it is both cumbersome and
unnecessary to write $x_1, x_2, \ldots$, as we have throughout this chapter. Only the
coefficients of the variables are needed in the computations. Once you have understood
the straightforward logic of the simplex iterations, you can save yourself
considerable writing effort by organizing the computations in a convenient tabular
form called a simplex tableau.

Figures 4.10 and 4.11 depict two approaches for the example in Sec. 4.4.

Figure 4.10. Simplex Tableau for Product-Mix Selection Problem.

    Iteration   Basis   Current Values    x1    x2    x3    x4    Row
    1           x0        0               -4    -5    -9   -11    0
                x5       15                1     1     1     1    1
                x6      120                7     5     3     2    2
                x7      100                3     5    10    15    3

Figure 4.11. Condensed Simplex Tableau for Product-Mix Selection Problem.

The columns for the basic variables in Fig. 4.10 are so simple that they could
be eliminated with no danger of ambiguity arising. The resulting condensed
version is given in Fig. 4.11; this smaller table requires changing one column
heading for the new nonbasic variable at each iteration.


*4.10 MATRIX REPRESENTATION 


In this chapter and the next, the simplex method and its variants are explained 
by arraying the full equations. Using matrix notation instead is a helpful device 
for exhibiting the mathematics in a condensed form. 

The matrix statement of the problem is

(1)  maximize $cx$

subject to

(2)  $Ax \le b$

(3)  $x \ge 0$.

When converted to a tabular representation such as Fig. 4.10, the problem data
are viewed as

(4)  $\begin{bmatrix} 0 & -c_1 & -c_2 & \cdots & -c_n & 0 & 0 & \cdots & 0 \\ b & A_1 & A_2 & \cdots & A_n & U_1 & U_2 & \cdots & U_m \end{bmatrix},$

where $A_j$ is the jth column of $A$ and $U_i$ is a column with a 1 in Row i and zeros
elsewhere. [As always, the top row of (4) is referred to as Row 0.] A further condensation is

(5)  $\begin{bmatrix} 0_{11} & -c & 0_{1m} \\ b & A & I \end{bmatrix},$

where $0_{pq}$ is a matrix of zeros with p rows and q columns, and $I$ is an m-dimensional
identity matrix.

At any iteration, let $B$ represent the m columns of (5) associated with the
current set of basic variables, and let $c_B$ be the corresponding objective-function
coefficients. The first column corresponds to the basic variable
in Row 1 of that iteration, the second column to the basic variable in Row 2,
and so forth. Then the simplex tableau for that iteration is

(6)  $\begin{bmatrix} c_B B^{-1} b & c_B B^{-1} A - c & c_B B^{-1} \\ B^{-1} b & B^{-1} A & B^{-1} \end{bmatrix}.$

Thus the current tableau is simply (5) multiplied on the left by

(7)  $\begin{bmatrix} 1 & c_B B^{-1} \\ 0_{m1} & B^{-1} \end{bmatrix} = \begin{bmatrix} 1 & -c_B \\ 0_{m1} & B \end{bmatrix}^{-1}.$

Simplex Criterion I (Maximization) searches for the most negative entry in
$[\,c_B B^{-1} A - c \quad c_B B^{-1}\,]$, the term that represents Row 0 of (6). Suppose this entry
occurs for variable $x_j$. Then Simplex Criterion II makes use of the coefficients of $x_j$
in (6), which are

(8)  $\begin{bmatrix} c_B B^{-1} A_j - c_j \\ B^{-1} A_j \end{bmatrix} \equiv \begin{bmatrix} r_{0j} \\ \vdots \\ r_{mj} \end{bmatrix}.$

Note that if $x_j$ is the ith slack variable, then $c_j = 0$ and $A_j = U_i$ in (8). Assume
that the minimum ratio found by Criterion II occurs in Row k.

The ensuing change-of-basis calculation amounts to multiplying (6) on the left
by

(9)  $E = \begin{bmatrix} 1 & 0 & \cdots & -r_{0j}/r_{kj} & \cdots & 0 \\ 0 & 1 & \cdots & -r_{1j}/r_{kj} & \cdots & 0 \\ \vdots & \vdots & & \vdots & & \vdots \\ 0 & 0 & \cdots & 1/r_{kj} & \cdots & 0 \\ \vdots & \vdots & & \vdots & & \vdots \\ 0 & 0 & \cdots & -r_{mj}/r_{kj} & \cdots & 1 \end{bmatrix},$

where the ratios are placed in the (k + 1)st column of $E$. Multiplication by $E$ is
often called an elementary transformation.

Notice that the values $B^{-1} b$ for the current basis are optimal when

(10)  $c_B B^{-1} A - c \ge 0 \qquad c_B B^{-1} \ge 0.$

The corresponding value of the objective function is

(11)  $c_B B^{-1} b.$
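An elementary transformation can be tried out on a tableau small enough to check by hand. The sketch below builds the matrix $E$ for a chosen pivot and multiplies it into the starting tableau of the maximization example (4) in Sec. 4.6. It assumes the column of current values is written first, followed by the columns for $x_1$ through $x_4$; the function names are hypothetical, and exact fractions are used to avoid round-off.

```python
from fractions import Fraction as F

def elementary_matrix(r, k):
    """Identity matrix whose column k holds the pivot ratios built from r.

    r[i] is the entering variable's coefficient in Row i (Row 0 included)
    and k is the pivot row, so column k is the (k + 1)st column.
    """
    size = len(r)
    E = [[F(int(i == j)) for j in range(size)] for i in range(size)]
    for i in range(size):
        E[i][k] = F(1) / r[k] if i == k else -F(r[i]) / r[k]
    return E

def matmul(X, Y):
    """Plain matrix product of two lists of rows."""
    return [[sum(X[i][t] * Y[t][j] for t in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

# Tableau for (4) of Sec. 4.6: current values, then x1, x2, x3, x4.
tableau = [[0, -2, 3, 0, 0],    # Row 0
           [6, 1, 0, 1, 0],     # Row 1
           [10, 0, 1, 0, 1]]    # Row 2

entering = 1                      # column of x1
r = [row[entering] for row in tableau]
E = elementary_matrix(r, k=1)     # the minimum ratio occurs in Row 1

new_tableau = matmul(E, tableau)
for row in new_tableau:
    print([str(v) for v in row])
```

After the multiplication the column of $x_1$ is a unit column with its 1 in Row 1, and the first entry of Row 0 shows the objective function at 12.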


REVIEW EXERCISES 

1 (a) Solve for Xy and x^ in (2) and (3) of Sec. 4.2, and verify the result in (14) and (15). 

(b) Substitute in the objective function the expressions for Xj and x^ from part (a) 

and verify the result in (16) of Sec. 4.4. 

Exercises 2, 3, and 4 refer to the numerical example in Sec. 4.4 with certain 
specified changes in the data. 

2 What variable do you enter into the basis at iteration 1 of the simplex method if the 
objective function to be maximized is 

(a) $14x_1 + 5x_2 + 9x_3 + 11x_4$?

(b) 4xi + 5x2 + 9x3 -f- Sx4? 

(c) $4x_1 + 5x_2 - 9x_3 + 11x_4$?

(d) $-4x_1 - 5x_2 - 9x_3 - 11x_4$?

3 Given that you enter variable $x_4$ into the basis at iteration 1 of the simplex method,
what variable do you remove from the basis and what will the new value of $x_4$ be if

(a) The coefficient on the right-hand side of Row 2 is 20? Is 12?

(b) The coefficient on the right-hand side of Row 3 is 300? Is 75?

(c) The coefficient of $x_4$ in Row 3 is 20? Is 24? Is $-15$?

(d) The coefficient of $x_4$ in Row 2 is 24?

(e) The coefficients on the right-hand sides of Rows 1, 2, and 3 are 40, 60, and 20,
respectively, and the coefficients of $x_4$ in these rows are $-5$, 10, and 2, respectively?

4 Assume that you enter variable $x_4$ into the basis at iteration 1. What are the new
values of the basic variables if

(a) You remove variable $x_5$ instead of $x_7$?

(b) You remove variable $x_6$ instead of $x_7$?

(c) Are the values in parts (a) and (b) feasible? Explain.

Exercises 5 through 14 refer to the numerical example in Sec. 4.4. In each
part of an exercise, certain coefficients are altered. Find an optimal solution and
the associated value of the objective function. Think carefully about each problem,
because you may find that it is not necessary to redo all the arithmetic. It
may be possible to “salvage” many of the previous computations in Sec. 4.4.

5 Assume that the objective function to be maximized is 

(a) 4 xi -f + 9x3 -f- 10x4. 

(b) $5x_1 + 5x_2 + 9x_3 + 11x_4$.

(c) 5xi -f- ^xj -I- 9 x 3 + 10 x 4 . 

6 Assume that the coefficient on the right-hand side of 

(a) Row 2 is 140. 

(b) Row 2 is 60. 

(c) Row 1 is 12. 

(d) Row 1 is 10. 

(e) Row 1 is 20. 

7 Assume that the objective function to be maximized is $5x_1 + 5x_2 + 9x_3 + 11x_4$ and
the coefficient on the right-hand side of Row 1 is 20.

8 Assume that the first constraint, prior to adding a slack variable, is stated as

$2x_1 + 2x_2 + 2x_3 + 2x_4 \le 30$.

9 Assume that the coefficient of

(a) $x_2$ in Row 3 is 10.

(b) $x_4$ in Row 3 is 20.

(c) $x_2$ in Row 2 is 4.

•(d) $x_4$ in Row 2 is 1.

(e) $x_1$ in Row 2 is 10.

(f) $x_3$ in Row 2 is 6.

10 Assume that the coefficients in Rows 0, 1, 2, and 3 of

(a) $x_1$ are 40, 10, 70, and 30, respectively.

(b) $x_1$ are 20, 4, 20, and 20, respectively.

(c) $x_3$ are 90, 10, 30, and 100, respectively.

11 Suppose you require x^ = 4. 

12 (a) Consider the variable $x_8$ defined in (10) of Sec. 4.4. Calculate the coefficients of
$x_8$ at each of the simplex iterations, that is, in (2) through (7) of Sec. 4.4.

(b) Verify that if you enter $x_8$ into the basis found at iteration 4, the solution is given
by (12) of Sec. 4.4.

(c) Let $w = .5$ in (13) of Sec. 4.4 and calculate optimal values for $x_1$, $x_3$, $x_6$, and $x_8$.
Check that these values satisfy the original constraints and yield an objective-function
value of $\frac{695}{7}$.

13 Consider the variable $x_8$ defined in (10) of Sec. 4.4. Determine whether entering $x_8$
into the basis found at iteration 4 improves the solution if the coefficient of $x_8$ in

(a) Row 2 is 8, instead of 9.

(b) Row 2 is 10, instead of 9.

(c) Row 3 is 1, instead of .2.

(d) Row 3 is .1, instead of .2.

14 Suppose another variable z is added to the example in Sec. 4.4, and at iteration 4, 
the coefficients of z in (7) are — 1 in Rows 0 through 4. What is an optimal solution ? 
Exhibit a solution having an objective-function value 100. 

15 Consider the example (7) through (10) in Sec. 4.5. In each part below, apply the
Big M Method, with M = 10, and derive a starting set of equations analogous to
those in (13).

(a) Add the constraint Xg ^ 2. 

(b) Let the first constraint be $5x_1 + 6x_2 = 56$, instead of (8).

(c) Let the first constraint be $-5x_1 + 6x_2 = 16$, instead of (8).

16 For a given set of constraints, suppose an all-slack-variable basis provides a feasible 
initial solution. What variable do you enter into the basis at iteration 1 of the simplex 
method given that the objective function to be minimized is 

(a) $4x_1 + 5x_2 - 9x_3 + 11x_4$.

(b) $-4x_1 + 5x_2 - 9x_3 + 11x_4$.

(c) $-4x_1 - 5x_2 - 9x_3 - 11x_4$.

(d) $4x_1 + 5x_2 + 9x_3 + 11x_4$.

(e) Given your answers in each of the preceding parts and that the constraints of the
problem are Rows 1, 2, and 3 in (1) of Sec. 4.4, state what slack variable drops
from the basis.

17 Explain your understanding of the following terms:

algorithm
Gaussian elimination
algorithmic convergence
basis and basic solution
basic and nonbasic variable
change of basis
pivot operation (or pivoting)
relative (or shadow) costs
shadow prices
positive-weighted average
weighting factor
degenerate basic solution
artificial variable
penalty-cost coefficient
♦cycling
♦simplex tableau
♦elementary transformation.


COMPUTATIONAL EXERCISES 

18 One-Potato, Two-Potato Problem (Sec. 1.6). Apply the simplex method to solve

maximize $5P_1 + 6P_2$

subject to

$.2P_1 + .3P_2 \le 1.8$
$.2P_1 + .1P_2 \le 1.2$
$.3P_1 + .3P_2 \le 2.4$
$P_1 \ge 0$ and $P_2 \ge 0$.

19 Apply the simplex method to solve 

maximize $15x_1 + 6x_2 + 9x_3 + 2x_4$

subject to

$2x_1 + x_2 + 5x_3 + .6x_4 \le 10$
$3x_1 + x_2 + 3x_3 + .25x_4 \le 12$
$7x_1 + x_4 \le 35$

every $x_j \ge 0$.

{Hint: you should find an optimal solution in no more than three iterations.) 

20 (a) Apply the simplex method to solve 

maximize $30x_1 + 23x_2 + 29x_3$

subject to

$6x_1 + 5x_2 + 3x_3 \le 26$
$4x_1 + 2x_2 + 5x_3 \le 7$
every $x_j \ge 0$.

{Hint: you should find an optimal solution in no more than three iterations.) 

(b) Solve part (a), assuming that the two right-hand side coefficients are 31, instead 
of 26, and 20, instead of 7. {Hint: you should find an optimal solution in no more 
than four iterations.) 

21 Apply the simplex method to solve 

maximize $60x_1 + 26x_2 + 15x_3 + 4.75x_4$

subject to

$20x_1 + 9x_2 + 6x_3 + 1x_4 \le 20$
$10x_1 + 4x_2 + 2x_3 + 1x_4 \le 10$
every $x_j \ge 0$.

(Hint: you should find an optimal solution in no more than five iterations.) Does the
objective function increase at every iteration? Is there a unique optimal solution?

22 Alternative Optimal Solutions 

(a) Apply the simplex method to solve 

maximize $1x_1 + 1x_2$

subject to

$1x_1 + 1x_2 \le 3$
$1x_1 \le 2$
$1x_2 \le 2$

$x_1 \ge 0$ and $x_2 \ge 0$.

(b) Alter the coefficients of $x_1$ to be 10, instead of 1, in both the objective function and
constraints. Apply the simplex method to solve.

(c) Alter the coefficients of $x_2$ to be 10, instead of 1, in both the objective function and
constraints. Apply the simplex method to solve.

(d) In each of the above parts, indicate how you know at the final iteration that there
are alternative optimal solutions. Using formulas analogous to those in (13) of
Sec. 4.4, characterize all the optimal solutions.

23 Unbounded Optimal Solution. Consider the constraints

$x_1 - x_2 \le 1$
$-x_1 + x_2 \le 1$
$x_1 \ge 0$ and $x_2 \ge 0$.

Apply the simplex method and assume that the objective function is

(a) Maximize $x_1$.

(b) Maximize $x_2$.

(c) Maximize $x_1 + x_2$.

(d) In each of the above parts, indicate a solution having objective-function value 100.

24 Consider the constraints graphed in exercise 10 of Chap. 3: 

-lOxj - 15x2^-150 
5xi + 10x2 ^ 50 

^ Xj - X 2 ^ 0 

Xi ^ 0 and X 2 > 0. 

(We have eliminated the explicit constraint Xi ^ 2 which was shown in exercise 10 
of Chap. 3.) 

Use the Big M Method, with M = 10, to find an initial basis. Solve for an optimal
solution by the simplex method and assume that the objective function is

(a) Maximize $x_1 + x_2$.        (d) Maximize $-2x_1\ [?]\ x_2$.
(b) Minimize $x_1 + x_2$.        (e) Maximize $-x_1 - 8x_2$.
(c) Maximize $x_1 + 8x_2$.       (f) Maximize $-x_1 - 2x_2$.

(g) State in each of the above parts whether there is a unique optimal solution, and if
there is not, calculate an alternative optimal solution. (Hint: do not add an
artificial variable for the first constraint.)


25 Consider the constraints graphed in exercise 6 of Chap. 8 : 

6 x 1 + 4x2 ^ 24 
Xi ^ 0 and X 2 > 2 . 

Use the Big M Method, with M = 10, to find an initial basis. Obtain an optimal 
solution by the simplex method and assume that the objective function is 

(a) Minimize x,. (d) Maximize —Xi + 2 x 2 . 

(b) Minimize Xi + X 2 . (e) Maximize Xi — 2 x 2 . 

(c) Maximize Xj -f (f) Maximize — 8 x 1 — 2 x 2 . 

(g) State in each of the above parts whether there is a unique optimal solution, and if 
there is not, calculate an alternative optimal solution. 

26 Answer the questions in exercise 25 assuming that Xi ^ 5. 


27 Answer the questions in exercise 25 assuming that X 2 ^ 4. . - 

28 No Feasible Solution, In Sec. 8.4, you saw a simple example that had no feasible solution: 

maximize Xj + X 2 

subject to 


-Xj + X 2 ^ -1 

^1 — ^2 ^ 1 


Xi ^ 0 and X 2 ^ 0. 

Apply the Big M Method to see what happens in this case. (Show that Condition A in
Sec. 4.5 cannot be satisfied, regardless of how large a value you assign M.)


29 Redundancy and Degeneracy, Consider the problem 


subject to 


maximize Xi — X 3 

5xi + 8 x 2 + a:, = 40 


+ ^2 + 4x3 = 10 
every Xj ^ 0 . 

Use the Big M Method to derive an optimal solution when the model is augmented by 
the constraint 

(a) $6x_1 + 4x_2 + 5x_3 = 50$.

(b) $6x_1 + 4x_2 + x_3 = 50$.

CHAPTER 5


Sensitivity Testing
and Duality


5.1 POSTOPTIMALITY ANALYSIS 

In Chap. 4 you studied the mechanics of obtaining an optimal solution to a 
linear programming model. As a result you learned the important considerations 
for understanding the algorithmic approach and acquired an initial insight into 
the mathematical structure of a linear model—discovering, for example, that if 
a linear optimization model has a finite optimal solution, there exists an optimal 
basic solution. 

But recall that the stated goal has been to provide systematic methods by which 
you can fully analyze and comprehend the complex interrelations in a linear 
programming application. We pursue this objective here. 

The experienced user of a linear programming model rarely confines his interest 
to the numerical values of an optimal solution, unless he has already applied the 
model so many times that he fully understands the range of validity of the answer. 
Typically, he wants to know how far the input parameter values can vary without 
causing violent changes in a computed optimal solution or the composition of a 
basis set. Such an investigation is termed a sensitivity or postoptimality 
analysis. 

Many important postoptimality questions are easily answered, given the numerical
information at the final simplex iteration. Three illustrative examples are,
“If the profit contribution of a particular basic activity decreases, does the current 
solution remain optimal? What happens if resource availability is curtailed? 
What happens if a new activity is added?” Other sensitivity questions can be 
sufficiently complex to require an electronic computer for the analysis, but even 
then, most electronic computer codes make considerable use of the previous optimal 
solution, and do not solve the revised model from scratch. 


In the sections to follow, we explore the fundamental ideas that underlie sensitivity
analysis in linear programming. The methods we present relate to the model

(1)  maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(2)  $\sum_{j=1}^{n} a_{ij} x_j \le b_i$  for $i = 1, 2, \ldots, m$

(3)  $x_j \ge 0$  for $j = 1, 2, \ldots, n$.

The mechanics of sensitivity testing are straightforward, and rather than confuse 
the presentation with frightening-looking formulas, we explain the ideas in terms 
of the Product-Mix Selection Example that was solved in Sec. 4.4. We repeat here 
both the initial and final systems of equations, labeling them (I) and (F): 



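       $1x_0 - 4x_1 - 5x_2 - 9x_3 - 11x_4 = 0$                 Row 0
(I)    $1x_1 + 1x_2 + 1x_3 + 1x_4 + 1x_5 = 15$                 Row 1
       $7x_1 + 5x_2 + 3x_3 + 2x_4 + 1x_6 = 120$                Row 2
       $3x_1 + 5x_2 + 10x_3 + 15x_4 + 1x_7 = 100$              Row 3,

and

       $1x_0 + \tfrac{3}{7}x_2 + \tfrac{11}{7}x_4 + \tfrac{13}{7}x_5 + \tfrac{5}{7}x_7 = \tfrac{695}{7}$                 Row 0
(F)    $1x_1 + \tfrac{5}{7}x_2 - \tfrac{5}{7}x_4 + \tfrac{10}{7}x_5 - \tfrac{1}{7}x_7 = \tfrac{50}{7}$                   Row 1
       $-\tfrac{6}{7}x_2 + \tfrac{13}{7}x_4 - \tfrac{61}{7}x_5 + 1x_6 + \tfrac{4}{7}x_7 = \tfrac{325}{7}$               Row 2
       $\tfrac{2}{7}x_2 + 1x_3 + \tfrac{12}{7}x_4 - \tfrac{3}{7}x_5 + \tfrac{1}{7}x_7 = \tfrac{55}{7}$                   Row 3,

where $x_0$ is being maximized.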

We ask you to copy (I) and (F) on a sheet of notepaper to keep before you as 
you read the text below. 
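
If you would like to confirm your copy of (I) and (F), the following sketch rebuilds (F) from (I) by pivoting with exact fractions. It is only an illustration: the helper `pivot` and the variable names are ours, and the two pivots are chosen to reach the final basis ($x_1$, $x_6$, $x_3$) directly rather than to retrace the simplex iterations of Sec. 4.4.

```python
from fractions import Fraction as F

# Initial system (I): columns x1..x7, then the right-hand side.
ROW0 = [-4, -5, -9, -11, 0, 0, 0, 0]
ROWS = [
    [1, 1, 1, 1, 1, 0, 0, 15],
    [7, 5, 3, 2, 0, 1, 0, 120],
    [3, 5, 10, 15, 0, 0, 1, 100],
]

def pivot(tableau, r, c):
    """Gauss-Jordan pivot on tableau[r][c]; returns a new tableau."""
    p = tableau[r][c]
    new_r = [v / p for v in tableau[r]]
    out = []
    for i, row in enumerate(tableau):
        if i == r:
            out.append(new_r)
        else:
            out.append([v - row[c] * w for v, w in zip(row, new_r)])
    return out

tableau = [[F(v) for v in row] for row in [ROW0] + ROWS]
# Make x1 basic in Row 1 and x3 basic in Row 3 (x6 is already basic in Row 2).
tableau = pivot(tableau, 1, 0)   # x1 column, Row 1
tableau = pivot(tableau, 3, 2)   # x3 column, Row 3

for name, row in zip(["Row 0", "Row 1", "Row 2", "Row 3"], tableau):
    print(name, [str(v) for v in row])
```

The printed rows should agree, coefficient by coefficient, with (F).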

5.2 OBJECTIVE FUNCTION 

First we investigate whether a previously optimal feasible solution remains 
optimal if the coefficients in the objective function are altered. Such alterations do 
not affect the feasibility of the previous solution; therefore, whenever an improved 
solution is possible, you need only reinitiate the simplex method computations, 
starting at the previous final iteration. 

Consider the coefficients of the nonbasic variables $x_2$ and $x_4$ in Row 0 of (I).
It is intuitive that if they are made less profitable, the current solution remains
optimal. If you sufficiently increased their profitability, however, you could
eventually improve on the current solution. At what level of unit profit for $x_2$
does the present solution become nonoptimal?

Suppose the unit-profit coefficient of $x_2$ is $(5 + \delta)$, where $\delta$ (delta) is nonnegative.
Then Row 0 in (I) would be written:

(1)  $1x_0 - 4x_1 - (5 + \delta)x_2 - 9x_3 - 11x_4 = 0$   Row 0.

In performing each simplex iteration, you added a multiple of a row to Row 0.
Consequently at the final iteration, Row 0 of (F) must be

(2)  $1x_0 + (\tfrac{3}{7} - \delta)x_2 + \tfrac{11}{7}x_4 + \tfrac{13}{7}x_5 + \tfrac{5}{7}x_7 = \tfrac{695}{7}$   Row 0,

as you should verify. Thus, subtracting $\delta x_2$ in (1) causes $\delta x_2$ to be subtracted in all
subsequent Row 0 equations.

You can see that if $\delta$ is greater than $\tfrac{3}{7}$, the coefficient of $x_2$ in (2) is negative.
Then $x_2$ would enter the next basic solution, according to Simplex Criterion I
(Maximization). Similarly, if the coefficient of $x_4$ were increased by more than $\tfrac{11}{7}$,
the current basic solution would no longer be optimal.

Thus, the final Row 0 coefficients of the nonbasic variables represent the largest
positive increments to the original objective function coefficients that leave the
current solution optimal.

Suppose you substantially reduce the profitability of $x_1$ or $x_3$, which are in the
basis. It is plausible that the current basic solution might not remain optimal.
It is not so intuitive that if you increased one of these unit-profit coefficients, the
current solution could become nonoptimal. Within what range can the profitability
of $x_1$ vary before the present solution can be improved?

To answer this question, you proceed as before. Row 0 in (I) becomes

(3)  $1x_0 - (4 + \delta)x_1 - 5x_2 - 9x_3 - 11x_4 = 0$   Row 0,

and in (F) correspondingly becomes

(4)  $1x_0 - \delta x_1 + \tfrac{3}{7}x_2 + \tfrac{11}{7}x_4 + \tfrac{13}{7}x_5 + \tfrac{5}{7}x_7 = \tfrac{695}{7}$   Row 0.

To draw any conclusions about a critical range for $\delta$, you first must recreate a
coefficient equal to 0 for $x_1$ in Row 0. This is achieved in the usual fashion. Multiply
Row 1 of (F) by $\delta$ and add it to (4) to give

(5)  $1x_0 + (\tfrac{3}{7} + \tfrac{5}{7}\delta)x_2 + (\tfrac{11}{7} - \tfrac{5}{7}\delta)x_4 + (\tfrac{13}{7} + \tfrac{10}{7}\delta)x_5 + (\tfrac{5}{7} - \tfrac{1}{7}\delta)x_7 = \tfrac{695}{7} + \tfrac{50}{7}\delta$   Row 0.

Look carefully at (5), and check that for

(6)  $-\tfrac{3}{5} \le \delta \le \tfrac{11}{5}$

the current solution remains optimal. If $\delta$ is less than $-\tfrac{3}{5}$, the coefficient of $x_2$
becomes negative. If $\delta$ exceeds $\tfrac{11}{5}$, the coefficient of $x_4$ becomes negative. Consequently,
as soon as $\delta$ falls outside the range in (6), the present basis is no longer
optimal.
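
The ranging argument for a basic variable can be mechanized in a few lines. The sketch below, in which the function `basic_cost_range` and the dictionary layout are our own illustrative choices, takes the final Row 0 coefficients of the nonbasic variables together with the Row 1 coefficients of (F) and returns the interval of $\delta$ for which every coefficient in Row 0 stays nonnegative.

```python
from fractions import Fraction as F

# Final Row 0 and Row 1 coefficients of the nonbasic variables, read from (F).
row0 = {"x2": F(3, 7), "x4": F(11, 7), "x5": F(13, 7), "x7": F(5, 7)}
row1 = {"x2": F(5, 7), "x4": F(-5, 7), "x5": F(10, 7), "x7": F(-1, 7)}

def basic_cost_range(row0, basic_row):
    """Range of delta keeping every row0[j] + delta * basic_row[j] >= 0."""
    lo, hi = None, None          # None means unbounded in that direction
    for j in row0:
        a = basic_row[j]
        if a > 0:                # positive multiplier: lower bound on delta
            bound = -row0[j] / a
            lo = bound if lo is None else max(lo, bound)
        elif a < 0:              # negative multiplier: upper bound on delta
            bound = -row0[j] / a
            hi = bound if hi is None else min(hi, bound)
    return lo, hi

lo, hi = basic_cost_range(row0, row1)
print(lo, hi)   # -3/5 11/5
```

The binding bounds come from $x_2$ and $x_4$; the looser bounds from $x_5$ and $x_7$ never come into play.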

► So far, the sensitivity analysis has been explained for a change in the profit coefficient
of a single basic or nonbasic variable. The same approach is applicable for changes in
several profit coefficients of the basic variables. However, the expression corresponding
to (5) will have several change-quantities $\delta_j$ in each coefficient modifying the nonbasic
variables. All of these expressions must remain nonnegative. Thus instead of a single
pair of inequalities such as (6), there will be a system of inequalities, one corresponding
to each coefficient of a nonbasic variable.

Many computer programs of the simplex method automatically provide ranging
analysis for the objective function coefficients. A few routines also can perform more
extensive sensitivity analysis using devices known as multiple objective functions
and parametric programming.

The option of multiple objective functions permits solving a succession of linear
programming problems, each with a different criterion function. The computer code
optimizes on the first function, then goes on to each other function, one by one, employing
the previously optimal basis as an initial solution.

In a parametric programming analysis, the model has as its objective function the
expression

(i)  $\sum_{j=1}^{n} (c_j + \delta c_j^*) x_j$,

where $\delta$ is viewed as a parameter to be varied over a specified range. In the simplest
case, all the $c_j^*$ equal 0, except one which equals 1, and the analysis is merely an extensive
ranging study. The device of parametric programming can also be used to test whether
a specific alteration in the objective function coefficients leads to the same optimal solution.
To do this, $c_j^*$ is defined as the increment to $c_j$ corresponding to the specified alternative
coefficient. Then a test is made to see whether the current basic solution remains
optimal for $\delta = 1$. ◄


5.3 RIGHT-HAND-SIDE CONSTANTS 

Now we investigate whether a previously optimal basis remains feasible if a 
constant on the right-hand side is altered. When the basis remains feasible, the new 
solution is optimal, because the coefficients in Row 0 are unchanged. 

Consider the constant on the right-hand side of Row 2 of (I). Suppose you
change it to $(120 + 1\delta)$. Observe in (F) that the slack $x_6$ for this row is in the final
basis, and so will change by the amount $\delta$. Give a verbal argument for this
conclusion. Consequently from Row 2 of (F), the present solution remains feasible
so long as $\delta \ge -\tfrac{325}{7}$. (What is the value of $x_6$ if $\delta = -\tfrac{325}{7}$?)

Next consider the constant on the right-hand side of Row 1 of (I). Suppose you
alter it to $(15 + 1\delta)$. Over what range of $\delta$ does the current solution remain feasible?

You can determine the answer if, while performing the simplex arithmetic,
you carry along the quantity $1\delta$ on the right-hand side of the equations. Notice,
however, that row for row the quantity $1\delta$ added on the right of (I) is just like
the quantity $1x_5$ already appearing on the left of (I). It follows that at any iteration,
the coefficients of $\delta$ will be the same row by row as the coefficients of $x_5$.
Hence at the final iteration you must have

       $1x_0 + \tfrac{3}{7}x_2 + \tfrac{11}{7}x_4 + \tfrac{13}{7}x_5 + \tfrac{5}{7}x_7 = \tfrac{695}{7} + \tfrac{13}{7}\delta$             Row 0
(1)    $1x_1 + \tfrac{5}{7}x_2 - \tfrac{5}{7}x_4 + \tfrac{10}{7}x_5 - \tfrac{1}{7}x_7 = \tfrac{50}{7} + \tfrac{10}{7}\delta$               Row 1
       $-\tfrac{6}{7}x_2 + \tfrac{13}{7}x_4 - \tfrac{61}{7}x_5 + 1x_6 + \tfrac{4}{7}x_7 = \tfrac{325}{7} - \tfrac{61}{7}\delta$           Row 2
       $\tfrac{2}{7}x_2 + 1x_3 + \tfrac{12}{7}x_4 - \tfrac{3}{7}x_5 + \tfrac{1}{7}x_7 = \tfrac{55}{7} - \tfrac{3}{7}\delta$                Row 3.

As always, setting the nonbasic variables $x_2$, $x_4$, $x_5$, and $x_7$ to zero yields the values
for the basic variables, which are now expressed in terms of $\delta$. For feasibility, all
the constants on the right-hand side of (1) must be nonnegative; therefore, the
current solution remains feasible for

(2)  $-\tfrac{50}{10} \le \delta \le \tfrac{325}{61}$,

as you should verify. If $\delta < -\tfrac{50}{10}$, the basic variable $x_1$ becomes negative, and
if $\delta > \tfrac{325}{61}$, the basic variable $x_6$ becomes negative.

Suppose $\delta = 1$, which can be interpreted as augmenting the scarce resource
in Row 1 by one unit. Then the value of the objective function increases by $\tfrac{13}{7}$,
as you can see in Row 0 of (1), and all the basic variables remain nonnegative.
In other words, $\tfrac{13}{7}$ is the incremental value of another unit of this resource at an
optimal solution.
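
The same bookkeeping can be done by machine. The sketch below is illustrative only (the name `rhs_range` is ours): it takes the right-hand sides of Rows 1 to 3 in (F) and the $x_5$ column of (F), which multiplies $\delta$, and returns the interval over which all three basic variables stay nonnegative.

```python
from fractions import Fraction as F

# Right-hand sides of Rows 1-3 in (F) and the x5 column of (F), which
# multiplies delta when the Row 1 constant of (I) becomes 15 + delta.
rhs = [F(50, 7), F(325, 7), F(55, 7)]
x5_col = [F(10, 7), F(-61, 7), F(-3, 7)]

def rhs_range(rhs, col):
    """Range of delta keeping every rhs[i] + delta * col[i] >= 0."""
    lo, hi = None, None          # None means unbounded in that direction
    for b, a in zip(rhs, col):
        if a > 0:                # this row limits how far delta can fall
            lo = -b / a if lo is None else max(lo, -b / a)
        elif a < 0:              # this row limits how far delta can rise
            hi = -b / a if hi is None else min(hi, -b / a)
    return lo, hi

lo, hi = rhs_range(rhs, x5_col)
print(lo, hi)   # -5 325/61
```

Row 3 would allow $\delta$ to rise as far as $\tfrac{55}{3}$, but Row 2 becomes binding first.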


►The approach in (1) generalizes to simultaneous changes in several of the right-hand-side
coefficients. Suppose the Row 1 constant is $(15 + 1\delta_1)$, the Row 2 constant is
$(120 + 1\delta_2)$, and the Row 3 constant is $(100 + 1\delta_3)$. Then, analogously to (1), there is
a column for each $\delta_i$, for $i = 1, 2, 3$, on the right-hand side of (F). After setting the
nonbasic variables to zero, the system appears as

       $x_0 = \tfrac{695}{7} + \tfrac{13}{7}\delta_1 + \tfrac{5}{7}\delta_3$
(i)    $x_1 = \tfrac{50}{7} + \tfrac{10}{7}\delta_1 - \tfrac{1}{7}\delta_3$
       $x_6 = \tfrac{325}{7} - \tfrac{61}{7}\delta_1 + 1\delta_2 + \tfrac{4}{7}\delta_3$
       $x_3 = \tfrac{55}{7} - \tfrac{3}{7}\delta_1 + \tfrac{1}{7}\delta_3$.

The coefficients of each $\delta_i$ are the same as the coefficients of the corresponding slack
variable in (F). The expressions for $x_1$, $x_6$, and $x_3$ in (i) must be nonnegative to preserve
feasibility, and consequently, there is a set of three inequalities to be satisfied by $\delta_1$, $\delta_2$,
and $\delta_3$.
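
A short sketch of this test follows. The constants are the slack-variable columns and right-hand sides of (F); the function names `perturbed_solution` and `stays_feasible` are hypothetical helpers introduced here for illustration.

```python
from fractions import Fraction as F

# Columns of x5, x6, x7 in Rows 1-3 of (F); row order is (x1, x6, x3).
B_INV = [
    [F(10, 7), F(0), F(-1, 7)],
    [F(-61, 7), F(1), F(4, 7)],
    [F(-3, 7), F(0), F(1, 7)],
]
BASE = [F(50, 7), F(325, 7), F(55, 7)]   # right-hand sides of Rows 1-3 in (F)
DUALS = [F(13, 7), F(0), F(5, 7)]        # Row 0 coefficients of x5, x6, x7

def perturbed_solution(deltas):
    """Return (x0, [x1, x6, x3]) after adding deltas to the three constants."""
    basics = [b + sum(r * d for r, d in zip(row, deltas))
              for b, row in zip(BASE, B_INV)]
    x0 = F(695, 7) + sum(y * d for y, d in zip(DUALS, deltas))
    return x0, basics

def stays_feasible(deltas):
    """True if the current basis remains feasible for these changes."""
    return all(v >= 0 for v in perturbed_solution(deltas)[1])

print(stays_feasible([6, 0, 0]))    # False: x6 would become negative
print(stays_feasible([6, 10, 0]))   # True: the extra Row 2 supply restores x6
```

The second call shows why the three inequalities must be examined jointly: a change that is infeasible on its own can become acceptable when combined with another.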

Many computer routines of the simplex method automatically perform ranging
analysis on the right-hand-side constants. Several programs also permit obtaining a
succession of solutions for multiple right-hand sides, analogous to the approach
available for multiple objective functions. In addition, certain computer codes include
a parametric programming device for tracing the solution to a model having right-hand-side
constants $b_i + \delta b_i^*$. In this approach the quantity $\delta$ is viewed as a parameter that
ranges over a specified interval. ◄


5.4 DUALITY 

The techniques you have learned so far in answering questions about the sensitivity
of an optimal solution may have seemed logical but ad hoc. In the same
way, we could continue to develop methods for testing the sensitivity to the technological
coefficients $a_{ij}$, for deciding whether it would be beneficial to enter into
the basis a newly added variable, and for altering the solution when new constraints
are imposed.

But there is a unifying concept, namely, duality, that establishes the interconnections
for all of the sensitivity analysis techniques. If you are like most
beginning students, you will first think of duality as being an abstract idea and,
consequently, somewhat puzzling and remote. The reaction of most beginners
when they read the Dual Theorem below is, “That’s nice. So what?” At this point
in your understanding, the only reply we can give is, “Keep the faith, baby.”
If ever the maxim applies that “there is nothing more practical than a good
theory,” this is it. We would be less than honest if we did not admit that many
effective practitioners of linear programming have managed to avoid understanding
duality. The most successful, however, know it cold.

We begin with a statement of the duality relationships and give a few immediate
implications. Then in the subsequent sections we show how to employ duality for
sensitivity testing.


Primal and dual problems. Consider the pair of linear programming
models:

(1)  maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(2)  $\sum_{j=1}^{n} a_{ij} x_j \le b_i$  for $i = 1, 2, \ldots, m$

(3)  $x_j \ge 0$  for $j = 1, 2, \ldots, n$,

and

(4)  minimize $\sum_{i=1}^{m} b_i y_i$

subject to

(5)  $\sum_{i=1}^{m} a_{ij} y_i \ge c_j$  for $j = 1, 2, \ldots, n$

(6)  $y_i \ge 0$  for $i = 1, 2, \ldots, m$.

For the sake of definiteness, we arbitrarily call (1), (2), and (3) the
primal problem and (4), (5), and (6) its dual problem.

As an illustration, consider the following pair of problems:

(7)  maximize $4x_1 + 5x_2 + 9x_3$

subject to

       $1x_1 + 1x_2 + 2x_3 \le 16$
(8)    $7x_1 + 5x_2 + 3x_3 \le 25$          Primal;
       $x_1 \ge 0$   $x_2 \ge 0$   $x_3 \ge 0$

and

(9)  minimize $16y_1 + 25y_2$

subject to

       $1y_1 + 7y_2 \ge 4$
(10)   $1y_1 + 5y_2 \ge 5$                   Dual.
       $2y_1 + 3y_2 \ge 9$
       $y_1 \ge 0$   $y_2 \ge 0$

Loosely put, the dual problem can be viewed as the primal model flipped on its
side:

(i) The jth column of coefficients in the primal is the same as the jth row of 
coefficients in the dual. 

(ii) The row of coefficients of the primal objective function is the same as the 
column of constants on the right-hand side of the dual. 

(iii) The column of constants on the right-hand side of the primal is the same 
as the row of coefficients of the dual objective function. 

(iv) The direction of the inequalities and sense of optimization are reversed 
in the pair of problems. 

The proposition of significance is the following: 

Dual Theorem: (a) If both the primal and dual problems possess feasible
solutions, then the primal problem has an optimal solution $x_j^*$, for $j = 1, 2, \ldots, n$,
the dual problem has an optimal solution $y_i^*$, for $i = 1, 2, \ldots, m$, and

(11)  $\sum_{j=1}^{n} c_j x_j^* = \sum_{i=1}^{m} b_i y_i^*$.

(b) If either the primal or dual problem possesses a feasible solution with a 
finite optimal objective-function value, then the other problem possesses a 
feasible solution with the same optimal objective-function value. 

To start demonstrating the ramifications of duality, we show that any feasible 
solution to one problem provides a bound on the optimal value of the objective 
function for the other problem. 

Let $x_j$ actually satisfy the primal constraints, and $y_i$ similarly satisfy the dual
constraints. Multiply the $i$th constraint in the primal by $y_i$. Likewise multiply
the $j$th constraint in the dual by $x_j$. Since $y_i \ge 0$ and $x_j \ge 0$, the directions of the
inequalities are unchanged by the multiplication. Add all the resultant constraints
from the primal to give

(12)  $\sum_{i=1}^{m} y_i \left( \sum_{j=1}^{n} a_{ij} x_j \right) \le \sum_{i=1}^{m} b_i y_i$,

and add all the resultant constraints from the dual to give

(13)  $\sum_{j=1}^{n} \left( \sum_{i=1}^{m} a_{ij} y_i \right) x_j \ge \sum_{j=1}^{n} c_j x_j$.

Note that adding like-directioned inequalities preserves the inequality direction.

The calculated quantities on the left-hand sides of (12) and (13) are the same;
therefore,

(14)  $\sum_{j=1}^{n} c_j x_j \le \sum_{i=1}^{m} b_i y_i$.

Consequently, the value of the objective function for a feasible solution to one
of the problems bounds the objective-function value in the other problem for
any feasible solution, including an optimal one.

For the example of the primal problem stated in (7) and (8), and its dual in
(9) and (10), we can write (14) as

(15)  $4x_1 + 5x_2 + 9x_3 \le 16y_1 + 25y_2$,

for any feasible primal and dual solutions. A feasible solution to the primal problem
is $x_1 = x_2 = 0$ and $x_3 = 8$, which has an objective-function value of 72. A feasible
solution to the dual problem is $y_1 = 5$ and $y_2 = 0$, which has an objective-function
value of 80. Therefore, the optimal objective-function value for both primal and
dual problems lies within the range of 72 to 80. Explain why.
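
The bound is easy to check numerically. The following sketch, whose function names are ours and purely illustrative, tests the two trial solutions for feasibility in the pair (7) through (10) and evaluates both objective functions.

```python
# Data of the illustrative pair (7)-(10).
A = [[1, 1, 2],
     [7, 5, 3]]
b = [16, 25]
c = [4, 5, 9]

def primal_feasible(x):
    """x >= 0 and A x <= b."""
    return all(v >= 0 for v in x) and all(
        sum(a * v for a, v in zip(row, x)) <= bi for row, bi in zip(A, b))

def dual_feasible(y):
    """y >= 0 and y A >= c."""
    return all(v >= 0 for v in y) and all(
        sum(A[i][j] * y[i] for i in range(len(A))) >= c[j]
        for j in range(len(c)))

x = [0, 0, 8]
y = [5, 0]
lower = sum(cj * xj for cj, xj in zip(c, x))   # primal objective value
upper = sum(bi * yi for bi, yi in zip(b, y))   # dual objective value
print(primal_feasible(x), dual_feasible(y), lower, upper)   # True True 72 80
```

Any other feasible pair you try will likewise give a primal value no larger than the dual value, in accordance with (14).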

If one problem has an unbounded optimal solution, then the other problem
cannot have a feasible solution. For example, suppose for any value of the primal
objective function, no matter how large, there exists a feasible solution yielding this
value. Then the dual problem can have no feasible solution. If it did, a contradiction
would arise. By (14), the dual solution would provide an upper bound on the
value of the primal objective function for any feasible primal solution. But we
assumed that the primal objective can be made arbitrarily large.

We also point out that the Dual Theorem indicates the fundamental test for
optimality of a trial feasible solution to the primal problem. If there is a corresponding
feasible solution to the dual problem such that the values of both objective
functions are equal, then the two solutions must be optimal in their respective
problems. (Explain why in a sentence or two.)

►An interesting corollary of the Dual Theorem is the following: 

Theorem of Complementary Slackness. Let $x_j^*$, for $j = 1, 2, \ldots, n$, and $y_i^*$,
for $i = 1, 2, \ldots, m$, be corresponding feasible solutions to the primal and dual problems,
respectively. Then both are optimal if and only if

$y_i^* \left( \sum_{j=1}^{n} a_{ij} x_j^* - b_i \right) = 0$  for $i = 1, 2, \ldots, m$

$x_j^* \left( \sum_{i=1}^{m} a_{ij} y_i^* - c_j \right) = 0$  for $j = 1, 2, \ldots, n$.

This implies that whenever a constraint in one of the problems holds with strict inequality,
so that there is slack in the constraint, the corresponding variable in the other
problem equals 0.
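
To see the theorem at work, the sketch below applies it to the illustrative pair (7) through (10). The optimal values used, $x^* = (0, \tfrac{2}{7}, \tfrac{55}{7})$ and $y^* = (\tfrac{30}{7}, \tfrac{1}{7})$, are not given in the text; they were worked out separately for this example and should be treated as an assumption you can verify yourself. The function `complementary` is our own name, and it checks only the product conditions, not feasibility.

```python
from fractions import Fraction as F

A = [[1, 1, 2],
     [7, 5, 3]]
b = [16, 25]
c = [4, 5, 9]

# Assumed optimal pair for (7)-(10), worked out separately for this sketch.
x_opt = [F(0), F(2, 7), F(55, 7)]
y_opt = [F(30, 7), F(1, 7)]

def complementary(x, y):
    """True if every slack-times-variable product is zero in both problems."""
    row_slack = [bi - sum(a * v for a, v in zip(row, x))
                 for row, bi in zip(A, b)]
    col_surplus = [sum(A[i][j] * y[i] for i in range(len(A))) - c[j]
                   for j in range(len(c))]
    return (all(yi * s == 0 for yi, s in zip(y, row_slack))
            and all(xj * s == 0 for xj, s in zip(x, col_surplus)))

primal_value = sum(cj * xj for cj, xj in zip(c, x_opt))
dual_value = sum(bi * yi for bi, yi in zip(b, y_opt))
print(complementary(x_opt, y_opt), primal_value, dual_value)
print(complementary([0, 0, 8], [5, 0]))   # feasible pair, but not optimal
```

The feasible pair $x = (0, 0, 8)$, $y = (5, 0)$ fails the test because $x_3$ is positive while the third dual constraint holds with strict inequality.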

We used convenient canonical forms for stating the primal and dual problems in (1)
through (6). For any particular problem, you may have to employ the methods in
Chap. 3 to transform the original statement to conform with either (1), (2), and (3) or
(4), (5), and (6). You will gain further perception of duality relationships by seeing the
pair of problems stated in the following expanded canonical forms:

(i)    maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(ii)   $\sum_{j=1}^{n} a_{ij} x_j \le b_i$  for $i = 1, 2, \ldots, h$  ($h \le m$)

(iii)  $\sum_{j=1}^{n} a_{ij} x_j = b_i$  for $i = h + 1, h + 2, \ldots, m$

(iv)   $x_j \ge 0$  for $j = 1, 2, \ldots, k$

(v)    $x_j$ unrestricted in sign  for $j = k + 1, k + 2, \ldots, n$,

and

(i')   minimize $\sum_{i=1}^{m} b_i y_i$

subject to

(ii')  $\sum_{i=1}^{m} a_{ij} y_i \ge c_j$  for $j = 1, 2, \ldots, k$

(iii') $\sum_{i=1}^{m} a_{ij} y_i = c_j$  for $j = k + 1, k + 2, \ldots, n$

(iv')  $y_i \ge 0$  for $i = 1, 2, \ldots, h$

(v')   $y_i$ unrestricted in sign  for $i = h + 1, h + 2, \ldots, m$.


The Dual Theorem is valid for the above pair of problems. 

Test your facility with the idea of duality by this exercise. Take as correct the relationships
(1) through (6). Then derive the correspondence between (iii) and (v') by employing
the devices in Chap. 3. The duality relationships can be summarized as follows:


Primal (Maximize)                         Dual (Minimize)

Objective function                        Right-hand side
Right-hand side                           Objective function
jth column of coefficients                jth row of coefficients
ith row of coefficients                   ith column of coefficients
jth variable nonnegative                  jth relation an inequality (≥)
jth variable unrestricted in sign         jth relation an equality
ith relation an inequality (≤)            ith variable nonnegative
ith relation an equality                  ith variable unrestricted in sign.


◄ 
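
For the basic canonical pair (1) through (6), the rules amount to transposing the coefficient array and interchanging the objective coefficients with the right-hand-side constants. A minimal sketch follows; the function `dual_of` is our own illustration, and it deliberately ignores the equality and unrestricted-variable cases of the expanded forms.

```python
def dual_of(c, A, b):
    """Dual of: maximize c.x subject to A x <= b, x >= 0.

    Returns (objective, rows, rhs) describing
    minimize objective.y subject to rows y >= rhs, y >= 0.
    """
    rows = [list(col) for col in zip(*A)]   # jth primal column -> jth dual row
    return list(b), rows, list(c)

# The illustrative primal (7)-(8) yields the dual (9)-(10).
objective, rows, rhs = dual_of([4, 5, 9], [[1, 1, 2], [7, 5, 3]], [16, 25])
print(objective)   # [16, 25]
print(rows)        # [[1, 7], [1, 5], [2, 3]]
print(rhs)         # [4, 5, 9]
```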


5.5 SOLUTION TO THE DUAL PROBLEM 

By now, you must be wondering how to calculate a solution to the dual problem.
But like the character in Molière who is delighted to hear he has been speaking
prose all his life without knowing it, you have already calculated dual solutions
without knowing it.

Optimal Values of Dual Variables. (a) The coefficients of the slack variables in
Row 0 of the final simplex iteration of a maximizing problem are the optimal values
of the dual variables. (b) The coefficient of variable $x_j$ in Row 0 of the final simplex
iteration represents the difference between the left- and right-hand sides of the $j$th dual
constraint for the associated optimal dual solution.

To illustrate, consider the Product-Mix Selection Example originally written as

maximize $4x_1 + 5x_2 + 9x_3 + 11x_4$

subject to

$1x_1 + 1x_2 + 1x_3 + 1x_4 \le 15$
$7x_1 + 5x_2 + 3x_3 + 2x_4 \le 120$
$3x_1 + 5x_2 + 10x_3 + 15x_4 \le 100$
$x_j \ge 0$  for $j = 1, 2, 3, 4$.

Verify that the dual problem is

(1)  minimize $15y_1 + 120y_2 + 100y_3$

subject to

       $1y_1 + 7y_2 + 3y_3 \ge 4$
       $1y_1 + 5y_2 + 5y_3 \ge 5$
(2)    $1y_1 + 3y_2 + 10y_3 \ge 9$
       $1y_1 + 2y_2 + 15y_3 \ge 11$
       $y_1 \ge 0$   $y_2 \ge 0$   $y_3 \ge 0$.


Look at the coefficients of the three slack variables in Row 0 of the final iteration,
which we have been referring to as (F), given in Sec. 5.1. You can conclude
from (a) above that the optimal values of the dual variables are

(3)  $y_1 = \tfrac{13}{7}$   $y_2 = 0$   $y_3 = \tfrac{5}{7}$.

Copy these values as you will want to refer to them below.

First verify that the constraints in (2) are satisfied:

       $1(\tfrac{13}{7}) + 3(\tfrac{5}{7}) = \tfrac{28}{7} \ge 4$
(4)    $1(\tfrac{13}{7}) + 5(\tfrac{5}{7}) = \tfrac{38}{7} \ge 5$
       $1(\tfrac{13}{7}) + 10(\tfrac{5}{7}) = \tfrac{63}{7} \ge 9$
       $1(\tfrac{13}{7}) + 15(\tfrac{5}{7}) = \tfrac{88}{7} \ge 11$.

Second, check that the value of the dual objective function is the same as the
value of the primal objective function:

(5)  $15(\tfrac{13}{7}) + 120(0) + 100(\tfrac{5}{7}) = \tfrac{695}{7}$.

The values in (3) must be optimal, since they satisfy all the dual constraints and
yield an objective-function value equal to the optimal primal value.

Finally, calculate the differences between the left- and right-hand sides of (4).
For example, the second and third constraints give

(6)  $\tfrac{38}{7} - 5 = \tfrac{3}{7}$
     $\tfrac{63}{7} - 9 = 0$.

These are the coefficients of $x_2$ and $x_3$, respectively, in Row 0 of (F), as claimed in
(b) above.

By reference to the notion of duality, you now can deepen your understanding
of what is really happening in the simplex method. Interpret the coefficients of
the slack variables in Row 0 of the primal problem at each iteration as trial values
of the dual variables. Likewise view the other coefficients in Row 0 as the difference
between the left- and right-hand sides of (2), using these trial values for the
dual variables. Then the simplex method can be seen as an approach that seeks feasibility
for the dual problem while maintaining feasibility in the primal problem. As soon as feasible
solutions to both problems are obtained, the iterations terminate.
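
The verification just carried out by hand can be repeated with exact arithmetic. In the sketch below the data are those of the Product-Mix Selection Example and the dual values are read from the slack columns of Row 0 of (F); the variable names are ours.

```python
from fractions import Fraction as F

# Product-Mix Selection Example.
A = [[1, 1, 1, 1],
     [7, 5, 3, 2],
     [3, 5, 10, 15]]
b = [15, 120, 100]
c = [4, 5, 9, 11]

# Row 0 coefficients of the slacks x5, x6, x7 in (F).
y = [F(13, 7), F(0), F(5, 7)]

# Left-hand side of each dual constraint, and its excess over c_j.
lhs = [sum(A[i][j] * y[i] for i in range(3)) for j in range(4)]
surplus = [l - cj for l, cj in zip(lhs, c)]
dual_value = sum(bi * yi for bi, yi in zip(b, y))

print([str(s) for s in surplus])   # ['0', '3/7', '0', '11/7']
print(dual_value)                  # 695/7
```

The surpluses reproduce the Row 0 coefficients of $x_1$ through $x_4$ in (F): zero for the basic variables $x_1$ and $x_3$, and $\tfrac{3}{7}$ and $\tfrac{11}{7}$ for the nonbasic $x_2$ and $x_4$.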


*One more time. The preceding explanation of how to find the optimal
values of the dual variables was predicated on the model being specified with all
(≤) inequality constraints. Suppose instead that it is written as

(7)  maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(8)  $\sum_{j=1}^{n} a_{ij} x_j = b_i$  for $i = 1, 2, \ldots, m$
     $x_j \ge 0$  for $j = 1, 2, \ldots, n$.

For simplicity, assume that the variables have been arranged so that $x_1, x_2, \ldots, x_m$
is an optimal basis. Then you can find the optimal values of the dual variables
by solving the $m$ simultaneous linear equations

(9)  $\sum_{i=1}^{m} a_{ij} y_i = c_j$  for $j = 1, 2, \ldots, m$.
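
As an illustration of (9), the sketch below solves these equations for the Product-Mix Selection Example after its slack variables have been added, so that the constraints are equalities and the optimal basis is $x_1$, $x_3$, $x_6$. The solver `solve` is a plain Gauss-Jordan routine written for this example; it assumes the basis columns are linearly independent and is not meant as a general-purpose code.

```python
from fractions import Fraction as F

def solve(M, rhs):
    """Solve the square system M y = rhs by Gauss-Jordan elimination."""
    n = len(M)
    aug = [[F(v) for v in row] + [F(r)] for row, r in zip(M, rhs)]
    for col in range(n):
        # Find a row with a nonzero entry in this column and swap it up.
        piv = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[piv] = aug[piv], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]

# One equation per basic variable: its column of (I) dotted with y equals c_j.
basis_columns = [
    [1, 7, 3],    # x1
    [1, 3, 10],   # x3
    [0, 1, 0],    # x6 (slack of Row 2)
]
basis_profits = [4, 9, 0]

y = solve(basis_columns, basis_profits)
print([str(v) for v in y])   # ['13/7', '0', '5/7']
```

These are the same values read earlier from the slack columns of Row 0 in (F).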


►The duality result is readily apparent when you employ the matrix notation that we
developed in Sec. 4.10. In such terms, the dual pair of problems is

       maximize $cx$ subject to          minimize $yb$ subject to
(i)    $Ax \le b$                         $yA \ge c$
       $x \ge 0$                          $y \ge 0$.

As you saw in Sec. 4.10, a primal solution can be written as

(ii)   $x_B = B^{-1}b$.

The corresponding objective-function value is

(iii)  $c_B B^{-1} b$,

and the accompanying conditions for optimality are

(iv)   $c_B B^{-1} A - c \ge 0$     $c_B B^{-1} \ge 0$.

To verify that the associated optimal dual solution is

(v)    $y_B = c_B B^{-1}$,

only observe that (iv) then demonstrates that the feasibility conditions for the dual
problem are satisfied and (iii) shows that the values of the primal and dual objective
functions are equal. Note also that if the primal constraints are equalities, then (v)
continues to hold, but $y_B$ no longer has to be nonnegative. ◄

5.6 MORE POSTOPTIMALITY ANALYSIS

We promised that if you studied the Dual Theorem, you then would be better 
able to understand and do sensitivity analysis. Now we make good on our word. 
To start, we review what you have already learned at the beginning of the chapter, 
this time making clear where duality enters the picture. Afterwards, we answer 
some new sensitivity questions. 

Objective function. Recall in the Product-Mix Selection Example that
the dual restriction corresponding to the primal variable $x_2$ is

(1)  $1y_1 + 5y_2 + 5y_3 \ge 5$.

If the objective function coefficient of $x_2$ becomes $(5 + \delta)$, then $(5 + \delta)$ appears
on the right-hand side of (1). Substituting the optimal values of the dual variables
into (1), where $(5 + \delta)$ is used, yields

(2)  $1(\tfrac{13}{7}) + 5(0) + 5(\tfrac{5}{7}) \ge 5 + \delta$,

or

(3)  $\tfrac{3}{7} \ge \delta$.

Thus the current dual solution remains feasible provided $\delta$ does not exceed $\tfrac{3}{7}$.
If $\delta$ is made larger than this fraction, the dual solution is no longer feasible, and
consequently the primal solution is no longer optimal. This conclusion agrees with
what we found in Sec. 5.2.

Right-hand-side constants. In Sec. 5.3, we derived in an ad hoc
fashion that the coefficient of a slack variable in Row 0 of an optimal solution
represents the incremental value of another unit of the resource associated with
that variable. And in the preceding section we stated that the optimal value of
a dual variable is the very same coefficient. Putting the two statements together,
we have the following:

Interpretation of the Dual Variables. The optimal value of a dual variable 
indicates how much the objective function changes with a unit change in the associated 
right-hand-side constant, provided the current optimal basis remains feasible. 

This interpretation accords with the fundamental equality relation (11) of the
Dual Theorem in Sec. 5.4, which states:

(4)  optimal value of $x_0$ $= \sum$ (right-hand-side constants) $\times$ (optimal dual variables).

The optimal values of the dual variables are often called shadow prices.
When the right-hand-side constants represent quantities of scarce resources, the
shadow price indicates the unit worth of each resource as predicated on an optimal
solution to the primal problem. For the Product-Mix Selection Example, the
value $\tfrac{13}{7}$ is the shadow price for the first constraint (man-weeks), and similarly
0 holds for the second constraint (pounds of Material Y), and $\tfrac{5}{7}$ for the third constraint
(pounds of Material Z). Therefore, an additional man-week increases the
profit level by $\tfrac{13}{7}$; an additional pound of Material Z increases the profit level by
$\tfrac{5}{7}$; but an additional pound of Material Y does not improve the profit level. Why?
Because Material Y is already in excess supply, as evidenced by the slack variable
$x_6$ being in the optimal basis.

In general, a resource in excess supply is indicated by the slack variable for that
resource appearing in the final basis at a positive level. You would expect the corresponding
shadow price to be zero, because additional excess supply is of no value.
This is precisely what occurs. Since the slack is in the basis, its final Row 0 coefficient
is zero.

Interpreting the values of the dual variables as shadow prices leads to an insightful
view into the meaning of the dual problem. In the context of the Product-Mix
Selection Example, think of each dual variable as representing the true marginal
value of its associated resource, assuming that the company is acting optimally.
Then (4) indicates that total profit is the same as evaluating the total worth of all
the resources in scarce supply. Interpret each coefficient $a_{ij}$ as the consumption of
the $i$th resource by the $j$th activity. The summation $\sum_{i=1}^{m} a_{ij} y_i$ represents the underlying
economic cost of using the $j$th activity, evaluated according to the shadow
prices. The constraints of the dual problem ensure that at an optimal solution,
the profit of an activity can never exceed its true economic worth. What is more,
you will never employ an activity if its profit is less than its economic worth.

►The following example shows why the above interpretation of the dual variables has
to be qualified with the proviso that the current basis remains feasible. Consider

(i)    maximize $-x_1 - x_2$
subject to

       $2x_1 - x_2 \le 2$
(ii)   $-x_1 + 2x_2 \le 0$
       $x_1 \ge 0 \quad x_2 \ge 0$.

The solution $x_1 = x_2 = 0$ is clearly feasible and thus is optimum. Why? [You may find
it helpful to draw a two-dimensional diagram of the solution space implied by (ii) and
single out the unique optimum.]

Add slack variables $x_3$ and $x_4$ to the inequalities in (ii). Then if you choose $x_3$ and $x_4$
to be basic variables, after pivoting, you will find that $x_3 = 2$, $x_4 = 0$, which is a degenerate
solution. The corresponding dual variables are $y_1 = y_2 = 0$. Further, the associated
Row 0 relation will be $x_0 + x_1 + x_2 = 0$, so that both nonbasic variables have
strictly positive coefficients. If, instead, you choose $x_1$ and $x_3$ to be the basic variables,
then after pivoting, you will find that $x_1 = 0$, $x_3 = 2$, and the corresponding dual variables
are $y_1 = 0$, $y_2 = 1$. Further, the Row 0 relation will be $x_0 + 3x_2 + 1x_4 = 0$, so that
both nonbasic variables have strictly positive coefficients. Thus, because the optimal
solution leads to a degenerate basis, you have found different optimal values for $(y_1, y_2)$,
each set corresponding to one of the optimal bases. Suppose now the second constraint is




(iii)   $-x_1 + 2x_2 \le \delta_2$.

For $\delta_2$ nearly zero, which value for $y_2$ indicates how the objective function will change?
If $\delta_2 > 0$, then $x_1 = x_2 = 0$ is still optimal, and so $y_2 = 0$ is correct. The basis with
$x_1$ and $x_3$ is not feasible for $\delta_2 > 0$ ($x_1$ has a negative value). If $\delta_2 < 0$, then $y_2 = 1$ is
correct, for then you must have $x_1 > 0$ for feasibility. ◄
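
The two sets of dual values in this example can be checked mechanically. The sketch below, which is only an illustration and uses hypothetical helper names, solves for the dual variables of a given two-variable basis by Cramer's rule and also evaluates the optimal objective value as a function of $\delta_2$ (valid for $\delta_2 \ge -1$, so that the first constraint does not interfere).

```python
from fractions import Fraction as F

# Columns of x1, x2, x3 (slack 1), x4 (slack 2) in the two constraints of (ii),
# and the objective coefficients of "maximize -x1 - x2".
col = {1: (2, -1), 2: (-1, 2), 3: (1, 0), 4: (0, 1)}
obj = {1: -1, 2: -1, 3: 0, 4: 0}

def duals(basis):
    """Solve y1*a1k + y2*a2k = c_k for the two basic columns k (Cramer's rule)."""
    (p, q), (r, s) = col[basis[0]], col[basis[1]]
    cp, cr = obj[basis[0]], obj[basis[1]]
    det = F(p * s - q * r)
    return ((cp * s - cr * q) / det, (p * cr - r * cp) / det)

def optimal_value(delta2):
    """Optimal objective when the second constraint reads -x1 + 2*x2 <= delta2.

    For delta2 >= 0 the origin stays optimal; for -1 <= delta2 < 0 the best
    point is x1 = -delta2, x2 = 0, so the objective equals delta2.
    """
    return min(F(0), F(delta2))

print(duals((3, 4)))   # basis x3, x4
print(duals((1, 3)))   # basis x1, x3
```

Comparing `optimal_value` for small positive and small negative $\delta_2$ shows that each dual value is the correct rate of change on one side of zero only.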

Technological coefficients. To begin consideration of sensitivity tests
on the $a_{ij}$, suppose an entirely new activity is added to the model. Is it advantageous
to enter it into the basis? The easiest test is to check whether the associated dual
constraint is satisfied by the current values of the dual variables. If not, the new
activity should be introduced.

Consider the Product-Mix Selection Example, as given by (I) in Sec. 5.1.
Suppose you add another variable $x_8$, and let $a_{28}$ denote its coefficient in Row 2:

       $+\,1x_8$    Row 1
(5)    $+\,a_{28}x_8$    Row 2
       $+\,17x_8$    Row 3.

Let the profit coefficient of $x_8$ in the objective function be $c_8$. At what value
of $c_8$ is it attractive to enter $x_8$? The associated dual relation is

(6)    $1y_1 + a_{28}y_2 + 17y_3 \ge c_8$.

Substitute the current optimal values of the dual variables in (6) to obtain

(7)    $1(\tfrac{13}{7}) + a_{28}(0) + 17(\tfrac{5}{7}) \ge c_8$,
or

(8)    $14 \ge c_8$.

Therefore, if $c_8$ exceeds 14, you should enter $x_8$ into the basis. (Because $y_2 = 0$,
the particular value of $a_{28}$ does not affect this test.)
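
The pricing-out test for a new activity takes only a few lines of code. The sketch below is illustrative: the function names are made up, the dual values $\frac{13}{7}$, 0, $\frac{5}{7}$ are those of the optimal basis of the example, and the Row 2 coefficient is passed as a parameter because it is multiplied by $y_2 = 0$ and so has no influence on the outcome.

```python
from fractions import Fraction as F

# Current optimal dual values for Rows 1, 2, 3 of the Product-Mix example.
y = [F(13, 7), F(0), F(5, 7)]

def imputed_cost(column, duals):
    """Economic cost of one unit of an activity, valued at the shadow prices."""
    return sum(F(a) * yi for a, yi in zip(column, duals))

def should_enter(column, profit, duals):
    """A new activity is attractive when its profit exceeds its imputed cost."""
    return profit > imputed_cost(column, duals)

# Row 2 coefficient chosen arbitrarily here (an assumption for illustration).
new_column = [1, 1, 17]
threshold = imputed_cost(new_column, y)
print("break-even profit:", threshold)
print("enter at profit 20?", should_enter(new_column, 20, y))
```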

►Assume $c_8 = 20$. Then the coefficient of $x_8$ in Row 0 of (F) is $14 - 20 = -6$. What
are the coefficients of $x_8$ in other rows of (F)? This question may be answered in the
following way.

View the coefficients in (5) row by row as equivalent to the sum of 1 times the coefficients
of $x_5$, $a_{28}$ times the coefficients of $x_6$, and 17 times the coefficients of $x_7$, as given
in (I), where $a_{28}$ denotes the coefficient of $x_8$ in Row 2 of (5):

        $[1(1) + a_{28}(0) + 17(0)]x_8 = 1x_8$    Row 1
(iv)    $[1(0) + a_{28}(1) + 17(0)]x_8 = a_{28}x_8$    Row 2
        $[1(0) + a_{28}(0) + 17(1)]x_8 = 17x_8$    Row 3.

Then the coefficients of $x_8$ in (F) can be computed row by row as the same multiples
of the corresponding coefficients of $x_5$, $x_6$, and $x_7$ in (F). Specifically,

        $[1(\tfrac{10}{7}) + a_{28}(0) + 17(-\tfrac{1}{7})]x_8 = -1x_8$    Row 1
(v)     $[1(-\tfrac{61}{7}) + a_{28}(1) + 17(\tfrac{4}{7})]x_8 = (1 + a_{28})x_8$    Row 2
        $[1(-\tfrac{3}{7}) + a_{28}(0) + 17(\tfrac{1}{7})]x_8 = 2x_8$    Row 3.
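
The row-by-row computation just described is a weighted sum of the slack-variable columns of (F). The following sketch shows it in Python; the slack columns are those of the final tableau of the Product-Mix example as restated here, and the Row 2 coefficient `a2` is a hypothetical value chosen only to make the example concrete.

```python
from fractions import Fraction as F

# Coefficients of the slack variables x5, x6, x7 in Rows 1, 2, 3 of (F).
f5 = [F(10, 7), F(-61, 7), F(-3, 7)]
f6 = [F(0), F(1), F(0)]
f7 = [F(-1, 7), F(4, 7), F(1, 7)]

def column_in_F(a1, a2, a3):
    """Column of a new variable in (F), given its coefficients a1, a2, a3 in (I)."""
    return [a1 * p + a2 * q + a3 * r for p, q, r in zip(f5, f6, f7)]

a2 = F(1, 2)                    # hypothetical Row 2 coefficient (assumption)
col = column_in_F(1, a2, 17)    # Rows 1 and 3 do not depend on a2
print(col)
```

Rows 1 and 3 give $-1$ and 2 whatever the Row 2 coefficient is, because the $x_6$ column of (F) is zero in those rows.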
If $x_j$ is a nonbasic variable, you can study the effect of changing its technological
coefficients in exactly the same fashion. To illustrate, $x_4$ is nonbasic in the optimal
solution of the Product-Mix Selection Example. Consider the possibility that
its coefficient in Row 3 is altered by $\delta$. Then the associated dual restriction is

(9)    $1y_1 + 2y_2 + (15 + \delta)y_3 \ge 11$.

Substitute the current optimal values of the dual variables in (9) to obtain

(10)   $1(\tfrac{13}{7}) + 2(0) + (15 + \delta)(\tfrac{5}{7}) \ge 11$
or

(11)   $\delta \ge -\tfrac{11}{5}$.

Therefore, if $\delta$ is smaller than $-\tfrac{11}{5}$, you should enter $x_4$ into the basis.
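
The bound in (11) follows from solving the dual restriction for $\delta$. A minimal check in Python, with illustrative variable names and the dual values of the optimal basis taken as given:

```python
from fractions import Fraction as F

y1, y2, y3 = F(13, 7), F(0), F(5, 7)   # current optimal dual values
c4 = 11                                 # profit coefficient of x4
a14, a24, a34 = 1, 2, 15                # coefficients of x4 in Rows 1, 2, 3

# Dual restriction (9): a14*y1 + a24*y2 + (a34 + delta)*y3 >= c4.
# Solving for delta gives the smallest alteration that keeps x4 nonbasic.
delta_min = (c4 - a14 * y1 - a24 * y2) / y3 - a34

def x4_should_enter(delta):
    """True when the altered activity x4 violates its dual restriction."""
    return a14 * y1 + a24 * y2 + (a34 + delta) * y3 < c4

print(delta_min)
```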

If Xj is a basic variable, analyzing the effect of changing a technological 
coefficient is more complex. The analysis involves a simultaneous consideration 
of both the primal and dual problems. The derivation goes beyond the scope of 
this text, but can be found in several advanced books on linear programming. 

5.7 SUMMARY 

In Secs. 5.1 to 5.6, you have seen how to probe the sensitivity of an optimal 
solution of a linear programming problem to changes in the model formulation. 
In particular you studied how to 

(i) Find the range of variation in each objective-function coefficient and 
right-hand-side constant over which the current basis remains optimal, 
provided the basis remains feasible. 

(ii) Evaluate the economic impact of specific changes in the model’s data. 

(iii) Revise a previously optimal solution after adding new variables. 

The mechanics for all these postoptimality analyses are straightforward extensions
of the simplex-method arithmetic. But duality is the key idea that ensures the
mechanics are correct.

Every linear optimization model has a dual formulation. In solving one of 
these problems, you automatically solve the other. As you progress in learning the 
fundamentals of operations research, you will find the notion of duality frequently 
reappearing. 

*5.8 DUAL SIMPLEX ALGORITHM 

Given the reciprocal relations between the primal and dual problems, another
possibility for solving a linear program seems plausible. Consider an algorithm for
the primal problem in which, until the final iteration, each trial primal solution is
infeasible as it contains negative-valued variables; but the associated dual solution
stays feasible at every iteration. This is the idea of the dual simplex algorithm.

There are at least two practical reasons for studying the dual simplex algorithm.
One is that it sometimes permits you to easily select an initial basis without
having to add any artificial variables. You will observe how this can happen in
the example below. Second, it aids in certain types of sensitivity analysis, as you
will see in the subsequent section.

We begin the discussion with an illustration. Examine the problem

(1)    minimize $2x_1 + 1x_3$
subject to

       $1x_1 + 1x_2 - 1x_3 \ge 5$
(2)    $1x_1 - 2x_2 + 4x_3 \ge 8$
       $x_1 \ge 0 \quad x_2 \ge 0 \quad x_3 \ge 0$.

Here it is convenient to designate (1) and (2) as the primal problem, and so its
dual is

(3)    maximize $5y_1 + 8y_2$
subject to

       $1y_1 + 1y_2 \le 2$
(4)    $1y_1 - 2y_2 \le 0$
       $-1y_1 + 4y_2 \le 1$
       $y_1 \ge 0 \quad y_2 \ge 0$.

As before, if you let $x_0$ be the value of the objective function and add surplus
variables to (2), you can write the primal problem as

       $1x_0 - 2x_1 - 1x_3 = 0$    Row 0
(5)    $1x_1 + 1x_2 - 1x_3 - 1x_4 = 5$    Row 1
       $1x_1 - 2x_2 + 4x_3 - 1x_5 = 8$    Row 2.

A starting solution with $x_0$, $x_4$, and $x_5$ as basic variables is infeasible, since
$x_4 = -5$ and $x_5 = -8$ when $x_1 = x_2 = x_3 = 0$. As a result, the current value
$x_0 = 0$ is too small; you can see from (1) and (2) that $x_0$ will be positive when
feasibility occurs. But notice that for this infeasible starting solution, the coefficients
in Row 0 satisfy the optimality condition in Simplex Criterion I (Minimization)
stated in Sec. 4.6. Therefore, we say that this initial basis is dual-feasible.
[Verify that $y_1 = 0$ and $y_2 = 0$ satisfy the constraints (4).] In this method, we
seek to maintain dual feasibility but remove the primal infeasibilities.

The method. Proceeding by analogy with the simplex algorithm, we construct
a dual simplex algorithm. First we formulate a rule indicating
which variable is to be removed from the current basis.

Dual Simplex Criterion I. If there are basic variables having negative values,
select one such variable that is most negative (say $x_k$). If all basic variables have
nonnegative values, an optimal solution has been obtained.
To apply Criterion I, rewrite (5) after multiplying Rows 1 and 2 by $(-1)$;
this gives

       $1x_0 - 2x_1 - 1x_3 = 0$    Row 0
(6)    $-1x_1 - 1x_2 + 1x_3 + 1x_4 = -5$    Row 1
       $-1x_1 + 2x_2 - 4x_3 + 1x_5 = -8$    Row 2.

According to Criterion I, remove the basic variable $x_5$ since it is more negative
than $x_4$.

Next we provide a rule selecting a variable to be introduced into the basis. Keep
in mind that you want to preserve dual feasibility. After the change of basis, the
coefficients in Row 0 are to remain nonpositive.

Dual Simplex Criterion II (Minimization). (a) Take the ratios of the current
coefficients in Row 0 of the nonbasic variables to the coefficients in the row of the
currently leaving $x_k$ (ignore ratios with zero or positive numbers in the denominator).
(b) Select the minimum ratio, which occurs for a variable $x_j$.
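
The two criteria can be written as small selection functions. The sketch below is one possible rendering for the minimization case, with hypothetical function names; it is applied to the values in (6), where the nonbasic variables $x_1$, $x_2$, $x_3$ occupy positions 0, 1, 2.

```python
from fractions import Fraction as F

def leaving_row(rhs):
    """Criterion I: index of the most negative basic value, or None if none is negative."""
    k = min(range(len(rhs)), key=lambda i: rhs[i])
    return k if rhs[k] < 0 else None

def entering_column(row0, pivot_row):
    """Criterion II (minimization): minimum ratio over negative pivot-row entries."""
    ratios = {j: F(row0[j]) / F(pivot_row[j])
              for j in range(len(row0)) if pivot_row[j] < 0}
    if not ratios:
        return None, ratios          # no admissible ratio exists
    return min(ratios, key=ratios.get), ratios

# Values from (6): right-hand sides of Rows 1 and 2, then the Row 0 and
# Row 2 coefficients of the nonbasic variables x1, x2, x3.
k = leaving_row([-5, -8])
j, ratios = entering_column([-2, 0, -1], [-1, 2, -4])
print("leaving row index:", k, "entering position:", j, "ratios:", ratios)
```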


The calculations indicated by Criterion II are performed in Fig. 5.1. Since you know
that $x_3$ is to replace $x_5$ in a trial basis, you can perform the same change-of-basis
calculations as you do in the ordinary simplex method. Start the pivot operation
on $(-4)$, the coefficient of $x_3$ in Row 2; this yields

       $1x_0 - 2x_1 - 1x_3 = 0$    Row 0
(7)    $-1x_1 - 1x_2 + 1x_3 + 1x_4 = -5$    Row 1
       $\tfrac{1}{4}x_1 - \tfrac{1}{2}x_2 + 1x_3 - \tfrac{1}{4}x_5 = 2$    Row 2.

                           Nonbasic Variables
                           x1       x2       x3
   Current Row 0 ÷         -2        0       -1
   Coeffs. in Row 2 =      -1        2       -4
   Ratios                   2      none      .25
   Minimum                                   .25

Figure 5.1. Iteration 1: Dual Simplex Criterion II for $x_5$ Leaving Basis.

Thus you have created a coefficient 1 for $x_3$ in Row 2. Next create coefficients
equal to 0 in Rows 0 and 1:

Row 0: multiply Row 2 by 1 and add to Row 0,
Row 1: multiply Row 2 by $-1$ and add to Row 1.

The result is

       $1x_0 - \tfrac{7}{4}x_1 - \tfrac{1}{2}x_2 - \tfrac{1}{4}x_5 = 2$    Row 0
(8)    $-\tfrac{5}{4}x_1 - \tfrac{1}{2}x_2 + 1x_4 + \tfrac{1}{4}x_5 = -7$    Row 1
       $\tfrac{1}{4}x_1 - \tfrac{1}{2}x_2 + 1x_3 - \tfrac{1}{4}x_5 = 2$    Row 2.

The corresponding basic solution is

(9)    $x_0 = 2 \quad x_4 = -7 \quad x_3 = 2$.

As the infeasibilities are removed, the objective function progresses toward the 
optimal value. 
Iteration 2. Once again apply Criterion I and find that $x_4$ should leave the
basis. Figure 5.2 exhibits the calculations for Criterion II. Observe that $x_2$ is to enter
the basis in place of $x_4$. Carrying out the usual change-of-basis calculations
transforms (8) into

        $1x_0 - \tfrac{1}{2}x_1 - 1x_4 - \tfrac{1}{2}x_5 = 9$    Row 0
(10)    $\tfrac{5}{2}x_1 + 1x_2 - 2x_4 - \tfrac{1}{2}x_5 = 14$    Row 1
        $\tfrac{3}{2}x_1 + 1x_3 - 1x_4 - \tfrac{1}{2}x_5 = 9$    Row 2.

The associated basic solution is

(11)    $x_0 = 9 \quad x_2 = 14 \quad x_3 = 9$.

At this iteration all the basic variables are nonnegative, and consequently the
calculations terminate with (11) as the optimal solution to the primal problem (1)
and (2). The optimal values for the dual variables are minus the coefficients of the
slack variables in Row 0 of the final iteration of a minimizing problem.
Consequently, [$y_1 = -(-1) = 1$] and [$y_2 = -(-\tfrac{1}{2}) = \tfrac{1}{2}$] are optimal in (3) and (4).
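
The complete procedure for a minimization problem fits in a short routine. The following Python sketch is one way to code it, not a production implementation: it uses exact fractions, takes the tableau in the form of (6), and makes no provision against cycling. The function name and argument layout are assumptions made for this illustration.

```python
from fractions import Fraction as F

def dual_simplex_min(row0, rows, basis):
    """Dual simplex for a minimization tableau.

    row0  : Row 0 coefficients of x1..xn followed by the right-hand side
            (all coefficients nonpositive, i.e. the basis is dual-feasible).
    rows  : constraint rows in the same layout.
    basis : column index of the basic variable in each constraint row.
    """
    row0 = [F(v) for v in row0]
    rows = [[F(v) for v in r] for r in rows]
    basis = list(basis)
    n = len(row0) - 1
    while True:
        # Criterion I: most negative basic variable leaves.
        k = min(range(len(rows)), key=lambda i: rows[i][-1])
        if rows[k][-1] >= 0:
            return row0, rows, basis
        # Criterion II: minimum ratio over negative entries of the leaving row.
        candidates = [j for j in range(n) if rows[k][j] < 0]
        if not candidates:
            raise ValueError("primal problem has no feasible solution")
        j = min(candidates, key=lambda c: row0[c] / rows[k][c])
        # Ordinary change-of-basis (pivot) calculation.
        pivot = rows[k][j]
        rows[k] = [v / pivot for v in rows[k]]
        for i in range(len(rows)):
            if i != k:
                f = rows[i][j]
                rows[i] = [u - f * v for u, v in zip(rows[i], rows[k])]
        f = row0[j]
        row0 = [u - f * v for u, v in zip(row0, rows[k])]
        basis[k] = j

# Tableau (6): columns x1..x5, then the right-hand side.
final0, final_rows, final_basis = dual_simplex_min(
    [-2, 0, -1, 0, 0, 0],
    [[-1, -1, 1, 1, 0, -5],
     [-1, 2, -4, 0, 1, -8]],
    [3, 4],          # x4 and x5 are basic initially
)
print("objective:", final0[-1])
print("basis columns:", final_basis, "values:", [r[-1] for r in final_rows])
print("duals:", -final0[3], -final0[4])
```

Run on (6), the routine passes through the same two pivots as the text and stops with the solution in (11).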

Maximization. The algorithm has been explained for minimization problems
and is easily altered for maximization problems. At each iteration, all the
coefficients of the nonbasic variables in Row 0 are nonnegative. There is no change in
Dual Simplex Criterion I, but you now apply the following.

Dual Simplex Criterion II (Maximization). (a) Take the ratios of the current
coefficients in Row 0 of the nonbasic variables to the coefficients in the row of the
currently leaving $x_k$ (ignore ratios with zero or positive numbers in the denominator).
(b) Select the maximum ratio, which occurs for a variable $x_j$.

Computational characteristics. We do not discuss the properties
of the dual simplex algorithm as we did for the simplex method in Chap. 4. Most
of this would be simply repetitious. For completeness, however, we do mention
how the algorithm indicates when the primal problem has no feasible solution.
In applying Dual Simplex Criterion II, if the coefficients in the row of the presently
leaving $x_k$ are all nonnegative, then the primal problem is infeasible. (In this
instance, the dual problem has an unbounded optimal solution.)
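
The infeasibility signal is easy to test for in code. The sketch below uses a hypothetical helper name and made-up rows; it checks whether a row with a negative right-hand side has only nonnegative coefficients, in which case no nonnegative values of the variables can satisfy it.

```python
def signals_infeasible(row):
    """True if the row reads (nonnegative coefficients) * x = negative number."""
    *coeffs, rhs = row
    return rhs < 0 and all(c >= 0 for c in coeffs)

# A made-up row 1x1 + 1x2 + 1x3 = -1 cannot hold with nonnegative variables.
print(signals_infeasible([1, 1, 1, -1]))
# Row 2 of (6) has negative coefficients, so a pivot is still possible.
print(signals_infeasible([-1, 2, -4, 0, 1, -8]))
```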

We also call attention to one feature that can be a drawback to the dual simplex 
approach. If the iterations are stopped short of convergence, the current basic 
solution for the primal problem is not feasible. 

Many advanced computer programs for linear optimization models include
the dual simplex algorithm. In these codes, the computer automatically examines
the constraints and objective function and decides whether to apply the dual
method.

                           Nonbasic Variables
                           x1       x2       x5
   Current Row 0 ÷        -7/4     -2/4     -1/4
   Coeffs. in Row 1 =     -5/4     -2/4      1/4
   Ratios                  7/5       1      none
   Minimum                           1

Figure 5.2. Iteration 2: Criterion II for $x_4$ Leaving Basis.

Comparison of primal and dual algorithms. We list the connections
between the two methods we have examined for solving linear programming
problems.

(i) Both methods progress from one basic solution to a neighboring one. At 
each iteration, there is only one change in the composition of the set of 
basic variables. 

(ii) The value of the objective function at any iteration is at least as close to 
optimum as it was at the previous iteration. 

(iii) For both methods, Criterion I seeks to reduce an infeasibility. In the simplex
method, the rule eliminates a dual infeasibility. In the dual simplex
method, the rule removes a primal infeasibility.

(iv) For both methods, Criterion II seeks to maintain feasibility. In the simplex
method, the basic variables remain primal-feasible. In the dual simplex
method, the coefficients in Row 0 stay dual-feasible.

These properties can be restated succinctly as follows. 

(1) Applying the simplex method to a linear programming model is identical 
to the process of applying the dual simplex method to the dual problem. 

(2) Applying the dual simplex method to a linear programming model is 
identical to the process of applying the simplex method to the dual problem. 

*5.9 ADDITIONAL AND TIGHTER CONSTRAINTS 

In real applications of linear programming, it is not unusual to want to add new 
restrictions after you have obtained a preliminary solution. At times you may 
find it necessary to impose additional constraints once you have examined the full 
implications of a trial optimal solution. On other occasions you may purposely 
omit some constraints from the initial formulation in order to lighten the computational
burden. Empirical evidence suggests that the simplex method computations
increase roughly as a cubic function of the number of constraints. Hence you actually 
are well advised to be conservative in deciding on the number of constraints to 
start with. Frequently, you have a priori knowledge that certain restrictions are 
not likely to be binding in an optimal solution. These conditions, which sometimes 
are referred to as secondary constraints, are added later only if required. 

Testing whether a new constraint is satisfied by a current solution is the simplest 
case of postoptimality analysis. Merely insert the current optimal values for the 
basic variables and check whether the constraint is met. If not, the dual simplex 
method can be applied to restore feasibility. 

To illustrate, consider the Product-Mix Selection Example, and suppose the
following constraint is added to formulation (I):

(1)    $14x_1 + 7x_3 + 1x_4 + 1x_8 = 150$    Row 4,

where $x_8 \ge 0$ is the slack variable for this relation. Given the current basic optimal
solution, you see that

(2)    $14(\tfrac{50}{7}) + 7(\tfrac{55}{7}) + 1(0) + 1x_8 = 150$,
or

(3)    $x_8 = -5$.

The negative value for $x_8$ signals that the current solution does not satisfy the
new constraint. Therefore, you proceed as follows.

Add $x_8$ to the basis set of the enlarged problem. By the usual pivot operations,
eliminate the basic variables $x_1$ and $x_3$ from (1) by using Rows 1 and 3 of formulation
(F). Specifically, multiply Row 1 by $(-14)$ and Row 3 by $(-7)$ and add the
results to (1) to yield

(4)    $-12x_2 - 1x_4 - 17x_5 + 1x_7 + 1x_8 = -5$.

Now apply the Dual Simplex Criterion II (Maximization). You find that $x_2$
enters the basis in place of $x_8$. The resultant ordinary change-of-basis calculation
produces a feasible, and therefore optimal, primal solution:

(5)    $x_0 = \tfrac{8325}{84} \quad x_1 = \tfrac{575}{84} \quad x_6 = \tfrac{3930}{84} \quad x_3 = \tfrac{650}{84} \quad x_2 = \tfrac{35}{84}$.

Note that in many problems you will need more than one iteration of the dual 
simplex algorithm to restore primal feasibility. By using the above approach, you 
can simultaneously add several new constraints. 
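
The steps for adding this constraint can be traced numerically. In the sketch below, Rows 0, 1, and 3 of (F) are restated as an assumption (they come from Sec. 5.1), and the code eliminates the basic variables from the new row, applies Dual Simplex Criterion II (Maximization), and updates the values affected by the single pivot.

```python
from fractions import Fraction as F

# Assumed rows of (F): coefficients of x1..x7, then the right-hand side.
row0 = [0, F(3, 7), 0, F(11, 7), F(13, 7), 0, F(5, 7), F(695, 7)]
row1 = [1, F(5, 7), 0, F(-5, 7), F(10, 7), 0, F(-1, 7), F(50, 7)]   # basic x1
row3 = [0, F(2, 7), 1, F(12, 7), F(-3, 7), 0, F(1, 7), F(55, 7)]    # basic x3

# New constraint 14x1 + 7x3 + 1x4 + 1x8 = 150 (the x8 column is kept implicit).
new = [14, 0, 7, 1, 0, 0, 0, 150]

# Eliminate the basic variables x1 and x3 from the new row.
row4 = [n - 14 * r1 - 7 * r3 for n, r1, r3 in zip(new, row1, row3)]

# Criterion II (maximization): maximum ratio over negative entries of row4.
ratios = {j: row0[j] / row4[j] for j in range(7) if row4[j] < 0}
enter = max(ratios, key=ratios.get)

# Value of the entering variable after the pivot, and the updated values.
x_enter = row4[-1] / row4[enter]
x0 = row0[-1] - row0[enter] * x_enter
x1 = row1[-1] - row1[enter] * x_enter
x3 = row3[-1] - row3[enter] * x_enter

print("new row:", row4)
print("entering variable: x%d = %s" % (enter + 1, x_enter))
print("x0, x1, x3:", x0, x1, x3)
```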

Right-hand sides again. In the analysis of right-hand-side sensitivity 
in Secs. 5.3 and 5.6, we stopped short of stating what to do if a right-hand-side 
constant were altered so as to make the current optimal basic solution no longer 
feasible. In that case, a basic variable has turned negative and you would initiate 
the dual simplex algorithm, starting with the previous final iteration. 

For example, you saw in the Product-Mix Selection Example that if the right-hand
side of Row 1 is $(15 + 1\delta)$, then the optimal basis remains feasible only
for $-5 \le \delta \le \tfrac{325}{61}$, according to (2) in Sec. 5.3. Further, you learned that if $\delta$
lies below this range, then $x_1$ turns negative. To restore feasibility, you would
remove $x_1$ from the basis, and apply Dual Simplex Criterion II (Maximization) to
determine the variable to enter. You would use the same procedure if $\delta$ were above
the permissible range.
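
The permissible range for $\delta$ can be recomputed from the final tableau. The sketch below assumes the optimal basic values and the $x_5$ column of (F) from Sec. 5.1 (restated here, not given in this section); each basic variable equals its current value plus its $x_5$ coefficient times $\delta$, and feasibility requires every such expression to be nonnegative.

```python
from fractions import Fraction as F

# Assumed optimal basic values and x5-column entries of (F), row by row.
values = {"x1": F(50, 7), "x6": F(325, 7), "x3": F(55, 7)}
x5_col = {"x1": F(10, 7), "x6": F(-61, 7), "x3": F(-3, 7)}

# value + coeff * delta >= 0 gives a lower bound on delta when coeff > 0
# and an upper bound when coeff < 0.
lower = max(-values[v] / x5_col[v] for v in values if x5_col[v] > 0)
upper = min(-values[v] / x5_col[v] for v in values if x5_col[v] < 0)

def negative_basics(delta):
    """Basic variables that turn negative when Row 1 reads 15 + delta."""
    return [v for v in values if values[v] + x5_col[v] * delta < 0]

print("feasible for", lower, "<= delta <=", upper)
print("delta = -6 makes negative:", negative_basics(-6))
```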

*5.10 UPPER-BOUNDED VARIABLES 

In many important linear optimization models, each variable is bounded above:

(1)    $x_j \le u_j$ for $j = 1, 2, \ldots, n$.

Since the scaling of $x_j$ can be chosen freely, no loss in generality arises by assuming
$u_j = 1$.

Of course, (1) can be included with the rest of the inequalities in solving such
a model. But the simple form of (1) leads to a solution technique that avoids
enlarging the model by the $n$ constraints in (1). The method requires only a
minor modification in the rules of the simplex method.

The relation (1) can be viewed as

(2)    $x_j + x'_j = 1$ with $x'_j \ge 0$,

where $u_j$ now has been set equal to 1. As the iterations progress, it may be necessary
to remove $x_j$ by substitution using

(3)    $x_j = 1 - x'_j$.

This substitution causes $x'_j$ to appear explicitly in the model. At later iterations,
it may be necessary to remove $x'_j$ by using

(4)    $x'_j = 1 - x_j$.

As a result, $x_j$ then reappears explicitly.

We will state that an upper-bound substitution occurs whenever (3)
or (4) is utilized.

The algorithm consists of the Simplex Criterion I (Maximization), stated without
change, as well as Simplex Criterion II modified as follows.

Simplex Criterion II (Bounded Variables). (a) Take only the ratios of the
current right-hand side to the positive coefficients of the entering variable. Let $r_1$ be the
value of the minimum ratio, which occurs for the basic variable in Row $k$. (b) Take only the
ratios of (current right-hand side $- 1$) to the negative coefficients of the entering variable.
Let $r_2$ be the value of the minimum ratio, which occurs for the basic variable in Row $k'$.
(c) Let $r = \text{minimum}(r_1, r_2)$. (d) If $r \ge 1$, do not perform any change-of-basis
calculation. Make the upper-bound substitution for the entering variable. (e) Otherwise, if
$r = r_1$, perform the standard change-of-basis calculation, pivoting in Row $k$. (f) Otherwise,
if $r = r_2$, perform the standard change-of-basis calculation, pivoting in Row $k'$.
Then make the upper-bound substitution for the variable previously basic in Row $k'$.
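
The decision logic of this criterion can be sketched as a single function. The version below is illustrative only: the function name and return format are assumptions, every basic variable is taken to have upper bound 1 as the criterion states, and the sample numbers are made up rather than taken from the example that follows.

```python
from fractions import Fraction as F

def bounded_ratio_test(rhs, col):
    """Apply parts (a)-(f) of the criterion for one entering variable.

    rhs: current values of the basic variables, one per row.
    col: coefficients of the entering variable in the same rows.
    Returns (action, row, r); row is None when no pivot is made.
    """
    pairs = list(enumerate(zip(rhs, col)))
    # (a) ratios of right-hand side to positive coefficients.
    r1 = min(((F(b) / F(a), i) for i, (b, a) in pairs if a > 0), default=None)
    # (b) ratios of (right-hand side - 1) to negative coefficients.
    r2 = min(((F(b - 1) / F(a), i) for i, (b, a) in pairs if a < 0), default=None)
    found = [t for t in (r1, r2) if t is not None]
    # (c) r is the smaller ratio; with no ratios, only the bound limits the step.
    r = min(t[0] for t in found) if found else None
    if r is None or r >= 1:
        return ("substitute entering", None, r)            # (d)
    if r1 is not None and r1[0] == r:
        return ("pivot", r1[1], r)                          # (e)
    return ("pivot, then substitute leaving", r2[1], r)     # (f)

print(bounded_ratio_test([2, 3], [1, 2]))
print(bounded_ratio_test([F(1, 2)], [1]))
print(bounded_ratio_test([F(1, 2), F(9, 10)], [1, F(-1, 2)]))
```

The three calls exercise parts (d), (e), and (f) in turn.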

When the algorithm terminates, the model will contain explicitly either $x_j$ or
$x'_j$. If $x_j$ appears as a nonbasic variable, its optimal value is 0. If $x_j$ is a basic
variable, then it is presently at its optimal value. If $x'_j$ appears as a nonbasic variable,
then the optimal value of $x_j$ is 1. If $x'_j$ is a basic variable, then the optimal value of
$x_j$ is $1 - x'_j$. An example is given below.

►With an equally simple alteration, the dual simplex algorithm can be made to
accommodate upper-bounded variables. Criteria I and II are unchanged, but the upper-bound
substitution is employed. State how you think the algorithm would work. As a starting
point, take a solution that is optimal without the upper bounds (assume the optimum
value is finite), but infeasible with respect to these bounds. ◄
Example. Consider this problem

(5) maximize 4xi + 10^2 + + IIX4 

subject to 

Iat, + l«s + 1 a:j+ 

(6) 3a:, + 5a:s + IOa:, + ^ 

A O 

0 ^ ^ 1 for j = 1, 2, 3, 4. 

Let $x_5$ and $x_6$ be the slack variables for the first and second inequalities. Note
that these variables are not bounded by 1, but this will not cause any difficulties
in the calculations.

The initial basic solution consists of the slack variables, and is shown as iteration
1 in the condensed tableau in Fig. 5.3 (this form was presented in Sec. 4.9).
According to Simplex Criterion I (Maximization), variable $x_4$ enters the solution.
Since there are no negative coefficients for $x_4$,

(7) r = r, = minimum(|-,^)=^>I. 

Therefore Simplex Criterion II (Bounded Variables) indicates that an upper-bound
substitution is to be made for $x_4$, yielding the result labeled as iteration 2. Observe
that the tableau contains the variable $x'_4$ signifying the substitution, and the current
values of the basic variables have been changed accordingly.

Next $x_2$ enters the solution. Once again, there are no negative coefficients and so

,ox . . /3 17 \ 17 

(8) r = r, = mimmum ^j = 

thus X 2 enters and x^ leaves the basis. After the usual change-of-basis pivot step, 
the coefficients are shown as iteration 3. 

At this point, xj is to be introduced, and 

_ 1 21 

(9) r, — 2g.Q Ti — _3 — 80' 

Hence xj enters the basis and X 2 leaves at its upper bound. Therefore, apply the 
change-of-basis pivot step, and then the upper-bound substitution for X 2 ^ The 
result appears as iteration 4. 

Verify that now Xi enters the basis and x^ leaves, since r = rj < 1, giving the 
values in iteration 5. Check that, as a result, x^ enters the basis and x[ leaves at its 
upper bound, requiring another upper-bound substitution. The final tableau
is shown as iteration 6. The optimal values for Xx and x^ appear in the “Current 
Values” column; since xj is nonbasic, the optimal value of X2 = 1. 

In reviewing the iterations, be sure to observe how first x^ enters the solution at 
its upper bound, necessitating an upper-bound substitution; then later on, x\ 
enters the basis; and ultimately, xj leaves the basis at its upper bound, once again 
necessitating an upper-bound substitution and causing Xx to appear in the final 
tableau. 
Figure 5.3. Condensed Simplex Tableau for the Bounded-Variables Example.
REVIEW EXERCISES

The problems in exercises 1 through 7 refer to the Product-Mix Selection Example
in Sec. 4.4, with the initial system of equations being given by (I) and the
final system by (F), as shown at the beginning of Sec. 5.1. Think carefully about
each problem, because in many of the exercises you do not need to repeat all the
computations of the simplex algorithm to obtain the answer.

1 (a) Assume the coefficient of X2 in the original objective function is 4 °, instead of 5 . 

Calculate the coefficients of Xi in Row 0 at each iteration of the simplex method, 
and verify that at the final iteration the coefficient is ^(= y — |). 

(b) Answer part (a) for a coefficient of 5 |, instead of 5 . 

(c) Assume the coefficient of x^ in the original objective function is 10 , instead of 11 . 
Calculate the coefficients of x^ in Row 0 at each iteration of the simplex method, 
and verify that at the final iteration the coefficient is y(= ^ — 1). 

(d) Answer part (c) for a coefficient of 12 , instead of 11 . 

(e) Assume the coefficient of $x_3$ in the original objective function is $(9 + \delta)$. Over
what interval for $\delta$ does the basic solution in (F) remain optimal? What variable
do you enter into the solution when $\delta$ lies below the interval? When $\delta$ lies above
this interval?

(f) Assume the slack variable x^ has a coefficient 8 in the original objective function. 
Answer the questions posed in part (e). 

2 Is the basic solution in (F) still optimal if the original objective function is to maximize 

(a) 6xi + 5x2 + 9x3 + 12x4? 

(b) 3 Jxi + 5^X3 + 9x3 + 12^X4? 

(c) 3^X1 + 5x2 + 9x3 + 12x4? 

(d) 6ixi + 5 ?X 2 +9x3+ 11x4? 

(e) 4 xi + 5?X2 + 8x3 + 12^X4? 

(f) 4 xi + 5 x 2 + 7^X3 +12x4? 

(g) 4 xi + 5 x 2 + 13^X3 + 12x4? 

(h) 4 xi + 5 ?X 2 + 13 x 3 + 12x4? 

(i) In each case above, if the basic solution in (F) is not optimal, state which variable 
you enter into the next basis. 

3 Is the basic solution in (F) still optimal if the original objective function is to maximize 

(a) $7x_1 + 8x_2 + 14x_3 + 18x_4$?

(b) $8x_1 + 9x_2 + 13x_3 + 21x_4$?

(c) $11x_1 + 16x_2 + 32x_3 + 46x_4$?

(d) $12x_1 + 17x_2 + 33x_3 + 45x_4$?

(e) $33x_1 + 50x_2 + 103x_3 + 163x_4$?

(f) In each case above, if the basic solution in (F) is not optimal, state which variable
you enter into the next basis.

*4 Assume the coefficient of $x_1$ in the original objective function is $(4 + \delta_1)$ and, similarly,
the coefficient of $x_3$ is $(9 + \delta_3)$.

(a) Using a manner analogous to the derivation of (5) and the resultant inequalities
in (6), in Sec. 5.2, derive a set of inequalities for the pair $(\delta_1, \delta_3)$ such that the
basic solution in (F) remains optimal provided the pair satisfies all these inequalities.

(b) Draw a graph of the region implied by the set of inequalities in part (a).

(c) Assume the coefficient of $x_1$ in the original objective function is $c_1$, and similarly
the coefficient of $x_3$ is $c_3$. Analogous to part (b), draw a graph of the region for
$(c_1, c_3)$ such that the basic solution in (F) remains optimal provided this pair of
coefficients lies within the displayed region.

5 (a) Assume that the right-hand-side constant in Row 2 of (I) is 100, instead of 120.
Calculate the right-hand-side constants in all the rows at each iteration of the
simplex method, and verify that at the final iteration the constant in Row 2 is
$\tfrac{185}{7}$ $(= \tfrac{325}{7} - 20)$.

(b) Answer part (a) for a constant (120 — instead of 120. 

(c) Assume that the right-hand-side constant in Row 1 of (I) is 11, instead of 15. 
Calculate the right-hand-side constants in all the rows at each iteration of the 
simplex method, and verify that the basic solution in (F) is still optimal. 

(d) Answer part (c) for a constant of 20, instead of 15. 

(e) Assume that the right-hand-side constant in Row 1 of (I) is $(15 + \delta)$. Verify the
assertion in (2) of Sec. 5.3 that the basic solution in (F) remains feasible provided
$\delta$ satisfies $-5 \le \delta \le \tfrac{325}{61}$.

(f) Assume that the right-hand-side constant in Row 3 of (I) is $(100 + \delta)$. Over
what interval for $\delta$ does the basic solution in (F) remain feasible?

6 Does the basic solution in (F) remain feasible if the right-hand-side constants in Rows 

1, 2, and 3 are, respectively, 

(a) 20, 110, and 130? 

(b) 25, 75, and 215? 

(c) 15, 30, and 185? 

(d) 18, 150, and 40? 

(e) 22, 150, and 80? 

(f) In each case above, indicate the optimal values of each variable and the objec¬ 
tive-function value if the basic solution in (F) remains feasible. Otherwise, indicate 
the infeasibility. 

*7 Assume that the right-hand-side constant in Row 1 is $(15 + \delta_1)$ and in Row 3 is
$(100 + \delta_3)$.

(a) Derive a set of inequalities for the pair $(\delta_1, \delta_3)$ such that the basic solution in (F)
remains feasible provided the pair satisfies all these inequalities.

(b) Draw a graph of the region implied by the set of inequalities in part (a).

(c) Assume the right-hand-side constant in Row 1 is $b_1$, and, similarly, that the
constant in Row 3 is $b_3$. Analogous to part (b), draw a graph of the region for
$(b_1, b_3)$ such that the basic solution in (F) remains feasible provided this pair of
constants lies within the displayed region.

(d) How do your answers to the preceding parts change if the right-hand-side constant
in Row 2 is 121? Is 119?
The problems in exercises 8 through 11 below refer to the Product-Mix Selection
Example in Sec. 4.4, which is repeated at the beginning of Sec. 5.5.

8 In parts (a)-(h), modify the example as indicated and write the dual problem. 

(a) The primal objective function to be maximized is 6 a:| + 8^:3 -f- Vxj + 12 x 4 . 

(b) The right-hand-side constants in Rows 1, 2, and 3 are 25, 80, and 150, respectively.

(c) The coefficients of in Rows 1, 2, and 3 are 1, 4, and 9, respectively. 

(d) The coefficients of $x_2$ in Rows 1, 2, and 3 are 1, 6, and $-8$, respectively, and the
objective-function coefficient is $-3$, instead of 5.

(e) There is an additional variable z having coefficients in Rows 1, 2, and 3 of 1, 1, 
and 18, respectively, and an objective-function coefficient of 13. 

(f) There is an additional constraint 4xi + 1x2 ^ 50. 

(g) All the modifications in the above parts are in effect. 

(h) Examine the dual problem as given by (1) and (2) in Sec. 5.5. Specify two 
feasible solutions and the corresponding values of the objective function. 

*9 Review the iterations of the simplex algorithm, as given in Sec. 4.4. At each iteration,

(a) What are the trial values for the dual variables?

(b) What are the associated infeasibilities in the dual problem, and the corresponding
value of the dual objective function?

*10 Find optimal values for the dual variables, assuming that the basic solution in (F) of
Sec. 5.1 is optimal and the objective-function coefficients of $x_1$ and $x_3$ are, respectively,

(a) 7 and 14.
(b) 8 and 15.
(c) 11 and 32.
(d) 12 and 33.
(e) 33 and 103.

11 (a) Assume the coefficient of ^4 in the objective function is (9 -f- 6 ). ^Use the dual 
constraint associated with A 4 to derive the largest value for 3 that leaves the basic 
solution in (F) optimal. 

(b) Find the optimal value of the primal objective function if the right-hand-side 
constants in Rows 1, 2, and 3 are, respectively, 

16, 120, and 100 15, 120, and 101 16, 120, and 101 14, 121, and 101; 

14, 120, and 100 15, 120, and 99 14, 120, and 99 16, 119, and 99. 

*(c) Assume the right-hand-side constant in Row 1 is $(15 + \delta)$. Will the objective
function increase by more or less than $\tfrac{13}{7}\delta$ for $\delta > \tfrac{325}{61}$? For $\delta < -5$? Give
an economic justification of your answers using the context of the Product-Mix
Selection Example.

(d) Suppose the coefficient of $x_2$ in Row 3 is $(5 + \delta)$. How small can $\delta$ be such that
the basic solution in (F) remains optimal?

(e) Suppose the coefficient of $x_2$ in Row 1 is $(1 + \delta)$. How small can $\delta$ be such that
the basic solution in (F) remains optimal?

(f) Suppose you add a variable z having coefficients in Rows 1, 2, and 3 of 1, 1, and 
18, respectively. What is the smallest coefficient for z in the objective function 
such that the final basic solution in (F) remains optimal? 

(g) Answer part (f) assuming the coefficient of z in Row 3 is 16. 

*(h) What are the coefficients for z in Rows 1, 2, and 3 of (F), given the coefficients for
z stated in part (f)? Answer the same question using the data in part (g).

*(i) Suppose the coefficient of $x_1$ in Row 3 is $(3 + \delta)$. Discuss how you would find an
interval for $\delta$ such that the basic solution in (F) remains optimal.

*12 This problem refers to the example given in (1) and (2) of Sec. 5.8. 

(a) What variable do you remove from the basis at iteration 1 if the right-hand-side 
constant in the second constraint of (2) is 10, instead of 8 ? If the constant in the 
first constraint is 10, instead of 5? If the constant in the first constraint is 15, 
instead of 5, and in the second constraint is 10, instead of 8 ? 

(b) Assume that you remove $x_5$ from the basis at iteration 1. Determine which
variable you enter into the basis, and what its new value will be if the objective-function
coefficients of $x_1$, $x_2$, and $x_3$ are, respectively,

1, 0, and 2 8 , 8 , and 8 4, 1, and 3 4, and i, 

instead of 2 , 0 , and 1 . 

(c) Answer part (b) where the coefficient of $x_2$ in the second constraint of (2) is 2,
instead of $-2$.

(d) Answer part (b) where the coefficient of $x_3$ in the second constraint of (2) is 2,
instead of 4.

(e) What occurs when you remove $x_4$ from the basis at iteration 1, instead of $x_5$?

*13 Determine whether the basic solution in (F) for the Product-Mix Selection Example 

remains feasible when the right-hand-side constant in the constraint (1) of Sec. 5.9 is 

(a) 140. (c) 170. 

(b) 160. (d) For each of the above parts, find an optimal solution. 


*14 Suppose the right-hand-side constant in Row 1 of (I) for the Product-Mix Selection 
Example is 5, instead of 15. Find an optimal solution. Use the dual simplex algorithm 
and start with the equations (F), appropriately revised to reflect the change in the 
constant. 


15 Explain your understanding of the following terms:

postoptimality analysis
sensitivity analysis
*ranging analysis
*multiple objective functions
*multiple right-hand sides
*parametric programming
duality
primal and dual programs
shadow prices
*complementary slackness
*dual-feasible
*secondary constraints
*upper-bounded variables
*upper-bound substitution.


COMPUTATIONAL EXERCISES 

16 Consider the following problem

maximize $-2x_1 - 1x_2 + 3x_3 - 2x_4$

subject to

$1x_1 + 3x_2 - 1x_3 + 2x_4 \le 7$    Resource A
$-1x_1 - 2x_2 + 4x_3 \le 12$    Resource B
$-1x_1 - 4x_2 + 3x_3 + 8x_4 \le 10$    Resource C
every $x_j \ge 0$.

If you add $x_5$, $x_6$, and $x_7$ as slack variables, you have at the final iteration of the
simplex method

$x_0 + \frac{7}{5}x_1 + \frac{12}{5}x_4 + \frac{1}{5}x_5 + \frac{4}{5}x_6 = 11$    Row 0
$\frac{3}{10}x_1 + 1x_2 + \frac{4}{5}x_4 + \frac{2}{5}x_5 + \frac{1}{10}x_6 = 4$    Row 1
$-\frac{1}{10}x_1 + 1x_3 + \frac{2}{5}x_4 + \frac{1}{5}x_5 + \frac{3}{10}x_6 = 5$    Row 2
$\frac{1}{2}x_1 + 10x_4 + 1x_5 - \frac{1}{2}x_6 + 1x_7 = 11$    Row 3.

(a) State optimal values for each Xj and the objective function. Is the optimal solution 
unique? 

(b) For each nonbasic variable, give an interval for its objective-function coefficient 
such that the basic solution in part (a) remains optimal. 

(c) Answer part (b) for each basic variable. 

*(d) Assume the coefficient of $x_2$ in the objective function is $(-1 + \delta_2)$, and similarly,
the coefficient of $x_3$ is $(3 + \delta_3)$. Derive a set of inequalities for the pair $(\delta_2, \delta_3)$
such that the basic solution in part (a) remains optimal provided the pair satisfies
all these inequalities. Draw a graph of the region implied by this set of inequalities.

(e) Give an interval for each right-hand-side constant such that the basic solution in 
part (a) remains optimal. 

*(f) Assume that the right-hand-side constant for Resource A constraint is $(7 + \delta_1)$
and for Resource B constraint is $(12 + \delta_2)$. Derive a set of inequalities for the
pair $(\delta_1, \delta_2)$ such that the solution in part (a) remains feasible provided the pair
satisfies all these inequalities. Draw a graph of the region implied by this set of
inequalities.

(g) Write the dual problem. Indicate optimal values for the dual variables and 
calculate the associated value of the dual objective function. 

(h) Write an economic interpretation of each dual variable and illustrate your inter¬ 
pretation with numerical examples. 

(i) For each nonbasic variable, use the associated dual restriction to derive an interval 
for its objective-function coefficient such that the solution in part (a) remains 
optimal. 

(j) Suppose two new variables $x_8$ and $x_9$ are added to the model. Assume the coefficients
for $x_8$ in the constraints for Resources A, B, and C are 5, $-3$, and 1, respectively,
and the objective-function coefficient is 2. Assume that the coefficients for
$x_9$ in the constraints are $-2$, 10, and 12, and the objective-function coefficient is
—4. Can the solution in part (a) be improved? If so, show how. If not, indicate 
what happens if Xg is introduced into the basis. 

*(k) Find the coefficients for $x_8$ and $x_9$ appropriate to the equations for Rows 0, 1, 2,
and 3 shown above.

(l) Suppose the coefficient of $x_1$ in the Resource A constraint is $(1 + \delta)$. How small
can $\delta$ be such that the solution in part (a) remains optimal? How large?

(m) Suppose the coefficient of $x_1$ in the Resource B constraint is $(-1 + \delta)$. How
small can $\delta$ be such that the solution in part (a) remains optimal? How large?

(n) Answer the questions in parts (l) and (m) for the coefficients of $x_4$.

17 In each of the parts below, state optimal values for the decision variables and the
objective function. For each variable, give an interval for its objective-function
coefficient such that the basic solution at the final iteration of the simplex method
remains optimal. Give an interval for each right-hand-side constant such that the
basic solution at the final iteration remains optimal. Write the dual problem. Indicate
optimal values for the dual variables and calculate the associated value of the dual
objective function. For each nonbasic variable, use the associated dual restriction to
derive an interval for its objective-function coefficient such that the final solution
remains optimal.

(a) The One-Potato, Two-Potato Problem in Sec. 1.6 and exercise 18 of Chap. 4.

(b) The problem in exercise 19 of Chap. 4.

(c) The problem in exercise 20, part (a), of Chap. 4.

(d) The problem in exercise 20, part (b), of Chap. 4.

(e) The problem in exercise 21 of Chap. 4.

18 Suppose at some iteration in solving a linear programming model, you arrive at the 
relations shown in parts (a) and (b). Note that $x_0$ is being maximized and the variables
$x_4$, $x_5$, and $x_6$ are the slacks in the constraints for Resources A, B, and C, respectively.
Find an optimal solution and indicate whether it is unique. State how much an 
additional unit of each resource is worth. Give optimal values for the dual variables. 
Interpret the meaning of the dual variables, and illustrate your interpretation with 
numerical examples. 

(a) 

Xq — 4x2 2x8 =100 Objective function 

— 1x2 + 6x4 + 1x5 4- Sxg = 12 Resource A 

Ixi -}- 5x2 4- 10x4 4- 5x8 = 20 Resource B 

4- 1x2 + 1^3 + 1^4 + 10x8 = 10 Resource C 

(b) 

Xq — 2xi 4- 3x2 — 1 x 5 = 30 

2 xi — 2x2 4- lJ'f4 + 4 x 5 =6 

2xi 4 - Ixj 4 - 4x5 = 12 

4xi 4- 4 x 2 — 0 x 5 4 - X8 = 16 

19 Consider the problem

maximize $-10x_1 + 24x_2 + 20x_3 + 20x_4 + 25x_5$

subject to

$-1x_1 + 1x_2 + 2x_3 + 3x_4 + 5x_5 \le 19$
$-1x_1 + 4x_2 + 3x_3 + 2x_4 + 1x_5 \le 57$
every $x_j \ge 0$.

(a) Write the dual problem and verify that a feasible solution is $y_1 = 4$ and $y_2 = 5$.

(b) Use the information in part (a) to derive an optimal solution to both the primal
and dual problems.


*(c) Suppose the constraint 4 ^2 + ;cs 4- ^4 4- ^5 ^ 20 is imposed on the above
problem. Find optimal primal and dual solutions.


20 Consider the problem

maximize $-1x_1 + 7x_2 - 5x_3 + 14x_4$

subject to

$3x_1 + 4x_2 + 5x_3 + 6x_4 \le 24$
$-1x_1 + 1x_2 - 2x_3 + 2x_4 \le 4$
every $x_j \ge 0$.

(a) Write the dual problem and verify that a feasible solution is $y_1 = 1$ and $y_2 = 4$.

(b) Use the information in part (a) to derive an optimal solution to both the primal
and dual problems.

♦(c) Suppose the constraint Xi 4 - ^‘2 + 4 - J ^4 4 - <^5 < 8 is imposed on the above 

problem. Find optimal primal and dual solutions. 


21 Each part below refers to an example in Sec. 3.4. Write the dual problem, and draw
a solution space representation. Indicate a solution to the dual problem on the graph,
and calculate the corresponding values for the dual variables and the dual objective
function.

(a) Relations (1) through (4). 

(b) Relations (5), and (2) through (4). 

(c) Relations (6) through (9).

(d) Relations (10) through (13). 

*22 In parts (a) and (b), assume that $x_1$ and $x_2$ yield an optimal basic solution. State the
dual problem and optimal values for the dual variables. Find the largest values for
$c_3$ and $c_4$ that allow the assumption that $x_1$ and $x_2$ comprise an optimal basis.

(a) Maximize $-1x_1 + 18x_2 + c_3x_3 + c_4x_4$ subject to

$1x_1 + 2x_2 + 8x_3 + 4x_4 \le 3$
$-3x_1 + 4x_2 - 8x_3 - 6x_4 \le 1$
every $x_j \ge 0$.

(b) Maximize $-13x_1 + 24x_2 + c_3x_3 + c_4x_4$ subject to the same constraints as in
part (a).


♦23 Each problem below was solved in the exercises of Chap. 4 by the Big M Method. 
Find an optimal solution using the dual simplex algorithm. 


(a) Exercise 24, part (b). 

(b) Exercise 24, part (e). 

(c) Exercise 24, part (f). 

(d) Exercise 25, part (a). 


(e) Exercise 25, part (b). 

(f) Exercise 25, part (f). 

(g) Exercise 28, where you 
minimize x^ 4 “ 


*24 For each of the examples below from Chap. 4, use the dual simplex algorithm on the
dual problem to find an optimal solution to the dual problem.

(a) The example in exercise 19.

(b) The example in exercise 20, part (a). 

(c) The example in exercise 20, part (b). 

(d) In each part above, show how to find an optimal solution to the primal problem 
from the system of equations at the terminating iteration. 

(e) In each part above, compare the primal and dual trial solutions at each iteration 
of the simplex and the dual simplex methods. 

*25 (a) Use the dual simplex algorithm to solve the dual of the Product-Mix Selection
Example, given by (1) and (2) in Sec. 5.5.

(b) Consider the system of equations (I) for the Product-Mix Selection Example, as
given at the start of Sec. 5.1. Introduce and remove x^ from the basis. Verify
that the resultant trial values of the dual variables are dual-feasible. Are the basic
variables in the primal problem feasible? If so, indicate optimal primal and dual
solutions. If not, apply the dual simplex algorithm to obtain optimal solutions.

*26 Consider the problem in exercise 16. In each part below, state whether the optimal
solution for that problem remains feasible when the model is augmented by the
specified constraint. Determine optimal primal and dual solutions.

(a) ATi + Afj + Xj + ^4 ^ 10.

(b) -h X 2 + X 3 -f- Ar 4 ^ 6 .

(c) ATg + ATj ^ 0.

(d) Xg H- X 3 ^ - 1 .

(e) Afi + X 2 + X 3 + ^4 ^ 10 -

*27 Consider the following problem

minimize $\sum_{t=1}^{T} y_t$

subject to

$y_T + y_1 \ge r_1$
$y_{t-1} + y_t \ge r_t$  for $t = 2, 3, \ldots, T$
every $y_t \ge 0$.

An example of this model is the Kleen City Police Department Problem, stated in
exercise 30 of Chap. 2.

(a) Display the model in full for T = 5, where $r_1 = 8$, $r_2 = 7$, $r_3 = 10$, $r_4 = 10$, and
$r_5 = 2$.

(b) Calling the above problem the dual, write the associated primal problem.

(c) Solve the dual by the dual simplex algorithm, and indicate the associated optimal
primal solution. (Hint: you need no more than five iterations.)

(d) Solve the primal by the simplex algorithm and indicate the associated dual solution.
(Hint: you need no more than five iterations.)

(e) Find optimal primal and dual solutions when $r_1 = 9$, instead of 8.

*28 Solve each problem by the bounded-variables algorithm in Sec. 5.10.

(a) Maximize 5xi -f- Mx 2 + 24^3 -f- 20^4 subject to

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Ixi -I- 6x2 + 12^3 + 8x4 ^ 8 
2xi + 2x2 H- 12x3 + 4x4 ^ 4 
0 ^ Xj ^ 1 for j = 1, 2, 3, 4. 

(b) Maximize 9xi + 1x2 — 1^X3 — 5x4 subject to 

—3x1 4- 2x2 4- 9 x 3 4- 1^4 ^ 7 
6x1 4- 16x2 — 12x3 — 2x4 < 10 
0 ^ xj ^ 1 for j = 1, 2, 3, 4. 


MIND-EXPANDING EXERCISES 

♦29 Duality and Complementary Slackness 

(a) Show that the Dual Theorem implies the Theorem of Complementary Slackness. 

(b) Show that the Theorem of Complementary Slackness implies the Dual Theorem. 

♦30 Show how the Dual Theorem for the primal problem (1) through (3) and the dual 
problem (4) through (6) in Sec. 5.4 can be extended to cover the pair of problems (i) 
through (v) and (i') through (v') stated at the end of Sec. 5.4. 

31 Consider the problem

maximize $\sum_{j=1}^{n} (c_j + \delta c_j^*) x_j$

subject to

$\sum_{j=1}^{n} a_{ij} x_j = b_i + \delta b_i^*$  for $i = 1, 2, \ldots, m$
every $x_j \ge 0$.

(a) Suppose you have found an optimal basic solution for a fixed value of $\delta$, say $\bar{\delta}$.
Derive upper and lower bounds for $\delta$ such that, for $\delta$ in that interval, the same
set of basic variables remains optimal.

(b) Apply your answer in part (a) to the Product-Mix Selection Example where
$\bar{\delta} = 0$; $c_1^* = -1$, $c_2^* = 0$, $c_3^* = 0$, and $c_4^* = 2$; and $b_1^* = 1$, $b_2^* = -20$, and
$b_3^* = 0$.

32 Sensitivity Testing. Consider the linear programming problem

maximize $\sum_{j=1}^{n} c_j x_j$

subject to 

71 

for I = 1, 2 ,..., m 

^-1 

every Xj ^ 0. 

Let F be the optimal value of the objective function.
Suppose that the $c_j$, $a_{ij}$, $b_i$ are not known with certainty but are known to lie within
the intervals

cj^Ci<cj br^bt-^bt. 

Derive upper and lower bounds for F which make use of only the end points of the 
intervals. 

*33 Dual Simplex Algorithm

(a) Suppose all the constraints are inequalities. Derive a method for obtaining a dual- 
feasible solution. 

(b) Suppose some of the constraints are equalities. Derive a method using artificial 
variables for obtaining a dual-feasible solution. 

♦34 Upper-Bounded Variables 

(a) State how you would revise the dual simplex algorithm to accommodate
upper-bounded variables.

(b) Apply your method to the example in (5) and (6) in Sec. 5.10. 

(c) Apply your method to part (a) in exercise (28). To part (b). 

35 Two-Person, Zero-Sum Matrix Games. Consider a game between Player I and Player II.
Player I may select any one of m choices, indexed i = 1, 2, ..., m. Player II may
select any one of n choices, indexed j = 1, 2, ..., n. If Player I selects i and Player II
selects j, then Player I receives the amount $a_{ij}$, and Player II receives $-a_{ij}$. (Note
that $a_{ij}$ may be negative.)

For example, suppose each player can select the integer 1 or 2. If Players I and II
both choose 1, then assume $a_{11} = 2$; if Players I and II both choose 2, then assume
$a_{22} = 4$. If Player I chooses 1 and Player II chooses 2, then assume $a_{12} = -3$; and if
Player I chooses 2 and Player II chooses 1, then assume $a_{21} = -3$. The payoffs $a_{ij}$ to
Player I can be displayed as the matrix

                Player II
Player I   $\begin{bmatrix} 2 & -3 \\ -3 & 4 \end{bmatrix}$

Suppose that neither player actually makes the selection himself, but transmits
instructions for making his selection to an impartial referee. Both players know all the
$a_{ij}$, but neither player knows the instructions of his opponent.

Assume each player is permitted to instruct the referee to make a choice by means
of a random selection. Specifically, Player I provides selection probabilities $y_1, y_2, \ldots, y_m$,
where each $y_i \ge 0$ and $\sum_{i=1}^{m} y_i = 1$; likewise, Player II provides selection
probabilities $x_1, x_2, \ldots, x_n$, where each $x_j \ge 0$ and $\sum_{j=1}^{n} x_j = 1$. Each of these
probability selections is called a mixed strategy.

(a) Given probabilities $x_j$, let $v_2$ be the maximum expected value of Player I’s payoffs
among all his selections i = 1, 2, ..., m. Formulate a linear programming model
for choosing $x_j$ such that $v_2$ is as small as possible. An optimal solution is called
Player II’s minimax strategy.

(b) Given probabilities $y_i$, let $v_1$ be the minimum expected value of Player I’s payoffs
among all of Player II’s selections j = 1, 2, ..., n. Formulate a linear programming
model for choosing $y_i$ such that $v_1$ is maximized. An optimal solution is
called Player I’s maximin strategy.

(c) Apply the Dual Theorem to show that the minimum of $v_2$ equals the maximum
of $v_1$. This result is referred to sometimes as Von Neumann’s Fundamental Theorem
for Two-Person, Constant-Sum Matrix Games and sometimes as the Minimax Theorem.
Let the common value be V, which is called the value of the game. [Hint: You can
assume without loss of generality that all $a_{ij} > 0$; see part (d).]

(d) Show that optimal values for $x_j$ and $y_i$ remain optimal when the payoffs are
$a_{ij} + k$ for any constant k, and the associated value of the game is V + k.

(e) Show why feasible $x_j$ and $y_i$ always exist.

(f) Find optimal probabilities for the numerical example above.

(g) Verify that the expected value that Player I receives is

$\sum_{i=1}^{m} \sum_{j=1}^{n} a_{ij} y_i x_j$.

Show that this amount equals V when both players use optimal $x_j$ and $y_i$ found
in parts (a) and (b).

(h) Show that both players need not assign positive probabilities to more than the 
smaller of (m, n) of their selection possibilities. 

(i) Show that if, for optimal $x_j$,

$\sum_{j=1}^{n} a_{ij} x_j < V$,

then $y_i = 0$ is optimal. Similarly, if, for optimal $y_i$,

$\sum_{i=1}^{m} a_{ij} y_i > V$,

then $x_j = 0$ is optimal.

(j) Show that if $y_i = 1$ and $x_j = 1$ are optimal then $a_{ij}$ equals both minimum
$(a_{i1}, a_{i2}, \ldots, a_{in})$ and maximum $(a_{1j}, a_{2j}, \ldots, a_{mj})$. The element $a_{ij}$ is called
a saddle point for the payoff matrix. Further, if a^i = a^, then an = a^j = aij, 

(k) Show that if m = n, $a_{ii} = 0$ for i = 1, 2, ..., m, and $a_{ij} = -a_{ji}$ (called a
skew-symmetric matrix game), then V = 0, and $y_i = x_i$ in an optimal solution.

(l) Suppose $b_{ij}$ are payoffs to Player II, where $b_{ij}$ need not equal $-a_{ij}$, but where
$a_{ij} + b_{ij}$ equals the same constant for every i and j. This is known as a constant-sum
game. Show how to convert
a constant-sum game to a zero-sum game having the identical optimal strategies.

36 Warehouse Problem. The partnership of Byer, Long, Geller, Short trades in the purchase
and sale of agricultural commodities. It owns a large storage tank with a capacity of
C gallons, in which it keeps soybean oil. At the start of the present period, designated
Period 1, the firm owns $G$ ($\le C$) gallons. The firm wants a profit-maximizing plan
for selling and purchasing soybean oil over a horizon of T periods. (To keep the
formulation of the problem simple, assume the firm places no value on soybean oil
left at the end of Period T.) This model is often referred to as the Warehouse Problem.
Suppose the firm can forecast for each Period t the selling price $p_t$ as well as the
purchase cost $c_t$ for each gallon of oil. For a variety of economic causes, the selling
price is sometimes larger and sometimes smaller than the purchase cost. Assume
that a purchase in Period t is not delivered until the very end of the period (too late
for a subsequent sale in that period); thus the firm is prohibited from “making a
killing” by both purchasing and selling an unlimited amount in a period when the
selling price exceeds the purchase cost.

Let $w_t$ be the number of gallons sold and $x_t$ the number of gallons purchased in
Period t, where every $w_t \ge 0$ and $x_t \ge 0$. These amounts must satisfy two types of
constraints. First, there are T capacity constraints that state the gallonage at the very
end of each period cannot exceed the limit C. Thus, in Period 1 the restriction is that
$G - w_1 + x_1 \le C$, where G is a known constant. Similarly, in Period t, the amount
G plus the difference between total purchases and total sales over Periods 1, 2, ..., t
must not exceed the capacity C.

Second, there are T sales constraints that state the gallonage sold in Period t cannot
exceed that which is on hand at the beginning of the period. Thus, in Period 1 the
restriction is that $w_1 \le G$. In Period t, the restriction is that $w_t$ cannot be larger than
G plus the difference between total purchases and total sales over Periods 1, 2, ...,
t - 1.

(a) Display the model in full for T = 5. 

(b) For each Period t, let $y_t$ be the dual variable for the capacity constraint and $z_t$
the dual variable for the sales-quantity constraint. Display the dual problem for
T = 5.

(c) Devise an algorithm for determining optimal values of the dual variables in the
order $y_T, z_T, y_{T-1}, \ldots, y_1$. Given these optimal values, explain how to
derive an optimal plan of selling and purchasing.

(d) Apply your method to the case where C = 100, G = 75, and

$p_1 = 2$    $p_2 = 3$    $p_3 = 1$    $p_4 = 2$    $p_5 = 3$

$c_1 = 2$    $c_2 = 1$    $c_3 = 1$    $c_4 = 4$    $c_5 = 1$.

How would your answer change if C = 200?

(e) Suppose gallonage at the end of Period t is charged an inventory holding cost of
$h_t$ per gallon. (Note that inventory at the end of Period t is G plus the difference
between total purchases and total sales over Periods 1, 2, ..., t.) Show how to
modify your answers in parts (a), (b), and (c) to account for storage costs.

*(f) Characterize the form of an optimal policy for the Warehouse Problem.

*(g) Show how to modify the formulation when there are T liquidity constraints that
limit the cash amount of a purchase in each period to be no larger than the
accumulated cash over the preceding periods (assume K is the initial amount of
cash at the start of Period 1).


CHAPTER 6


Optimization in
Networks


6.1 SIGNIFICANCE OF NETWORK MODELS

Network optimization models—usually special cases of linear programming 
models—have a twofold importance. They often pertain to problems of product 
distribution. Consequently, they are economically significant for many commercial 
enterprises that operate several plants and hold inventory in local warehouses. In 
addition, networks have a mathematical structure identical to that of other opera¬ 
tions research models that seem unrelated at first glance. But these two reasons 
alone do not warrant your singling out network models to study in greater detail. 

The key justification is that the mathematical characteristics of network models 
are so special that by exploiting these structural properties you can obtain major 
efficiencies in finding optimal solutions. In actual industrial applications, network 
models often contain thousands of activities and hundreds of constraints, so that 
using a streamlined algorithm is not only worthwhile but sometimes a practical 
necessity. By investigating networks, you also benefit from seeing how a variety of 
apparently disparate operations research models are amenable to an insightful 
unifying mode of analysis. 

This chapter concentrates on network model formulation. The first several 
examples deal with planning decisions and will illustrate why network terminology 
provides a convenient graphical description of the optimization problems. You 
will easily recognize the connections between the components of the network and 
the elements of the actual problem. In the subsequent examples, the network 
representation is purely formal; for these you will have to rely on making abstract 
identifications to establish the counterparts. 

In Chap. 7, you will focus on algorithms for finding optimal solutions to network
models.


6.2 CLASSICAL TRANSPORTATION PROBLEM

The transportation (or distribution) problem was an early 
example of linear network optimization and is now a standard application for 
industrial firms having several manufacturing plants, warehouses, sales territories, 
and distribution outlets. The model’s primary usefulness is for planning. In this 
instance, the strategic decisions involve selecting transportation routes so as to 
allocate the production of various plants to several warehouses or terminal 
points. You already saw an example of this sort in the Saur Milk Company Problem
of Sec. 2.7. Some companies find it necessary to review their distribution
decisions as often as every month, especially if their order mix is subject to con¬ 
siderable variation. But typically a firm draws up such a distribution plan yearly. 

The mathematical description of the classical transportation problem is

(1)    minimize $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to

(2)    $\sum_{j=1}^{n} x_{ij} \le S_i$  for $i = 1, 2, \ldots, m$  (supply)

(3)    $\sum_{i=1}^{m} x_{ij} \ge D_j$  for $j = 1, 2, \ldots, n$  (demand)

(4)    $x_{ij} \ge 0$  for all i and j.



The network and technology tables for this model are shown in Figs. 6.1 and 6.2. 
(You may find it helpful to review Sec. 2.7, where we first introduced this model.) 

[Figure 6.1. Network for Transportation Problem.]

[Figure 6.2. (a) Transportation Table. (b) Technology Table for a Transportation Problem.]

In the standard interpretation of the model, there are m supply points with
items available to be shipped to the n demand points. Specifically, Plant i can
ship at most $S_i$ items, and Demand Point j requires at least $D_j$ items. The $S_i$ and
$D_j$ are fixed in reference to a stated time interval or planning horizon. The cost of
shipping each unit from Plant i to Demand Point j is $c_{ij}$. The objective is to select,
for the duration of the horizon, a routing plan that minimizes total transportation
costs.
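
As a rough illustration of the model (1) through (4), the following Python sketch evaluates the objective for a candidate routing plan and checks the supply, demand, and nonnegativity constraints. The function names and all the numbers are hypothetical, chosen only for this example; the sketch tests a given plan and does not find an optimal one.

```python
def plan_cost(cost, plan):
    """Objective (1): sum over all i, j of c_ij * x_ij."""
    return sum(c * x
               for c_row, x_row in zip(cost, plan)
               for c, x in zip(c_row, x_row))


def is_feasible(plan, supply, demand):
    """Check constraints (2), (3), and (4) for a candidate plan x_ij."""
    m, n = len(supply), len(demand)
    nonnegative = all(x >= 0 for row in plan for x in row)                # (4)
    supply_ok = all(sum(plan[i]) <= supply[i] for i in range(m))          # (2)
    demand_ok = all(sum(plan[i][j] for i in range(m)) >= demand[j]
                    for j in range(n))                                    # (3)
    return nonnegative and supply_ok and demand_ok


# Hypothetical data: 2 plants, 3 demand points.
supply = [30, 40]
demand = [20, 25, 15]
cost = [[4, 6, 9],
        [5, 3, 7]]
plan = [[20, 0, 0],
        [0, 25, 15]]

print(is_feasible(plan, supply, demand))  # True
print(plan_cost(cost, plan))              # 260
```

Here Plant 1 ships 20 of its 30 available items and Plant 2 ships all 40, so total supply exceeds total demand by 10 items.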

A key result in network theory is that among all the optimal solutions to the
model (1) through (4), there is at least one in which each $x_{ij}$ is integer-valued,
provided the $S_i$ and $D_j$ are all positive integers, which we assume from here on.
Therefore, strengthening (4) to

(5)    $x_{ij} = 0, 1, 2, \ldots$

does not adversely affect the value of (1). What is more, the simplex method will
find such an optimal solution. These results are demonstrated in Chap. 7.

If the unit cost of producing an item differs from plant to plant, then this cost
is included in the determination of $c_{ij}$. If for physical or economic reasons a certain
plant is inaccessible to a particular demand point, then the associated $x_{ij}$ is eliminated
or, if more convenient, the corresponding $c_{ij}$ is defined to be arbitrarily large.
To simplify the discussion, assume $c_{ij} > 0$. Then no optimal plan ships more than is
required at any demand point, so (3) can be rewritten with equalities.

For the model to possess a feasible solution, it certainly is necessary that total
supply is at least as large as total demand, $\sum_{i=1}^{m} S_i \ge \sum_{j=1}^{n} D_j$. There are a number
of applications in which you would expect to find the total supply in excess of the
total demand requirements. For example, $S_i$ sometimes represents the production
capacity of Plant i during the planning horizon, rather than an amount of the
commodity actually manufactured for distribution at the start of the period. In
analyzing a standard transportation model and devising an optimizing algorithm,
however, it is convenient to assume that total supply equals total demand:

(6)    $\sum_{i=1}^{m} S_i = \sum_{j=1}^{n} D_j$.

Employing a simple-minded formal device permits you to assert (6) without
any loss of generality: create a fictitious destination with a requirement of
$\sum_i S_i - \sum_j D_j$. Then let this fictitious destination be labeled the nth. Let $c_{in} = 0$
so that the interpretation of $x_{in}$ is “slack capacity at Plant i.” The sum of the capacities
now equals the sum of the requirements. Consequently, with this device the
relations (2) can just as well be written as equalities. Therefore, unless stated otherwise,
hereafter assume (6) holds and (2) and (3) are equalities, so the model is

(7)    minimize $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to

(8)    $\sum_{j=1}^{n} x_{ij} = S_i$  for $i = 1, 2, \ldots, m$  (supply)

(9)    $\sum_{i=1}^{m} x_{ij} = D_j$  for $j = 1, 2, \ldots, n$  (demand)

(10)   $x_{ij} = 0, 1, 2, \ldots$  for all i and j,

where all the $S_i$ and $D_j$ are positive integers satisfying (6).
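
The fictitious-destination device can be sketched in a few lines of Python. The function name `balance_with_dummy` and the data are hypothetical, used only for illustration. The sketch appends one extra destination whose requirement equals the excess of total supply over total demand and whose shipping cost from every plant is zero, so that (6) holds for the enlarged problem.

```python
def balance_with_dummy(cost, supply, demand):
    """Return (cost, demand) for an equivalent problem in which
    total supply equals total demand, as assumed in (6)."""
    total_supply, total_demand = sum(supply), sum(demand)
    if total_supply < total_demand:
        raise ValueError("infeasible: total supply is less than total demand")
    if total_supply == total_demand:
        return [row[:] for row in cost], list(demand)
    slack = total_supply - total_demand
    # Zero cost: shipments to the fictitious destination are slack capacity.
    balanced_cost = [row + [0] for row in cost]
    return balanced_cost, list(demand) + [slack]


# Hypothetical data: total supply 70, total demand 60.
supply = [30, 40]
demand = [20, 25, 15]
cost = [[4, 6, 9],
        [5, 3, 7]]

balanced_cost, balanced_demand = balance_with_dummy(cost, supply, demand)
print(balanced_demand)  # [20, 25, 15, 10]
print(balanced_cost)    # [[4, 6, 9, 0], [5, 3, 7, 0]]
```

Any amount a plant "ships" to the added fourth destination in a solution of the balanced problem is simply capacity left unused at that plant.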

Applying the model. Implicit in the mathematical description of the 
model (7) through (10) is the assumption that only a single type of commodity 
is being shipped. Why? Because in meeting the demand requirements, the model does 
not distinguish among the sources of supply. All the supply arriving at Demand 
Point j is commingled insofar as satisfying the demand constraint is concerned. 
We emphasize the single-commodity assumption because it would seem to
restrict the application of the model. After all, how many multiplant firms distribute
only one product? Occasionally, a multiplant, multiproduct company
can establish a separate routing pattern for each of its major commodities; but
usually there is a decided economic advantage to restricting the number of plants
that serve a warehouse or sales district. Therefore, most firms determine a complete
distribution system by taking explicit account of the entire product line. As you can
imagine, you have to use ingenuity to fit an actual distribution problem into the
mold of a standard transportation model. It is impossible to catalog here all the
devices that have proved successful for this purpose in actual practice. One
approach is sketched below to illustrate the basic notions.

Establish the product mix required over the planning horizon at each Demand 
Point j. Designate the total amount Dj of all products in some convenient and mean¬ 
ingful common unit of measurement, such as tons. Analogously, specify Si in the 
same units. To calculate Cij assume that if Plant i ships a ton to Demand Point j, 
the ton is comprised of the precise product mix required at Demand Point j. 
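
One way to read the approach just described is sketched below in Python. The names `composite_costs`, `unit_cost`, and `demand_by_product`, and all the figures, are assumptions made for this illustration only. The sketch takes per-product requirements in tons at each demand point, adds them to obtain the demand in tons, and computes each shipping cost as the average cost per ton weighted by the product mix required at the destination.

```python
def composite_costs(unit_cost, demand_by_product):
    """Return (cost, demand) for a single-commodity model measured in tons.

    unit_cost[i][j][k]      : assumed cost per ton of product k shipped
                              from Plant i to Demand Point j
    demand_by_product[j][k] : forecast tons of product k required at
                              Demand Point j
    """
    demand = [sum(row) for row in demand_by_product]       # D_j in tons
    cost = []
    for plant_costs in unit_cost:                          # one row per Plant i
        row = []
        for j, per_product in enumerate(plant_costs):
            # Each ton sent to point j is assumed to carry point j's own mix.
            mix = [tons / demand[j] for tons in demand_by_product[j]]
            row.append(sum(share * c for share, c in zip(mix, per_product)))
        cost.append(row)
    return cost, demand


# Hypothetical data: 2 plants, 2 demand points, 2 products.
demand_by_product = [[30, 10],
                     [5, 15]]
unit_cost = [[[4, 8], [6, 2]],
             [[2, 10], [8, 4]]]

cost, demand = composite_costs(unit_cost, demand_by_product)
print(demand)  # [40, 20]
print(cost)    # [[5.0, 3.0], [4.0, 5.0]]
```

The resulting cost and demand figures can then be used as the $c_{ij}$ and $D_j$ of a standard single-commodity model, with each $S_i$ stated in the same unit.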

How usable and good will the resultant numerical solution be? To answer 
sensibly, you must keep two factors in mind. First, the essence of the solution is a 
plan for product distribution during a specified time interval. That is, the model 
provides an assignment of plants to demand destination points. The numerical
values of the Xij are inherently approximate, since in most real applications the 
values of Dj are only forecasts of requirements during the planning horizon. 
Therefore, the Xij solution values do not represent amounts actually transported, 
but merely estimate the order of magnitude of future shipments. Second, the 
plan’s relative merit must be judged against whatever practical alternatives the 
firm can devise, including, of course, the current routing. In line with these 
points, many companies have improved their profits significantly by adopting 
a distribution plan based on a standard transportation model solution. 

What’s in a name. We conclude with a word about the nomenclature of
network models. All the network models discussed in this and the following chapters
are examples of linear programming problems. With the exception of the two
models at the end of this chapter, by an appropriate definition of symbols, all the 
problems can be shown to be examples of the model formulated above as (7) 
through (10). Therefore, in a formal mathematical sense, these are all illustrations 
of the classical transportation problem, even though they may have nothing to 
do with shipping a commodity. However, the category of transportation problems 
itself can be subdivided to give prominence to certain significant models in which 
additional assumptions are made. To illustrate, when $n = m$ and $S_i = D_j = 1$ for
all i and j, the model is termed an assignment problem, for reasons made clear in Sec.
6.4. The assignment problem in turn encompasses an important subclass of so-called
shortest- (or longest-) route models. These are illustrated in Secs. 6.5 and 6.6. To 
keep matters straight as you read, ask yourself what the additional structural 
assumptions are in each transportation model that make it convenient to narrow 
the problem’s designation with a special term. 
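
To make the assignment special case concrete, here is a small Python sketch with a hypothetical 3-by-3 cost table. With n = m and every supply and demand equal to 1, an integer-valued plan ships exactly one unit out of each origin and into each destination, so it corresponds to a permutation. The sketch simply enumerates all permutations, which is practical only for very small n and is not the algorithm developed in later sections.

```python
from itertools import permutations


def solve_assignment(cost):
    """Brute-force the assignment problem: n = m and all S_i = D_j = 1.

    Returns (total_cost, plan), where plan[i][j] is 1 if origin i is
    assigned to destination j and 0 otherwise.
    """
    n = len(cost)
    best = min(permutations(range(n)),
               key=lambda p: sum(cost[i][p[i]] for i in range(n)))
    total = sum(cost[i][best[i]] for i in range(n))
    plan = [[1 if best[i] == j else 0 for j in range(n)] for i in range(n)]
    return total, plan


# Hypothetical cost table.
cost = [[9, 2, 7],
        [6, 4, 3],
        [5, 8, 1]]

total, plan = solve_assignment(cost)
print(total)  # 9
print(plan)   # [[0, 1, 0], [1, 0, 0], [0, 0, 1]]
```

Every row and every column of the resulting plan sums to 1, which is exactly what the supply and demand equalities require when all $S_i$ and $D_j$ equal 1.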
► Several of the apparently restrictive assumptions of the transportation model are
easily modified or eliminated by appropriate reformulations. To give you an idea of
these extensions, a few are summarized.

Seasonal variation. The transportation model has been described as a method 
of finding a plan for a single time period, such as a span of 12 months. In certain instances, 
a company may want to shift its distribution routing plan at different times of the year 
to reflect seasonal variation in the pattern of demand requirements or supply availabilities.
If commodity items available in one month cannot be inventoried to satisfy the
requirements of subsequent months, then the firm only need solve 12 separate transportation
problems, one for each month. Most companies facing serious seasonal variations,
however, do have the opportunity to hold inventories in anticipation of demand.
Considerations of optimal seasonal inventory buildups can be accommodated by a 
straightforward reinterpretation of the symbols in the transportation model. 

To illustrate, suppose a firm with two plants and three demand points desires to 
establish an inventory distribution plan for January, February, . . . , December. Assume
that company policy is to have no seasonal inventory on hand at the beginning of the
year. Then let $S_i$, for $i = 1, 2, \ldots, 12$, denote the availability of the product for the
first plant for the 12 months, and similarly, for $i = 13, 14, \ldots, 24$, refer to
the second plant. By an analogous convention, let $D_j$, for $j = 1, 2, \ldots, 12$, refer to the
first destination’s requirements; for $j = 13, 14, \ldots, 24$, to the second’s; and for
$j = 25, 26, \ldots, 36$, to the third’s. Assume $\sum_{i=1}^{24} S_i = \sum_{j=1}^{36} D_j$. (This restriction can be
weakened, if necessary.)

Certain $x_{ij}$ must be eliminated to prevent the supply in midyear being used to meet
the requirements at the start of the year. For example, you must remove $x_{6,25}$, which
would represent shipping items available at the first plant in June to meet the January
requirements at the third destination. In determining $c_{ij}$ for the remaining $x_{ij}$, you may
want to include an inventory holding cost. This figure would reflect expenses directly 
associated with an inventory buildup, such as warehouse rental or interest charged on 
short-term borrowings if bank financing is used. 
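
The index convention above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `month_of`, `is_permitted`, and `holding_cost` are hypothetical, and the flat monthly holding rate is an invented placeholder, not a figure from the text. The sketch keeps a route only when the supply month is no later than the demand month.

```python
MONTHS = 12  # planning periods: January .. December

def month_of(index):
    """Month (1..12) encoded by a supply row i or a demand column j."""
    return (index - 1) % MONTHS + 1

def is_permitted(i, j):
    """Keep x_ij only if the stock exists no later than the month it is needed."""
    return month_of(i) <= month_of(j)

def holding_cost(i, j, rate_per_month=1.0):
    """Hypothetical holding charge: a flat rate for each month an item is stored."""
    return rate_per_month * (month_of(j) - month_of(i))

# Rows 1..24 are (plant, month) pairs; columns 1..36 are (destination, month) pairs.
permitted = [(i, j) for i in range(1, 25) for j in range(1, 37) if is_permitted(i, j)]

# Plant 1 in June (row 6) against Destination 3 in January (column 25).
print(is_permitted(6, 25))
print(len(permitted), "of", 24 * 36, "routes are kept")
```

In an actual model the holding charge would be added to the shipping cost of each permitted route.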

Capacity limitations. Occasionally you must add capacity restrictions on
the $x_{ij}$. The simplest class of restrictions is

(i) $x_{ij} \le u_{ij}$.

When (i) is appended to (7) through (10), the model is often referred to as the
capacitated (or bounded variables) transportation problem. More
general capacity constraints include bounding sums of the $x_{ij}$, such as for several plants
and all demand points, or for several demand points and one plant. Frequently, adding
these capacity constraints entails only a minor increase in the computational effort 
required to find an optimal solution; but, of course, they can be so restrictive that a 
feasible solution is not possible. 

Variable supply. In certain situations, $S_i$ may have to be treated as one of the
variables in the optimization. When this happens, the transportation model is imbedded 
in a larger linear programming problem. With advanced algorithmic techniques, such 
as the decomposition algorithm described in Sec. 15.11, it still is possible to exploit the 
structural properties of the transportation network model in the computations. 
Another important extension of the classical transportation model is the facility to
route an item available at Plant i to Demand Point j through an intermediate transshipment
point. This consideration is discussed in detail in the next section. ◄

6.3 TRANSSHIPMENT MODEL 

It is important to extend the classical transportation model to include cases in 
which a location can act as a point of transshipment. For example, transshipment 
often occurs in the distribution system of national department store chains. Such 
companies typically have regional warehouses that ship to smaller district warehouses,
which in turn ship to the retail stores. In this illustration, the district warehouses
are the transshipment points. The transshipment model is a useful tool for
a company deciding on the optimal number and location of its warehouses, since
the network analysis yields a minimum-cost routing plan for any specific configuration
of warehouses. An additional reason for studying the model is that other
operations research models which turn out to be mathematically equivalent to 
network problems have the transshipment form. 

The model is illustrated here by means of a commercial example which, although 
hypothetical, is suggested by a real application arising out of the operations of 
military logistics systems. In the preceding section the problem was introduced 
by a mathematical representation and was subsequently characterized by verbal, 
graphical network, and tabular descriptions. Now the process will be reversed in 
order to give you practice in a variety of initial formulations. 

The Rock-Bottom Discount Store Problem. This company has
eight large major appliance discount stores located in several states. The Sales 
Department has decided to substantially reduce the price of a certain costly item 
in order to close out the stock now on hand. Before launching the advertising 
campaign, management wants to position its current inventory among the eight 
stores according to its sales expectations at each location. To do this, it is necessary 
to redistribute some of the stock. 

Figure 6.3 is a diagrammatic map; the numbered nodes, or points, represent 
the eight stores. A positive value next to a node signifies the amount of inventory 
to be redistributed to the rest of the system. A negative value indicates the additional
amount of stock required for the sale. Thus Stores 1 and 4 have excess
stock of ten and two items, respectively. Stores 3, 6, and 8 require three, one, 
and eight more items, respectively. The inventory positions of Stores 2, 5, and 7 
are to remain unchanged. 

Observe that an appliance may be shipped through Stores 2, 4, 5, 6, and 7. 
Consequently, these locations are termed transshipment (or intermediate)
points. Each remaining store is termed a source if it has excess inventory,
and a sink if it needs stock. Consequently Store 1 is a source, and Stores 3 and 8 
are sinks. 

The value $c_{ij} > 0$ refers to the cost of shipping each of these items along the indicated
route. To transport an appliance from Store 1 to Store 6, the total shipping
cost is $c_{12} + c_{25} + c_{56}$. The shipping route from Store 1 to Store 6 uses 2 and 5 as
transshipment points. The model allows for a shipment between 4 and 5 in either
direction. Assume in this situation that $c_{45} < c_{54}$ for the following reason. At the
time the shipment must be made the company has space available in its own truck 
only on the route from 4 to 5. It has to incur a higher cost in the reverse direction 
because it must hire a common carrier. After carefully inspecting the figure, you 
will realize that you cannot rule out either one of the routes between 4 and 5 
until the entire problem has been analyzed. In this example, the company’s 
objective is to redistribute its inventory at the lowest total transportation cost. 

Equivalence to a transportation problem. Before translating 
the preceding paragraphs into formal mathematical terms, we describe two ways 
to convert the transshipment problem to a standard transportation problem. 
Such conversions are of practical significance because they permit you to use 
readily available computer programs written to solve standard transportation 
problems. You need only supply the computer the data for the transshipment 
model in one of the formats below. Another reason for interest in the equivalence 
is that it ensures there is no difficulty in finding an optimal solution with integer- 
valued variables. 

In the first approach, you start by finding the least-cost ways to send one item 
each from Store 1, which has an excess supply, to Stores 3, 6, and 8, which require
more items. Denote the associated minimum costs by the symbols $\bar{c}_{13}$, $\bar{c}_{16}$, and $\bar{c}_{18}$.
Turning to Store 4, which has an excess supply, you continue similarly by finding
$\bar{c}_{43}$, $\bar{c}_{46}$, and $\bar{c}_{48}$. Then you can proceed by using a standard transportation problem
table, as illustrated by the trial solution in Fig. 6.4. Note that in this condensed 
table there is one row for each store with an excess supply, and one column for 
each store with a demand requirement. The transshipment points are not shown 
explicitly. But if, say, a least-cost way to 
ship a unit from Store 4 to Store 8 is via 
Store 7, then the trial level of $x_{48} = 2$
indicates that two units are transshipped
from Store 4 through Store 7 destined 
for Store 8. 

The alternative approach, presented 
next, often has certain advantages over 
the first method in actual large-scale 
applications. Specifically, the alternative 
does not entail what might be a considerable
computational effort to calculate
all the least-cost routes required by the 
first approach. In addition, it may have 
fewer variables in the resultant standard transportation problem. And, finally, 
it can more easily handle capacity constraints on each route. 


Figure 6.4. Condensed Transshipment Table. Note: $\bar{c}_{ij}$ is the least cost of shipping
a unit from Point i to Point j.

Figure 6.5. Expanded Transshipment Table. (Row supplies for Points 1, 2, 4, 5, 6, and 7:
10, 12, 14, 12, 11, 12.)


To see the idea of the second alternative, study Fig. 6.5, which is constructed 
from the network in Fig. 6.3. The rules to be followed in making up such a table 
are 

(i) Designate a row for each source. Its $S_i$ value is the stock supplied.

(ii) Designate a column for each sink. Its $D_j$ value is the amount of stock
demanded.

(iii) Designate a row and a column for each transshipment point. Let $T_k$ be the
point’s net stock position. If stock is supplied, $T_k$ is a positive number.
If stock is demanded, $T_k$ is a negative number. Then for Point k, let
$S_k = T_k + B$ and $D_k = B$, where B is the sum of the stock available at all
points.

(iv) Permit $x_{ij}$, for $i \ne j$, only for arcs existing in the original network. For a
transshipment Point k, also permit $x_{kk}$ with $c_{kk} = 0$.

Thus Rule (i) pertains to Store 1 and Rule (ii) to Stores 3 and 8. With $B = 12$,
Rule (iii) states, for example, that $D_4 = 12$ and $S_4 = 2 + 12 = 14$; $D_5 = 12$ and
$S_5 = 0 + 12 = 12$; and $D_6 = 12$ and $S_6 = -1 + 12 = 11$.

To achieve the second conversion to a standard transportation model, B has been
introduced in the role of a fictitious buffer stock at each transshipment
point. If k is such a point, B has been included in both $S_k$ and $D_k$. Therefore the
sum of all the $S_i$ remains equal to the sum of all the $D_j$. The value of B must be
sufficiently large to accommodate any transshipment amount that is optimal.
One simple way to achieve this condition is to let B be the sum of the stock available
at all points. The value of $B - x_{kk}$ represents the total amount of stock transshipped
through Point k.
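
Rules (i) through (iii) can be checked mechanically. The following Python sketch is illustrative only: the function name `expanded_margins` is hypothetical, and the net stock positions are the ones stated in the text for Fig. 6.3. It computes the buffer stock B and the row supplies and column demands of the expanded table.

```python
# Net stock positions described in the text for Fig. 6.3:
# positive = excess stock, negative = requirement, zero = unchanged.
net = {1: 10, 2: 0, 3: -3, 4: 2, 5: 0, 6: -1, 7: 0, 8: -8}
transshipment = {2, 4, 5, 6, 7}

def expanded_margins(net, transshipment):
    """Return (B, row supplies S, column demands D) for the expanded table."""
    B = sum(t for t in net.values() if t > 0)   # total stock available
    supply, demand = {}, {}
    for k, t in net.items():
        if k in transshipment:                  # Rule (iii): a row and a column
            supply[k] = t + B
            demand[k] = B
        elif t > 0:                             # Rule (i): a pure source gets a row
            supply[k] = t
        else:                                   # Rule (ii): a pure sink gets a column
            demand[k] = -t
    return B, supply, demand

B, S, D = expanded_margins(net, transshipment)
print(B, S, D)
print(sum(S.values()) == sum(D.values()))       # total supply equals total demand
```

The last line confirms the remark that adding B to both the row and the column of each transshipment point leaves total supply equal to total demand.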

A feasible routing for the problem is shown in Fig. 6.6. Study the table carefully.

Figure 6.6. (Row supplies for Points 1, 2, 4, 5, 6, and 7: 10, 12, 14, 12, 11, 12.)


Begin with the first row of the table, representing the supply available at Source 
Point 1. Note that the 10 units available are shipped to Point 2. Next look at the 
row and column for Point 2. Observe that $S_2 = D_2 = B = 12$, because Point
2 is a transshipment point without any supply or demand. Since the solution
already has $x_{12} = 10$, you must have $x_{22} = 2$ in order that the demand requirement
$D_2 = 12$ be satisfied. This in turn means that the other 10 units of $S_2$ can
be shipped out, and are shown going to Point 3, which is a sink, and Point 5, 
which is a transshipment point. In other words, the ten units from Point 1 are 
transshipped through Point 2 to Points 3 and 5. 

Skipping down to the fifth row, you can see that the 7 units entering are then 
shipped out to Points 4 and 6. In particular, 1 unit enters Point 6 from Point 5. 

Now look at the row and column for Point 6. Observe that $S_6 = 11$ whereas
$D_6 = 12$, in keeping with the requirement for one unit at Point 6. Since $x_{56} = 1$
and $D_6 = 12$, you must have $x_{66} = 11$, so the unit entering Point 6 is not shipped
out.

Complete your investigation, and verify that Store 1 ships three units to Store 
3 via Point 2, one unit to Store 6 via Points 2 and 5, and six units to Store 8 via 
Points 2, 5, 4, and 7. Store 4 ships its two units to Store 8 via Point 7. 

Mathematical model. To complete your understanding of the transshipment
model, you should now turn to a precise mathematical description
of the problem. This formal characterization is conveniently shown by the technology
in Fig. 6.7. Observe there is one equation for each store, or node, of the
network. There is also one variable $x_{ij}$ for each arc to represent the amount to
be shipped from Point i to Point j. A positive value on the right of the equality
sign designates stock available for redistribution. A negative value indicates a
requirement at that store. In summary, the linear programming technology in
Fig. 6.7 is an immediate translation of the network diagram in Fig. 6.3.

                     Shipping Activities
Node    x12  x23  x25  x43  x45  x47  x54  x56  x67  x78
  1       1                                               =  10
  2      -1    1    1                                     =   0
  3           -1        -1                                =  -3
  4                      1    1    1   -1                 =   2
  5                -1        -1         1    1            =   0
  6                                         -1    1       =  -1
  7                               -1             -1    1  =   0
  8                                                   -1  =  -8
Cost    c12  c23  c25  c43  c45  c47  c54  c56  c67  c78  Minimize

Figure 6.7. Transshipment Example.
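
As a check on the node equations, the routing described earlier (three units from Store 1 to Store 3 via Point 2, one unit to Store 6 via Points 2 and 5, six units to Store 8 via Points 2, 5, 4, and 7, and Store 4's two units to Store 8 via Point 7) can be substituted into them. The Python sketch below is illustrative; the helper name `net_outflow` is hypothetical.

```python
# Arcs and right-hand sides of the node equations in Fig. 6.7.
arcs = [(1, 2), (2, 3), (2, 5), (4, 3), (4, 5), (4, 7), (5, 4), (5, 6), (6, 7), (7, 8)]
rhs = {1: 10, 2: 0, 3: -3, 4: 2, 5: 0, 6: -1, 7: 0, 8: -8}

# Arc flows implied by the routing described in the text (omitted arcs carry 0).
flow = {(1, 2): 10, (2, 3): 3, (2, 5): 7, (5, 4): 6, (5, 6): 1, (4, 7): 8, (7, 8): 8}

def net_outflow(node, flow):
    """Amount shipped out of a node minus the amount shipped into it."""
    shipped_out = sum(x for (i, j), x in flow.items() if i == node)
    shipped_in = sum(x for (i, j), x in flow.items() if j == node)
    return shipped_out - shipped_in

uses_only_network_arcs = all(arc in arcs for arc in flow)
balanced = all(net_outflow(k, flow) == rhs[k] for k in rhs)
print(uses_only_network_arcs, balanced)
```

Each equation of the table is a statement that the net outflow at a node equals its stock position, which is exactly what `balanced` tests.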

Observe that whereas there are only 8 equations and 10 variables in the technology
table of Fig. 6.7, there are 13 row and column restrictions and 15 variables
shown in the expanded transshipment table of Fig. 6.5. The increase of five in each
of these dimensions of the problem is due to the presence of the five transshipment 
points. But this increase in size has a negligible computational impact because 
the transshipment variables are always in a solution. 

► It is not immediately evident that the optimization problem in the technology table of 
Fig. 6.7 is equivalent to that in the transportation table of Fig. 6.5. The correspondence 
can be argued as follows. By a quick inspection you can see in Fig. 6.5 that the row
for Store 1 (a source) and the columns for Stores 3 and 8 (each a sink) agree with the 
constraints for these stores in Fig. 6.7. Thus the only analytic difficulties are for the 
rows and columns of the transshipment points. 

Consider one of these points, say 4. In Fig. 6.5 the row and column restrictions of the 
transportation table are 

(i) $x_{43} + x_{44} + x_{45} + x_{47} = 2 + B$ (supply)

(ii) $x_{44} + x_{54} = B$ (demand).

Subtracting (ii) from (i) yields

(iii) $x_{43} + x_{45} + x_{47} - x_{54} = 2$,

which is the restriction for Point 4 in Fig. 6.7. The same manipulation applies to the 
other transshipment points. Therefore, any feasible solution to the system in Fig. 6.5 is 
also feasible for the system in Fig. 6.7. 

Provided B is sufficiently large, any solution to the system given by the technology 
table in Fig. 6.7 is also feasible in Fig. 6.5. To test this assertion, consider Point 4 again. 
The assumption is that (iii) is satisfied. Leave the values of $x_{ij}$ unchanged and let
$x_{44} = B - x_{54}$, so that both (i) and (ii) hold. Then $x_{44} \ge 0$ if B is large enough.

Finally, to demonstrate equivalence, note that since all $c_{kk} = 0$, any feasible solution
for one formulation is also attainable in the other system at the same value of the objective
function. ◄

6.4 ASSIGNMENT MODEL 

The assignment problem can be posed succinctly as follows: each of n tasks can 
be performed by any one of n agents. The cost of Task i being accomplished by 
Agent j is $c_{ij}$. Assign one agent to each task to minimize the total cost.

Why should this problem interest you? As usual, one reason is that the model 
can be applied—admittedly not very often in this case—to consequential business 
situations. An example is provided in the next paragraph. A more compelling 
reason is that this problem provides a vital link between linear programming 
problems and so-called combinatorial problems. The latter include those situations 
in which the unknown activity levels must be integer-valued. Such problems are 
studied in more detail in Chaps. 8, 10, and 13. 

The Blip-Bleep Company Problem. This manufacturer of complex 
electronic equipment has an order for several thousand units of a newly designed 
product, which is assembled from component modules. The company’s management
has decided to subcontract for n of the components, and has selected n
outside subcontractors who have demonstrated high-quality performance capability
on previous occasions. Each job is sizable enough to prevent a subcontractor
from accepting more than one task. Every subcontractor is required to submit a 
bid on each of the modules, and thereby specify the total cost for which it would 
bill the electronics company for producing the separate components. A few 
individual bids are far out of line, indicating that some of the subcontractors 
really do not want to accept a particular job assignment. With this information, 
the electronic equipment firm lets out the n contracts for the n components so that
the total cost incurred is minimal. 

Mathematical model. To formulate the assignment problem
in mathematical programming terms, define the activity variables as

(1) $x_{ij} = \begin{cases} 1 & \text{if Task } i \text{ is performed by Agent } j \\ 0 & \text{otherwise} \end{cases}$ for $i = 1, 2, \ldots, n$ and $j = 1, 2, \ldots, n$,

and let $c_{ij}$ be the corresponding cost. In the Blip-Bleep Company illustration, the
index i refers to the component number and the index j to the subcontractor.
Then the optimization model is

(2) minimize $\sum_{i=1}^{n} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to

(3) $\sum_{j=1}^{n} x_{ij} = 1$ for $i = 1, 2, \ldots, n$

(4) $\sum_{i=1}^{n} x_{ij} = 1$ for $j = 1, 2, \ldots, n$

(5) $x_{ij} = 0$ or 1 for all i and j.


Notice that the assignment model (2) through (5) is a special case of the standard
transportation model, where $m = n$ and $S_i = D_j = 1$ [in (7) through (10) of
Sec. 6.2]. It is now accepted terminology to refer to any standard transportation
problem where $m = n$ and $S_i = D_j = 1$ as an assignment problem. Observe,
however, that the above illustration as well as most applications of the assignment
model do not pertain to the optimization of transportation or shipping activities.
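
For a very small n, the model (2) through (5) can be solved by listing every way of giving each task a distinct agent. The Python sketch below does this by brute force; it is not an efficient method (the number of assignments grows as n factorial), the function name `solve_assignment` is hypothetical, and the 3-by-3 bid matrix is invented purely for illustration.

```python
from itertools import permutations

def solve_assignment(cost):
    """Enumerate all one-to-one assignments; cost[i][j] is c_ij (Task i, Agent j)."""
    n = len(cost)
    best_total, best_choice = None, None
    for choice in permutations(range(n)):        # choice[i] = agent given Task i
        total = sum(cost[i][choice[i]] for i in range(n))
        if best_total is None or total < best_total:
            best_total, best_choice = total, choice
    return best_total, best_choice

# Hypothetical bids: rows are components (tasks), columns are subcontractors (agents).
bids = [
    [9, 2, 7],
    [6, 4, 3],
    [5, 8, 1],
]
total, choice = solve_assignment(bids)
print(total, choice)
```

Each permutation corresponds to one set of $x_{ij}$ values satisfying (3), (4), and (5), so the enumeration covers exactly the feasible solutions of the model.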

6.5 SHORTEST-ROUTE MODEL 

Like the assignment model, the shortest-route problem can be explained 
simply: given a network in which each arc has an associated length, find 
a shortest path to a specified node from any of the other nodes. This problem 
actually encompasses a broad range of important operations research applications, 
including replacing equipment and scheduling complex projects. Because these 
specific illustrations are so diverse, it is helpful at the start to fix in mind the basic 
ideas of the model in general terms. 
General description. Consider a network comprised of a set of nodes,
certain pairs of which are connected by directed (that is, direction-oriented)
arcs. For example, look at the network in Fig. 6.8. Usually, one node is distinguished
as the terminal. Then the problem is to find a shortest path to the
terminal from at least one other designated point, and sometimes from every other
point. The amount $c_{ij}$ represents the length associated with traversing the arc that
starts at Node i and ends at Node j.

Figure 6.8. Example of Shortest-Route Model.

In applications of the model, the $c_{ij}$ actually may be measured in units other
than distance. To illustrate, $c_{ij}$ may denote the cost of going from Node i to Node
j. In that case, the problem is to find a least-cost path. Or $c_{ij}$ may represent
the time to travel between the nodes. Then the objective is to find a minimum-duration
path.

Frequently in these applications, $c_{ij}$ does not equal $c_{ji}$. Further, some nodes
may not be connected directly, which can be indicated by letting the corresponding
$c_{ij} = \infty$, and usually the so-called triangle inequality $c_{ij} \le c_{ik} + c_{kj}$
does not hold for all possible i, j, and k.

A network like the one in Fig. 6.8 may contain paths that are cycles. This
means that for two or more nodes it is possible to find a path leading out of a node
and eventually back again. In Fig. 6.8, there are many cycles, one such being from
Node 2 to Node 7 to Node 3 and back again to Node 2. If the total length of a
path around a cycle is negative, then by repeatedly traversing this cycle, the objective
function can be made arbitrarily small. Therefore, without imposing any further
restrictions on the problem as stated, an unbounded solution can occur. Such
a possibility will now be ruled out by assuming that if a network contains cycles, the
total length of every cycle path is nonnegative. This condition is innocuous for the typical
operations research application, where often all $c_{ij} \ge 0$.
From source to sink. Suppose the problem is to find a best path to the
terminal from a single origin point. The model is easily shown to be mathematically 
equivalent to an assignment problem. To demonstrate this equivalence for the 
network in Fig. 6.8, designate Node 1 as the terminal and Node 8 as the origin. 
Then consider the ordinary transshipment problem in which a unit of stock is 
available at Node 8 (a source) and is required at Node 1 (a sink). All other nodes 
of the network are treated as transshipment points. 

This view leads to the formulation in Fig. 6.9, which corresponds to an assignment
model with the buffer stock $B = 1$. Notice that the shortest-route problem
is a special case of the assignment model in which the objective coefficients $c_{kk} = 0$
appear in the so-called subdiagonal of the table.

Figure 6.9. Tabular Representation of Shortest-Route Example.

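A mathematical description of the shortest-route problem is

(1) minimize $\sum_{(i,j) \text{ in network}} c_{ij} x_{ij}$

subject to

(2) $\sum_{(k,j) \text{ in network}} x_{kj} - \sum_{(i,k) \text{ in network}} x_{ik} = \begin{cases} 1 & \text{for } k = s \text{ (source)} \\ 0 & \text{for all other } k \\ -1 & \text{for } k = r \text{ (sink)} \end{cases}$

(3) $x_{ij} \ge 0$ for all $(i, j)$ in network.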

Write out the technology table corresponding to (1), (2), and (3) for the example
in Fig. 6.8; there will be one equation for each node and one variable for each
arc. To illustrate, for $k = s = 8$, (2) indicates

(4) $x_{85} + x_{86} + x_{87} = 1$ (Source Node 8)

and for $k = 7$, (2) states

(5) $x_{72} + x_{73} + x_{76} - x_{27} - x_{87} = 0$ (Node 7).

It is much easier to solve a shortest-route problem than to solve a general 
assignment model. Chapter 7 presents an efficient algorithm for the shortest- 
route problem. 
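
To make the problem statement concrete before Chapter 7, here is a minimal Python sketch of a generic label-correcting (Bellman-Ford style) calculation. It is not the algorithm of Chapter 7; the function name `shortest_to_terminal` and the four-node network with its arc lengths are hypothetical. Like the text, it assumes that no cycle has negative total length.

```python
import math

def shortest_to_terminal(nodes, length, terminal):
    """length[(i, j)] = c_ij; returns the shortest distance from each node to the terminal."""
    dist = {k: math.inf for k in nodes}
    dist[terminal] = 0
    for _ in range(len(nodes) - 1):          # at most (number of nodes - 1) passes
        changed = False
        for (i, j), c in length.items():
            if dist[j] + c < dist[i]:        # going i -> j, then onward from j, is shorter
                dist[i] = dist[j] + c
                changed = True
        if not changed:                      # labels have settled
            break
    return dist

# Hypothetical network: Node 1 is the terminal.
nodes = [1, 2, 3, 4]
length = {(2, 1): 4, (3, 1): 9, (3, 2): 3, (2, 3): 1, (4, 3): 2, (4, 2): 7}
dist = shortest_to_terminal(nodes, length, terminal=1)
print(dist)
```

Note that the example contains the cycle 2 to 3 and back to 2, of positive length, which does no harm.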

There are countless seemingly different applications of the shortest-route problem.
Remember the one in the Rock-Bottom Discount Store Problem in Sec. 6.3?
There you learned that the transshipment model can be approached by finding
first a least-cost route from each node having available stock to every node requiring
additional stock, and then an optimal solution to a standard transportation
model using these minimum-cost paths. Here is another application.

Equipment replacement. The Rhode-Bloch Trucking Company is
preparing a leasing plan for transportation equipment extending over $n - 1$
years. The company can meet its requirements by leasing a new piece of equipment
at the beginning of Year 1 and keeping it until the beginning of Year
$j \le n$. If $j < n$, then the company replaces the equipment at the beginning of
Year j and keeps it until the beginning of Year k ($\le n$), etc. The cost figure $c_{ij}$
(for $1 \le i < j \le n$) embodies the rental fee plus the expected running and maintenance
costs of equipment leased at the start of Year i and replaced at the start of
Year j. The network for the problem is shown in Fig. 6.10 for $n = 6$. In this
characterization, a unit of stock is made available at Node 1 and routed to Node
6. Each transshipment node appearing in an optimal solution indicates the year
a replacement is to occur.

Figure 6.10. Rhode-Bloch Trucking Company Equipment Replacement Network.
Notice there are no cycles in the network in Fig. 6.10; consequently it is called
acyclic. (Represent the network in terms of an assignment model table, like
that of Fig. 6.9. Order the nodes in the rows as 1, 2, . . . , 5, and in the columns
2, 3, . . . , 6. Because the network is acyclic, you will find no permissible entries 
below the subdiagonal.) In Chap. 7, you will find that a very simple algorithm is 
available for determining the shortest route in an acyclic network. 
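
Because every arc in the replacement network runs from a lower-numbered node to a higher-numbered one, the nodes can be processed in numerical order. The Python sketch below uses that idea; it is an illustration, not necessarily the method presented in Chap. 7. The function name `cheapest_lease_plan` and the cost figures (which here depend only on how long the equipment is kept) are hypothetical.

```python
def cheapest_lease_plan(n, cost):
    """cost[(i, j)] = c_ij for 1 <= i < j <= n. Returns (least total cost, node path)."""
    best = {1: 0}                       # least cost of reaching each node from Node 1
    pred = {}
    for j in range(2, n + 1):           # nodes in increasing order: the network is acyclic
        best[j] = float("inf")
        for i in range(1, j):
            candidate = best[i] + cost[(i, j)]
            if candidate < best[j]:
                best[j], pred[j] = candidate, i
    path = [n]
    while path[-1] != 1:                # trace the route back to Node 1
        path.append(pred[path[-1]])
    return best[n], path[::-1]

# Hypothetical costs of keeping one piece of equipment for 1, 2, ..., 5 years.
cost_by_age = {1: 7, 2: 9, 3: 12, 4: 17, 5: 23}
n = 6
cost = {(i, j): cost_by_age[j - i] for i in range(1, n) for j in range(i + 1, n + 1)}
total, path = cheapest_lease_plan(n, cost)
print(total, path)
```

The intermediate nodes of `path` are the years in which a replacement occurs.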

► Grand Land Development Company Problem. This example is included
to demonstrate the versatility of the shortest-route problem. As in the preceding case, 
the network turns out to be acyclic. Both these illustrations fall in the category of 
deterministic, dynamic, and multistage optimization models. Therefore the discussion 
here previews what will be treated in greater detail in Chap. 10. 

This land development company is about to draw up an investment plan for the 
current year. The concern has a total capital of C hundred thousand dollars available for
allocation. There are n development opportunities under consideration. The firm may
invest in any number of these ventures, and is limited only by the amount of capital it
has available. The minimum investment required to buy into Alternative i is $p_i$, and
the company assesses the associated present value to be $v_{i0}$. If an additional k hundred
thousand dollars are invested, however, the resultant present value is estimated as $v_{ik}$.
Assume $v_{i,k+1} > v_{ik}$. The company wants to allocate C among the investment alternatives
so as to maximize the total present value.

To set up the model in concrete terms, suppose $C = 10$, $n = 4$, and the minimal
amounts are $p_1 = 6$, $p_2 = 4$, $p_3 = 3$, and $p_4 = 1$. Let the additional investment amounts
be in increments of one hundred thousand dollars, so $k = 1, 2, \ldots$. (In this example,
Alternative 4 could be thought of as an investment in securities instead of a land development
project.) The network for the illustration appears in Fig. 6.11 and the corresponding
transshipment tabular form in Fig. 6.12.

The network is constructed as follows. There is one column of nodes for each investment
alternative. In the node designation (i, c), i refers to the alternative. The value c
signifies an amount of capital available for possible investment in Alternative i, given
that certain other investment decisions are made about the previous alternatives. Each
arc leading out of Node (i, c) represents a specific decision about Alternative i. The
arc ends at Node (i + 1, c'), where c' is calculated as the capital available for possible
investment in Alternative i + 1, given the specific decision made about Alternative i.

For instance, examine Node (3, 4). If the company decides either to make the minimal
investment in Alternative 1 and no investment in Alternative 2, or to make no investment
in Alternative 1 and allocate $p_2 + 2 = 4 + 2 = 6$ hundred thousand dollars to
Alternative 2, then 4 hundred thousand dollars are available for possible investment in
Alternative 3. These are the two decisions represented by the arcs entering Node (3, 4).
With the 4 hundred thousand dollars, the firm may decide either not to invest in Alternative
3; or to invest at the minimal level $p_3 = 3$; or to invest at the level $p_3 + 1 = 4$,
which would exhaust the available capital. These three decisions are represented by the
arcs leaving Node (3, 4).

Thus every feasible pattern of investments can be represented as a path through the 
network starting at Node (1, 10) and ending at Node (5, 0), the latter signifying that 
the total capital is to be fully expended among the four alternatives. To test your understanding
of the network, formulate several possible investment plans and find their
paths in the network. Then select a few different paths and state the implied investments. 
Figure 6.11. Grand Land Development Network.


In this problem the length of the arc is the associated present value $v_{ik}$ of the decision
involved. Since the objective function is to maximize total present value, the problem 
really is to find the longest path in the network. As you know from earlier chapters, by 
changing the sign of the objective function, a longest-path problem may be trivially 
converted to a shortest-path problem, so the two models have essentially identical 
structures. Since the network is acyclic, the optimal value of the objective function is 
finite. 

The above model is summarized in mathematical terms as follows. Let
$x_i$ = amount of capital invested in Alternative i,
and define the increasing function

$f_i(x_i) = \begin{cases} 0 & \text{for } x_i < p_i \\ v_{ik} & \text{for } x_i = p_i + k, \text{ where } k = 0, 1, 2, \ldots \end{cases}$

Figure 6.12. Table for Grand Land Development Example.

Then the model is

(i) maximize $\sum_{i=1}^{n} f_i(x_i)$

subject to

(ii) $\sum_{i=1}^{n} x_i = C$

(iii) $x_i = 0, 1, 2, \ldots$ for each i.

Thus the example has illustrated how network analysis can be applied to a problem of 
optimizing a nonlinear objective function where the variables are restricted to integer
values and subject to a single constraint. 
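
The longest-path view of this example can be sketched as a recursion over the nodes (i, c). In the Python sketch below, C and the minimum investments are those of the example, but the present values are hypothetical (an invented base value plus one unit of value per extra hundred thousand dollars), and the names `f` and `best` are illustrative. Following the network, the only decisions considered for an alternative are to invest nothing or at least the minimum, and all capital must be spent.

```python
from functools import lru_cache

C = 10                      # capital available, from the example
p = [6, 4, 3, 1]            # minimum investments p_1..p_4, from the example
base = [12, 8, 6, 1]        # hypothetical present values at the minimum level
step = [1, 1, 1, 1]         # hypothetical gain per additional unit invested

def f(i, x):
    """Present value of investing x in alternative i (0-based index)."""
    if x < p[i]:
        return 0
    return base[i] + step[i] * (x - p[i])

@lru_cache(maxsize=None)
def best(i, c):
    """Longest path from the node for alternative i with capital c to the final node."""
    if i == len(p):                              # past the last alternative
        return (0, ()) if c == 0 else (float("-inf"), ())
    decisions = [0] + list(range(p[i], c + 1))   # invest nothing, or at least p_i
    candidates = []
    for x in decisions:
        value, plan = best(i + 1, c - x)
        candidates.append((f(i, x) + value, (x,) + plan))
    return max(candidates)

value, plan = best(0, C)
print(value, plan)
```

Each call `best(i, c)` corresponds to one node of the network, and each decision x to one arc leaving it.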

Traveling Salesman Problem. This rather facetiously named problem
refers to the following situation: a traveling salesman named Pierre plans to visit each
of n cities, starting and ending at City 1. The distance between City i and City j is $c_{ij}$.
What is the shortest tour Pierre can take?

This optimization problem and variants of it actually are faced by numerous companies
that need to draw up routes for delivery trucks and the like. The mathematical
model also fits situations that are entirely different. For example, the model is pertinent
for an ice cream manufacturer who must optimally schedule the order of flavors to be
run on his equipment. Let $c_{ij}$ be the amount of time spent cleaning and preparing the
machinery when Flavor j is scheduled to follow Flavor i. The $c_{ij}$ vary widely, since
flavors that are near-variants (vanilla and vanilla plus chocolate chips) can be run in 
quick succession, whereas other pairs require significant cleanup time (chocolate 
followed by coconut). The manufacturer wants to schedule his production to minimize 
the total amount of set-up time on the machine. 

Returning to the salesman’s problem, let

$x_{ij} = \begin{cases} 1 & \text{if a tour includes traveling from City } i \text{ to City } j \text{ (for } i \ne j) \\ 0 & \text{otherwise.} \end{cases}$

Now suppose you apply the assignment model (2) through (5) in the preceding section,
with all the $x_{ii}$ omitted, to produce a minimum-distance routing through each city.
The constraints in the assignment problem ensure that the route includes a departure
from each city, and similarly an arrival at each city.

Will an optimal solution to the assignment problem provide a tour the salesman can
use? Unfortunately, it may not. The resulting solution may include two or more
disconnected cycles. For instance, $x_{12} = x_{23} = x_{31} = 1$ and $x_{45} = x_{56} = \cdots = x_{n4} = 1$ can
occur. Thus one route is through Cities 1, 2, and 3, and an entirely separate route is
through Cities 4, 5, . . . , n. Therefore what seems to be a modest extra requirement
that the solution is a bona fide tour (that is, it contains only one cycle) in fact makes
this combinatorial problem considerably harder to solve. You will return to the problem
in Chap. 13. ◄
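
Whether an assignment solution is a bona fide tour can be tested by following each city to its successor. The Python sketch below is illustrative: the function name `cycles_from_successors` is hypothetical, and the example takes n = 6 with the two disconnected cycles mentioned above.

```python
def cycles_from_successors(successor):
    """successor[i] = j means x_ij = 1. Returns the list of cycles in the solution."""
    unvisited = set(successor)
    cycles = []
    while unvisited:
        city = min(unvisited)            # start a new cycle at the lowest unvisited city
        cycle = []
        while city in unvisited:
            unvisited.remove(city)
            cycle.append(city)
            city = successor[city]       # follow the arc leaving this city
        cycles.append(cycle)
    return cycles

# The disconnected solution from the text, with n = 6.
successor = {1: 2, 2: 3, 3: 1, 4: 5, 5: 6, 6: 4}
cycles = cycles_from_successors(successor)
is_tour = len(cycles) == 1
print(cycles, is_tour)
```

A solution passes the test only when a single cycle covers all n cities.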


*6.6 CRITICAL PATH SCHEDULING 

This version of the longest-path problem is frequently applied to schedule
major projects, such as the construction of a multistory office building, a
maintenance overhaul of large-scale equipment, a research and development
program for a new weapon system, and the market introduction of a new product.
Critical path scheduling is also designated by a variety of acronyms and other 
names, such as CPS, CPM, and PERT. The approach has been developed to 
such an extent that adequate coverage of all its nuances would require an entire 
chapter and take you too far afield from the topic of the present chapter. However, 
you will find it instructive to see how the longest-route model can be applied to 
this sort of scheduling. The description to follow contains a number of simplifica¬ 
tions, but the structural essentials are retained. 

The Swift Building Company Problem. This firm must complete
a construction project comprised of n tasks. Management has estimated the time 
needed for completing each task, and has established a precedence-ordering among 
the tasks to indicate which must be finished before each particular task can com¬ 
mence. As you will see in the illustration below, it is necessary to specify only the 
jobs that immediately precede a task to determine such an ordering. The company 
wants to calculate the earliest possible completion date for the entire project. 

Suppose the project consists of five jobs, A, B, C, D, and E. It is convenient
terminology to add a fictitious Job F that is to start when the entire project is
finished. The precedence-ordering among the tasks is exhibited in Fig. 6.13 along
with the completion times for each task. Thus Job C cannot start until Job A is
finished, and Job E cannot begin until both Jobs B and D are finished. The entire
project is finished as soon as both Jobs C and E have ended.

To formulate the corresponding mathematical optimization model, let the variables
represent the starting time for each of the jobs. Introduce only those variables
essential to the analysis. Note that Jobs A and B can be considered as beginning at
time 0, because they have no predecessors. Similarly Job C can be considered as
starting at the same time as Job D, since both have the same immediate predecessor,
namely Job A. Therefore, you need to include only the variables

$y_{CD}$ = starting time of Jobs C and D
$y_E$ = starting time of Job E
$y_F$ = starting time of Job F.

Recall that $y_F$ in fact is the time at which the entire project is finished.

The appropriate linear programming model is

(1)  minimize $y_F$

subject to

(2)  $y_{CD} \ge t_A$
(3)  $y_E \ge t_B$
(4)  $y_E \ge t_D + y_{CD}$
(5)  $y_F \ge t_C + y_{CD}$
(6)  $y_F \ge t_E + y_E$.

    Job    Immediate Predecessors    Completion Time
    A      -                         $t_A$
    B      -                         $t_B$
    C      A                         $t_C$
    D      A                         $t_D$
    E      B, D                      $t_E$
    F      C, E                      -

Figure 6.13. Swift Building Company Example: Precedence Ordering.


Relations (2) through (6) are mathematical translations of the precedence relations
in Fig. 6.13, and use the completion times given there. To illustrate, (3)
and (4) require that Job E does not start until both Jobs B and D are completed.
The completion time of Job B is merely $t_B$, and of Job D is its own starting time
plus completion time. Notice restrictions (2) through (6) make it unnecessary to
impose an explicit constraint that the variables be nonnegative. Consequently,
from a formal point of view, the variables are unconstrained in sign.

The technology of the model is summarized in Fig. 6.14. At first glance it does not
seem to have a network structure, but further study reveals that its dual comes
close to the desired format. With a few simple manipulations, given in the special
material below, you can show the dual is a longest-path problem.
The longest route itself is termed the critical path, since if the completion time
of any job on this route increases, the finish date is delayed accordingly.
(A critical path is not necessarily unique.)

In actual applications of critical path scheduling, 
which often encompass several hundred jobs, the 
analyst uses information such as that given in Fig. 
6.13 and immediately draws a corresponding net¬ 
work. However, the algorithm that finds the longest 
path in the constructed acyclic network really solves 
the problem expressed in terms comparable to (1) through (6). 
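The longest-path calculation can be sketched directly from the precedence table. The example below is illustrative only: the precedence lists come from Fig. 6.13, but the numerical durations are made-up values (the text leaves the completion times symbolic), and the function names are hypothetical.

```python
# Immediate predecessors as in Fig. 6.13; Job F is the fictitious finishing job.
predecessors = {"A": [], "B": [], "C": ["A"], "D": ["A"], "E": ["B", "D"], "F": ["C", "E"]}
# Assumed completion times, chosen only for illustration.
duration = {"A": 3, "B": 6, "C": 5, "D": 2, "E": 4, "F": 0}


def earliest_starts(predecessors, duration):
    """Earliest start of each job = longest chain of predecessor durations."""
    start = {}

    def visit(job):
        if job not in start:
            start[job] = max(
                (visit(p) + duration[p] for p in predecessors[job]), default=0
            )
        return start[job]

    for job in predecessors:
        visit(job)
    return start


def critical_path(predecessors, duration, start, last):
    """Trace back from the last job through the predecessor that finishes latest."""
    path = [last]
    while predecessors[path[-1]]:
        job = path[-1]
        path.append(max(predecessors[job], key=lambda p: start[p] + duration[p]))
    return path[::-1]


start = earliest_starts(predecessors, duration)
path = critical_path(predecessors, duration, start, "F")
```

With these assumed durations, the earliest starts satisfy relations (2) through (6) with $y_{CD}$ = start["C"], $y_E$ = start["E"], and $y_F$ = start["F"].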


    $y_{CD}$    $y_E$    $y_F$
       1                          $\ge t_A$
                  1               $\ge t_B$
      -1          1               $\ge t_D$
      -1                   1      $\ge t_C$
                 -1        1      $\ge t_E$
                           1      Minimize

Figure 6.14. Critical Path Tableau of Swift Building Company.
Figure 6.15. Swift Building Company: Tableau of Dual Problem. (Restriction: $x_{ij} \ge 0$.)

Figure 6.16. Swift Building Company: Network of Dual Problem.
More sophisticated versions of critical path scheduling treat the job completion
times as variables subject to a constrained optimization. In such cases, the analysis 
determines the tradeoff possibilities for reducing total project duration by increas¬ 
ing expedite costs. 

►Take the three dual relations implied by Fig. 6.14 and sum them to form a redundant
fourth equation. Recall that the dual relations are equalities, since the three variables
in Fig. 6.14 are unconstrained in sign. Second, take each of the same three relations and
multiply both sides by (-1) to change the signs of the coefficients.

The resultant model is shown in Fig. 6.15, with the redundant equation appearing in
the top row. The associated network is drawn in Fig. 6.16. The problem is to find the
longest route in an acyclic network; the problem is structurally equivalent to a
shortest-path model. ◄
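The manipulations just described can be written out as a worked derivation. This is a sketch under an assumed labeling: the dual variables $x_2, \ldots, x_6$ are named here after the primal relations (2) through (6), which need not match the subscripts used in Fig. 6.15.

```latex
% Dual of: minimize y_F subject to relations (2)-(6), with y unconstrained in sign.
\begin{align*}
\text{maximize}\quad & t_A x_2 + t_B x_3 + t_D x_4 + t_C x_5 + t_E x_6 \\
\text{subject to}\quad
  & x_2 - x_4 - x_5 = 0 && (\text{column } y_{CD}) \\
  & x_3 + x_4 - x_6 = 0 && (\text{column } y_E) \\
  & x_5 + x_6 = 1       && (\text{column } y_F) \\
  & x_2, x_3, x_4, x_5, x_6 \ge 0 .
\end{align*}

% Summing the three equalities gives the redundant fourth relation:
\[ x_2 + x_3 = 1 . \]

% Multiplying each of the three original equalities by (-1):
\begin{align*}
  -x_2 + x_4 + x_5 &= 0 , \\
  -x_3 - x_4 + x_6 &= 0 , \\
  -x_5 - x_6 &= -1 .
\end{align*}
```

Each variable now appears once with coefficient $+1$ and once with $-1$, so each is an arc: one unit leaves a starting node, passes through the nodes for $y_{CD}$ and $y_E$, and arrives at the node for $y_F$, and maximizing the objective selects the longest such route.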

*6.7 EMPLOYMENT SCHEDULING

The next application of network analysis demonstrates how it is sometimes 
necessary to manipulate the original formulation of a model in order to discover 
that the problem has a network equivalent. 

The Spitzen-Pollish Company is a contract maintenance firm that provides and 
supervises semiskilled manpower for major overhauls of chemical processing equip¬ 
ment. A standard job frequently requires a thousand or more men and may extend 
from one or two weeks to a few months. Since the client’s plant often is not located 
in a major metropolitan area, the maintenance company may have to transport 
workmen for several hundred miles. The firm’s operating expenses include the 
costs of transportation, food, and on-site housing, as well as the men’s wages. 

For a routine assignment, the firm can estimate fairly accurately the number of 
crews required on a day-to-day basis for the duration of the job. The daily require¬ 
ments fluctuate enough to impel the company to vary the number of crews on site 
during the overhaul period, in an effort to keep operating costs at a minimum. 
However, there are costs that do not depend on how long a crew remains on site: 
costs associated with recruiting, transporting, and then briefing a new crew. The 
firm sometimes finds it economical, therefore, to retain crews in excess of require¬ 
ments if the men will be needed a few days later. Thus for each contract, the firm 
seeks a work force schedule that minimizes total labor expenses. 

To construct the mathematical model that produces an optimal schedule,
assume that the overhaul begins at the start of Period 1 and ends at the start of
Period n. Let

$x_{ij}$ = number of crews beginning work at the start of Period i and terminating
at the start of Period j (for $1 \le i < j \le n$),

and let $c_{ij} > 0$ be the associated total operating cost of such a crew. Assume $c_{ij}$
increases if the interval of employment increases; that is,

(1)  $c_{ij} < c_{hk}$  for $h \le i < j \le k$, with $(h, k) \ne (i, j)$.

Finally, let $R_k$ be the number of crews actually needed during Period k, for
k = 1, 2, ..., n - 1.

Then the constraints of the problem are

(2)  $\sum_{j=2}^{n} x_{1j} = R_1$

(3)  $\sum_{i=1}^{k} \sum_{j=k+1}^{n} x_{ij} \ge R_k$  for k = 2, 3, ..., n - 2

(4)  $\sum_{i=1}^{n-1} x_{in} = R_{n-1}$

(5)  every $x_{ij}$ = 0, 1, 2, ...,

and the objective function is

(6)  minimize $\sum_{i=1}^{n-1} \sum_{j=i+1}^{n} c_{ij} x_{ij}$.

Relation (2) states that the total number of crews starting work in Period 1 
equals that period’s requirements. Because of (1), it is not economical to have 
excess crews at the start. By the same token, relation (4) specifies that there are 
no excess crews in the final work period, (n - 1). At intermediate periods,
Spitzen-Pollish may find it desirable to have more crews on site than are essential,
which is permitted by the inequality in (3).

The model is displayed in its entirety in Fig. 6.17, where n = 6. Nonnegative
surplus variables $s_k$, k = 2, 3, 4, have been added to convert (3) to equalities.
Any connection between the technology in Fig. 6.17 and that of a network model
is not obvious; but educated intuition suggests that further insight might be developed
by transforming the technology into another equivalent format. The triangular
appearance of the coefficients in successive sets of columns provides the clue
for the transformations on the rows of Fig. 6.17.

Figure 6.17. Spitzen-Pollish Company Employment-Scheduling Model.

Take Row 4 and subtract it from Row 5. Similarly, subtract Row 3 from Row 4, 
Row 2 from Row 3, and Row 1 from Row 2. The results are shown in Rows 
5', 4', 3', and 2' of Fig. 6.18. Then multiply Row 5 by (-1), thereby changing the
signs of the coefficients and of the right-hand side, and add this redundant relation
as Row 6' in Fig. 6.18.
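The effect of these row operations can be checked mechanically. The sketch below rebuilds the coefficient rows for n = 6 from the crew-coverage rule (a crew $x_{ij}$ is on site in Period k when $i \le k < j$), applies the subtractions, and tests that every column ends up with exactly one +1 and one -1. The column ordering and helper names are assumptions made for the illustration, not the layout of Fig. 6.17.

```python
n = 6
# Columns: every x_ij with i < j, then the surplus variables s_2, s_3, s_4.
columns = [("x", i, j) for i in range(1, n) for j in range(i + 1, n + 1)]
columns += [("s", k, k) for k in (2, 3, 4)]


def original_row(k):
    """Coefficients of the Period k requirement (Row k before transformation)."""
    row = []
    for kind, a, b in columns:
        if kind == "x":
            row.append(1 if a <= k < b else 0)   # crew (a, b) works in Period k
        else:
            row.append(-1 if a == k else 0)      # surplus s_k is subtracted in Row k
    return row


rows = {k: original_row(k) for k in range(1, n)}

new_rows = {1: rows[1]}                           # Row 1 is kept as it stands
for k in range(2, n):                             # Row k' = Row k - Row (k - 1)
    new_rows[k] = [u - v for u, v in zip(rows[k], rows[k - 1])]
new_rows[n] = [-u for u in rows[n - 1]]           # Row 6' = (-1) * Row 5


def column_entries(c):
    """Nonzero coefficients of column c after the transformation, keyed by row."""
    return {k: new_rows[k][c] for k in new_rows if new_rows[k][c] != 0}


network_form = all(
    sorted(column_entries(c).values()) == [-1, 1] for c in range(len(columns))
)
```

After the transformation, column $x_{ij}$ has +1 in Row i and -1 in Row j, so it is an arc from Node i to Node j; each surplus $s_k$ becomes an arc from Node k + 1 back to Node k.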
Figure 6.18. Spitzen-Pollish Equivalent Network Formulation.


Now the problem has the form of a transshipment model. As a consequence, the
integrality stipulation in (5) causes no added difficulty in finding a solution. The
network for the model is given in Fig. 6.19, where Node 1 is the source and Node 6
is the sink. The other nodes are transshipment points. The corresponding transshipment
array appears in Fig. 6.20. The buffer stock quantity can be set at

(7)  $B = .5\left[\sum_{k=2}^{5} |R_k - R_{k-1}| + R_1 + R_5\right]$.

Figure 6.19. Network Representation of Spitzen-Pollish Employment-Scheduling Model.

Notice that without any further assumptions about the $R_k$, the model is not a
shortest-route problem, and the activities $s_k$ appear below the subdiagonal of the
transportation table in Fig. 6.20. However, if $R_k = 1$, for k = 1, 2, ..., 5, then
you can eliminate the $s_k$ because no cost would be saved in routing a unit from
Node j to Node k if k < j. Furthermore, B = 1 in (7). Thus, in this case, the model
does simplify to a shortest-route problem in an acyclic network. What is more, the
network is identical to that for the equipment replacement problem considered in
Sec. 6.5 and illustrated by the network diagram in Fig. 6.10. In Chap. 10, you
will see other special applications of the model (2) through (6).

Figure 6.20. Tabular Form of Spitzen-Pollish Employment-Scheduling Model.
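For the special case in which one crew is needed in every period, the shortest-route calculation can be sketched as a single forward pass over the nodes. The cost function below, a fixed charge of 5 plus the square of the employment interval, is purely an assumed example that satisfies condition (1); the function name is likewise hypothetical.

```python
def cheapest_crew_sequence(n, c):
    """Shortest route from Node 1 to Node n using arcs (i, j) with i < j."""
    best = {1: 0}                      # best[j] = least cost of covering Periods 1..j-1
    pred = {}
    for j in range(2, n + 1):
        best[j], pred[j] = min((best[i] + c[i, j], i) for i in range(1, j))
    route = [n]
    while route[-1] != 1:              # trace the chosen arcs back to Node 1
        route.append(pred[route[-1]])
    return best[n], route[::-1]


n = 6
# Assumed costs: a fixed charge of 5 per crew plus a term growing with the interval.
c = {(i, j): 5 + (j - i) ** 2 for i in range(1, n) for j in range(i + 1, n + 1)}
total, route = cheapest_crew_sequence(n, c)
```

With these assumed costs the route 1, 3, 6 results: one crew works from the start of Period 1 to the start of Period 3, and a second crew takes over until the start of Period 6.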


*6.8 OVERVIEW OF NETWORK MODELS 

This chapter has demonstrated how certain operations research problems can 
be couched in terms of optimizing within a network. The present section puts in 
focus the concepts you have learned so far, and describes in broader terms the 
entities and concepts encompassed in network models. The exposition will also
introduce you to the standard vocabulary of these models. 

Definitions. A network (or linear graph) consists of a set of nodes
(or points, vertices) and a set of arcs (or edges, links) connecting
various pairs of the nodes. Each arc has specified orientation (or direction).
Consequently, the network is said to be directed.

A simple notation serves to describe a directed network: number the points
1, 2, ..., p, and designate an arc starting at Node i and ending at Node j by
(i, j). Assume there is only one arc (i, j), and call an item traversing (i, j) a
unit of flow along that arc. The network is bipartite if the nodes can be
partitioned into two groups $G_1$ and $G_2$ such that for every arc (i, j), Node i is in $G_1$
and Node j in $G_2$. The network for a transportation problem, shown in Fig. 6.1,
is bipartite.

In all the preceding examples of this chapter, a unit of flow originating at
any Node i and coursing (i, j) remains a unit on arrival at Node j. That is, no
loss or gain occurs in the flow along any arc. (The impact of this assumption is
explored in an illustration in the next section.) Consequently, a complete statement
of a network structure is apparent from a table of structural coefficients
(like Figs. 6.2, 6.7, 6.15, and 6.18). Such a table is referred to as a node-arc
incidence matrix. Each node is represented by a row in the table, and each
arc by a column. The direction of flow on the arc (i, j) is indicated by a coefficient
1 in the row for Node i and -1 in the row for Node j.
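A node-arc incidence matrix is easy to build from a list of arcs. The sketch below follows the sign convention just stated; the four-node network and the function name are made up for the illustration.

```python
def incidence_matrix(num_nodes, arcs):
    """One row per node, one column per arc: +1 at the arc's start, -1 at its end."""
    matrix = [[0] * len(arcs) for _ in range(num_nodes)]
    for col, (i, j) in enumerate(arcs):
        matrix[i - 1][col] = 1     # the arc leaves Node i
        matrix[j - 1][col] = -1    # the arc enters Node j
    return matrix


# Hypothetical directed network on four nodes.
arcs = [(1, 2), (1, 3), (2, 3), (3, 4)]
matrix = incidence_matrix(4, arcs)
column_sums = [sum(row[col] for row in matrix) for col in range(len(arcs))]
```

Every column sums to zero, which reflects the assumption that no loss or gain occurs in the flow along an arc.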

A sequence of arcs (ignoring their direction) that connects Node i to Node j is
termed a path between the two nodes. When i = j, the path is a loop. A network
is connected provided there is a path between every pair of points. In
most applications of network models, the linear graph is connected. When the arcs
in a path from Node i to Node j are all oriented so that it is actually possible 
for a unit of flow to traverse the path, the more restrictive term directed chain 
is often applied. (Consequently, throughout this chapter, wherever the word 
“path” has appeared, you may substitute “directed chain.”) For example, the 
first approach suggested for reducing a transshipment model to a standard trans¬ 
portation problem, illustrated in the condensed transshipment table of Fig. 6.4, 
requires finding all the directed chains from nodes having available stock to nodes 
requiring additional supply. A directed chain beginning and ending at the same 
node is a directed cycle. An acyclic network is one without any directed 
cycles. 

A tree. One special network of importance arises when there are p nodes,
exactly p - 1 arcs, and the network is connected. This structure is called a
tree, and does not contain any loops. (Draw a picture with p = 6 nodes placed
randomly on the page. Then construct a connected network using only five arcs.
Rotating the page and using a little imagination, you will see that your graph
looks like one or more branches of a tree. What happens when you try to include
a loop while constructing the network?) For example, consider the problem of
finding shortest routes to a particular node (the terminal) from every other node
in a general network. Assume the shortest routes are unique. A solution will be
a set of directed chains ending at the terminal. If each arc not used by at least one
of these chains is deleted from the network, the arcs remaining will form a tree.
(Why will the result be a connected network? Why will it have no loops? Why
will it have p - 1 arcs?)
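The tree of shortest routes can be produced by a simple repeated-relaxation procedure. The sketch below is one way to do it, under the assumptions that every node has a directed chain to the terminal, that no directed cycle has negative total cost, and that the shortest routes are unique; the five-node network and its arc costs are invented for the illustration.

```python
def shortest_route_tree(nodes, cost, terminal):
    """Shortest distance from every node to the terminal, plus the arcs used."""
    dist = {v: float("inf") for v in nodes}
    dist[terminal] = 0
    next_arc = {}                              # first arc of the best chain from each node
    for _ in range(len(nodes) - 1):            # p - 1 passes suffice without negative cycles
        for (i, j), c in cost.items():
            if dist[j] + c < dist[i]:
                dist[i] = dist[j] + c
                next_arc[i] = (i, j)
    return dist, sorted(next_arc.values())


nodes = [1, 2, 3, 4, 5]
cost = {(1, 2): 2, (1, 3): 2, (2, 3): 1, (2, 4): 6, (3, 4): 2, (3, 5): 7, (4, 5): 3}
dist, tree_arcs = shortest_route_tree(nodes, cost, terminal=5)
```

Each node other than the terminal keeps exactly one outgoing arc, so the result has p - 1 = 4 arcs and forms a tree.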

General formulation. In defining a network, it is also useful to specify
for the time period under consideration a capacity $u_{ij} \ge 0$ on the total flow starting
at Node i and ending at Node j. For the examples in the preceding sections, the
value of each $u_{ij}$ was given implicitly as either infinite, implying no effective flow
capacity restriction on an arc, or zero, signifying that flow was not permitted from
Node i directly to Node j.

Two more kinds of quantities are needed to characterize a network in optimization
applications. The first is the net flow value $T_k$ at each Node k. If $T_k$ is positive,
then more units of flow must leave the node than enter it, and vice versa
if $T_k$ is negative. If $T_k$ is zero, all flow entering the node must also leave. It is
convenient and innocuous to postulate

(1)  $\sum_{k=1}^{p} T_k = 0$.

The second kind of data required is the cost data $c_{ij}$ of each unit of flow traversing
arc (i, j). Here it has been assumed that for every directed cycle in a network
the corresponding sum of the $c_{ij}$ is nonnegative.

Let

$x_{ij}$ = amount of flow over arc (i, j) during the planning horizon.

Then a network optimization problem of considerable generality and applicability is

(2)  minimize $\sum_{i=1}^{p} \sum_{j=1}^{p} c_{ij} x_{ij}$

subject to

(3)  $\sum_{j=1}^{p} x_{kj} - \sum_{i=1}^{p} x_{ik} = T_k$  for k = 1, 2, ..., p

(4)  $0 \le x_{ij} \le u_{ij}$  for all i and j.

The constraints (3) are frequently called flow conservation or material
balance equations. By the methods you learned in this chapter, the model
(1) through (4) can always be viewed as a capacitated transportation problem.
The following result will be demonstrated in Chap. 7:

Integrality Theorem: If all $T_k$ and $u_{ij}$ are integers, then the value of (2) will
not be increased by further stipulating that each $x_{ij}$ must be an integer.

Whenever a linear programming problem is of the form (1) through (4), then
it has a network equivalent. Sometimes a problem’s original statement must
be manipulated, such as by combining equations, changing signs of coefficients,
and adding a redundant relation, or by considering the dual in order for it to
appear as (1) through (4). Once the model is in this form, however, each relation
in (3) corresponds to a node in the network, each variable $x_{ij}$ to an amount of
flow along a permissible arc (i, j), and each $c_{ij}$ to the cost of a unit of flow through
the arc.

►The model (1) through (4) is occasionally stated somewhat differently, and the technique
of conversion is of sufficient interest to be noted here. The modification arises
when there is more than one $T_k$ that is strictly positive or more than one $T_k$ that is
strictly negative. For the sake of generality, assume there are more than one positive and
more than one negative $T_k$.

Add two fictitious nodes, 0 and p + 1. For each positive $T_k$, add the arc (0, k) with
$u_{0k} = T_k$. For each negative $T_k$, add the arc (k, p + 1) with $u_{k,p+1} = -T_k$. Let F
be the sum of the positive $T_k$. Then modify the model (1) through (4) by replacing (3)
with

(i)  $\sum_{k=1}^{p} x_{0k} = F$

(ii)  $\sum_{j=1}^{p+1} x_{kj} - \sum_{i=0}^{p} x_{ik} = 0$  for k = 1, 2, ..., p

(iii)  $-\sum_{k=1}^{p} x_{k,p+1} = -F$.

As a result, there is now a single source (Node 0) and a single sink (Node p + 1),
all other nodes being transshipment points. ◄
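The conversion to a single source and a single sink amounts to a few dictionary updates. The sketch below assumes the nodes are numbered 1 through p and that capacities are stored by arc; the function name and the five-node data are hypothetical.

```python
def add_source_and_sink(net_flow, capacity):
    """Attach Node 0 and Node p + 1 so that every original node has zero net flow."""
    p = len(net_flow)                       # assumes the nodes are numbered 1, ..., p
    new_capacity = dict(capacity)
    for k, t in net_flow.items():
        if t > 0:
            new_capacity[(0, k)] = t        # arc from the source with capacity T_k
        elif t < 0:
            new_capacity[(k, p + 1)] = -t   # arc into the sink with capacity -T_k
    total = sum(t for t in net_flow.values() if t > 0)   # F, the flow leaving Node 0
    return new_capacity, total


# Hypothetical data: two supply nodes, one transshipment node, two demand nodes.
net_flow = {1: 4, 2: 3, 3: 0, 4: -5, 5: -2}
capacity = {(1, 3): 10, (2, 3): 10, (3, 4): 10, (3, 5): 10}
new_capacity, F = add_source_and_sink(net_flow, capacity)
```

Because the net flows sum to zero, the capacities on the arcs out of Node 0 and on the arcs into Node p + 1 both add up to F, so those arcs must be used to capacity.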


*6.9 GENERALIZED NETWORK PROBLEM 

You would expect that the Integrality Theorem in the preceding section would
fail to hold if the underlying structure of the network were altered. Indeed, this
is true when a unit of flow originating at Node i becomes $a_{ij} > 0$ units on arrival
at Node j. For $a_{ij} > 1$, there is a gain, and for $a_{ij} < 1$, a loss. The node-arc
incidence matrix for the model is revised by substituting the appropriate $-a_{ij}$
for each -1.

One illustrative application of the generalized network model occurs for a steel
fabricator with several manufacturing plants. On a monthly basis, the
Vice-President for Production, with his staff, determines the allocation of customer
orders to each of the plants. The manufacturing capacity of Plant i is $S_i$ tons per
month. There are n distinct products, and for the current month the total customer
demand for Product j is $D_j$, also measured in tons. Because the plants differ in
their fabricating facilities, a ton of capacity at Plant i allocated to making Product j
yields $a_{ij}$ tons of that commodity. In other words a “ton of capacity” at a plant
is really a standard unit of measurement, and the differing $a_{ij}$ indicate corresponding
relative production efficiencies. A ton of Product j manufactured at Plant i
costs $c_{ij}$, and if the plants are geographically dispersed, the $c_{ij}$ may also include
differential shipping charges. Let

$x_{ij}$ = tons of capacity at Plant i allocated to fabricating Product j
during the month.

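The mathematical model is

(1)  minimize $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to

(2)  $\sum_{j=1}^{n} x_{ij} \le S_i$  for i = 1, 2, ..., m

(3)  $\sum_{i=1}^{m} a_{ij} x_{ij} \ge D_j$  for j = 1, 2, ..., n

(4)  $x_{ij} \ge 0$  for all i and j.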


If a certain product cannot be produced at a plant, remove the corresponding $x_{ij}$ in (1)
through (4). The technology is displayed in Fig. 6.21. Notice how it closely resembles the
technology for a standard transportation problem, shown in Fig. 6.2. For this
reason, the linear programming model (1) through (4) is often called the
generalized transportation (or weighted distribution) problem.

Figure 6.21. Generalized Transportation Problem.
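A small numerical check may make the role of the yield factors concrete. The sketch below evaluates a proposed allocation against the capacity, demand, and nonnegativity constraints of the generalized transportation model; all the numbers and the function name are invented for the illustration.

```python
def evaluate(x, a, c, S, D):
    """Return (total cost, feasibility) of allocation x for the generalized model."""
    m, n = len(S), len(D)
    cost = sum(c[i][j] * x[i][j] for i in range(m) for j in range(n))
    capacity_ok = all(sum(x[i]) <= S[i] for i in range(m))
    # Output of Product j is the allocated capacity weighted by the yields a_ij.
    demand_ok = all(
        sum(a[i][j] * x[i][j] for i in range(m)) >= D[j] for j in range(n)
    )
    nonnegative = all(v >= 0 for row in x for v in row)
    return cost, capacity_ok and demand_ok and nonnegative


# Hypothetical data for two plants and two products.
S = [10, 8]                       # tons of capacity per plant
D = [6, 9]                        # tons demanded per product
a = [[1, 0.5], [0.5, 1.5]]        # tons of product per ton of capacity
c = [[4, 7], [6, 5]]              # cost per ton of capacity allocated
cost, feasible = evaluate([[6, 0], [0, 6]], a, c, S, D)
```

Here six tons of capacity at Plant 2 yield nine tons of Product 2, so the demand is met even though less than nine tons of capacity are allocated to it.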


*6.10 MULTICOMMODITY NETWORK

The discussion of the classical transportation problem in Sec. 6.2 made it clear 
that the formal mathematical statement assumed shipping only a single commod¬ 
ity. The text indicated a way in which a true multicommodity situation might be 
well enough approximated in the classical problem. But when the application 
involves p products to be routed through a capacitated transshipment network, and 
the pattern of stocks and requirements differs significantly from product to product, 
the problem may be too complex to handle by a one-commodity approximation. 
In this context a capacity constraint Uij means that the total flow of all commodities 
from Node i to Node j must not exceed the given bound. (Suggest an example or 
two in which an aggregate constraint of this type applies.) 

For an inkling of the complications that arise in a multicommodity model, 
consider the situation exhibited in Fig. 6.22. The network consists of three nodes. 
No cost is associated with traversing arcs (1,2), (2, 3), and (3, 1); however, a cost 
c per unit flow is charged on arcs (1, 3), (3, 2), and (2, 1). One unit of Commodity 
A is available at Node 1 and required at Node 3. Similarly, a unit of Commodity 
B is available at Node 2 and needed at Node 1, and a unit of Commodity C is 
available at Node 3 and wanted at Node 2. If there are no arc capacity restrictions,
the optimal solution, with zero total cost, is to route each item from its origin
through a transshipment point to its destination.

Now suppose there is a one-unit maximum flow constraint on each arc. If the
bound is imposed separately on each commodity, the previous solution remains
optimal. But suppose instead that the restriction is on the total flow of all commodities
on an arc. (Test your skill by writing out a full tableau description of the
model, given this assumption. In what way does the tableau structure differ
significantly from that in Fig. 6.7?) The previous routings are ruled out, because
together they imply a flow of 2 on the arcs (1, 2), (2, 3), and (3, 1). The only
optimal routing is to send half a unit of each commodity along the previous routings
and the remaining half unit directly from its origin to its destination. The
associated minimal cost now is $\tfrac{3}{2}c$. Thus connecting three separate flow problems,
merely by imposing common capacity constraints, is sufficient to destroy the integrality
result cited at the end of Sec. 6.8. The computational task of finding optimal
flows in such networks is likewise more difficult.

Figure 6.22. Multicommodity Network.
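The fractional routing can be verified by adding up the flow on each arc. The sketch below does so with exact fractions, taking the unit charge c as 1 for convenience; the data structure used to describe the routes is an assumption of the illustration.

```python
from fractions import Fraction

# Arc costs with c = 1: three free arcs and three arcs charged c per unit.
cost = {(1, 2): 0, (2, 3): 0, (3, 1): 0, (1, 3): 1, (3, 2): 1, (2, 1): 1}
half = Fraction(1, 2)

# Each commodity sends half a unit via a transshipment point and half a unit directly.
routes = {
    "A": [([(1, 2), (2, 3)], half), ([(1, 3)], half)],   # Node 1 to Node 3
    "B": [([(2, 3), (3, 1)], half), ([(2, 1)], half)],   # Node 2 to Node 1
    "C": [([(3, 1), (1, 2)], half), ([(3, 2)], half)],   # Node 3 to Node 2
}

load = {arc: Fraction(0) for arc in cost}
for paths in routes.values():
    for arcs, amount in paths:
        for arc in arcs:
            load[arc] += amount                # total flow of all commodities on the arc

total_cost = sum(cost[arc] * flow for arc, flow in load.items())
within_capacity = all(flow <= 1 for flow in load.values())
```

Each free arc carries exactly one unit in total, so the joint capacity is respected, and the total cost is three halves of the unit charge.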


REVIEW EXERCISES 

1 Consider the transportation problem (1) through (4) in Sec. 6.2. Let the number of
supply points m = 4 and the number of demand points n = 5.

(a) Write the model in full detail (that is, display the objective function and the
constraints without using the summation symbol).

(b) Draw the corresponding network in Fig. 6.1, and construct the two tables in
Fig. 6.2.

2 Excess Supply. Consider a transportation problem in which 

$S_1 = 5$  $S_2 = 6$  $S_3 = 7$  and  $D_1 = 2$  $D_2 = 3$  $D_3 = 4$.

Create a fictitious destination and construct a table like that in Fig. 6.2a in which total 
supply equals total demand. Be sure you indicate the cost coefficients Cij for the 
fictitious destination and its Dj. 
*3 Seasonal Variation. Suppose a firm having two plants and three demand points is
planning a production schedule for four periods. Assume over Periods 1, 2, 3, and 4 
that Plant 1 has available supplies of 4, 5, 6, and 7, respectively, and similarly Plant 2 
has available supplies of 8, 9, 10, and 11, respectively. Assume over Periods 1, 2, 3, 
and 4 that Demand Point 1 has demands 3, 3, 10, and 10, respectively; similarly, 
Demand Point 2 has demands 4, 2, 6, and 6, respectively; and Demand Point 3 has 
demands 2, 6, 4, and 4, respectively. Let Cij be the cost of shipping a unit from Plant i 
to Demand Point j in any period. 

(a) Display the optimization problem in a table like that in Fig. 6.2a. Be sure you 
indicate which must be eliminated. 

(b) Is there a feasible solution to this numerical example? Justify your conclusion. 
Give a general rule for discerning whether such a problem has a feasible solution. 

(c) Explain how to modify the formulation when the total supply exceeds total 
demand. 

(d) Suppose each unit of inventory held at the end of a period incurs the cost h. 
Will adding this cost change the optimal solution in part (a) ? In part (c) ? 

Exercises 4 through 9 refer to the case of the Rock-Bottom Discount Store, 
displayed in Fig. 6.3. 

4 In each part below, state which stores are transshipment (or intermediate) points, 
which are sources, and which are sinks. Also state whether the resulting map has a 
feasible shipping pattern that meets the demand requirements. Assume the removal 
of the route from 

(a) Store 4 to Store 5.

(b) Store 5 to Store 4. 

(c) Store 6 to Store 7. 

(d) Store 2 to Store 5. 

(e) Store 4 to Store 7. 

(f) Store 4 to Store 3. 

5 In each part below, state which stores are transshipment (or intermediate) points, 
which are sources, and which are sinks. Assume a route is added from

(a) Store 7 to Store 4. 

(b) Store 6 to Store 8. 

(c) Store 8 to Store 6. 

(d) Store 3 to Store 4. 

(e) Store 1 to Store 3.

(f) Store 3 to Store 1. 

6 In each part below, construct a condensed transshipment table like that in Fig. 6.4. 
Show a feasible solution, and indicate whether or not your solution implies a unique 
routing from the supply point to the demand point. Construct an expanded trans¬ 
shipment table like that in Fig. 6.5; indicate the sources and sinks. 

(a) Stock of 10 available at Store 1, and stocks of 2 and 8 required at Stores 3 and 8, 
respectively. No stock available or required at any other store. 
(b) Stocks of 10 and 4 available at Stores 1 and 5, respectively, and stocks of 3, 2, 1,
and 8 required at Stores 3, 4, 6, and 8, respectively. No stock available or required
at any other store.

(c) Stocks of 10, 1, 2, and 5 available at Stores 1, 2, 6, and 7, respectively, and stocks
of 3, 1, 6, and 8 required at Stores 3, 4, 5, and 8, respectively.

(d) Answer the above parts when the route from Store 6 to Store 7 is removed. Also 
state whether there is a feasible solution. 

7 (a) Consider the routing in Fig. 6.6. Explain in detail how this routing implies that
Store 1 ships three units to Store 3 via Point 2; one unit to Store 6 via Points 2
and 5; and six units to Store 8 via Points 2, 5, 4, and 7; and that Store 4 ships
its two units to Store 8 via Point 7. Display this routing on a map like that in
Fig. 6.3.

(b) Show how Fig. 6.6 is altered if Store 1 ships six units to Store 8 via Points 2, 5, 6,
and 7.

(c) Show how Fig. 6.6 is altered if Store 1 ships eight, instead of six, units to Store 8,
and Store 4 ships two units to Store 3, instead of to Store 8.

(d) Explain in detail the routing implied in Fig. 6.6 by letting 

^12 = ^0 <^22 = 2 ^ 25 = 10 X 43 = 3 X 44 = 11 

, *54 = 1 *55 = 2 JCsd = 9 ^«» = 3 X „ = 8 AT,, = 4 *,8 = 8. 

Display this routing on a map like that in Fig. 6.3. 

8 Add routes from Store 3 to Store 2; Store 3 to Store 4; Store 5 to Store 3; Store 7 to
Store 6; and Store 3 to Store 8. Construct a technological table like that in Fig. 6.7
for the stocks available and required in the specified part of exercise 6.

(a) Part (a).

(b) Part (b).

(c) Part (c). 

9 Suppose there is one unit of stock available at Stores 1, 4, and 5, and a requirement of
one unit of stock at Stores 3, 7, and 8. No stock is available or required at Stores 2
and 6.

(a) Construct a condensed transshipment table like that in Fig. 6.4. Indicate whether 
the table yields an assignment problem. 

(b) Construct an expanded transshipment table like that in Fig. 6.5. Indicate whether 
the table yields an assignment problem. 

10 Explain how a standard transportation problem can be converted into a (large) 
assignment problem. Illustrate your technique using an example with two supply 
points having stocks $S_1 = 2$ and $S_2 = 3$, and two demand points having requirements
$D_1 = 1$ and $D_2 = 4$.

11 (a) Display the technology table corresponding to (1), (2), and (3) in Sec. 6.5 for
the shortest-route example in Fig. 6.8.

(b) Assume every $c_{ij} \ge 0$. Verify that the constraints in (2) can just as well be written
as “greater-than-or-equal” inequalities. Then display the associated dual maximization
problem.

12 Consider the shortest-route example in Fig. 6.8. Add arcs from Node 4 to Node 7 
and from Node 6 to Nodes 3 and 7; delete arcs from Node 4 to Node 3, from Node 6 
to Node 5, and from Node 7 to Node 2. 

(a) Draw the revised network, and construct the corresponding tabular representation
like that in Fig. 6.9.

(b) For part (a), display the technology table corresponding to (1), (2), and (3)
in Sec. 6.5.

*(c) Assume every $c_{ij} \ge 0$. Verify that the constraints in (2) can just as well be written
as “greater-than-or-equal” inequalities. Then display the associated dual maximization
problem.

13 Consider the shortest-route example in Fig. 6.8. Add arcs from Node 1 to Nodes 3 
and 4, and from Nodes 5 and 6 to Node 8. 

(a) Construct a tabular representation like that in Fig. 6.9 where the sink is Node 2
and the source is Node 7.

(b) For part (a), display the technology table corresponding to (1), (2), and (3) in
Sec. 6.5.

*(c) Assume every $c_{ij} \ge 0$. Verify that the constraints in (2) can just as well be
written as “greater-than-or-equal” inequalities. Then display the associated dual
maximization problem.

14 Shortest-Route Problem. Consider the Rock-Bottom Discount Store Problem displayed
in Fig. 6.3. For each part below, draw a network representing the associated
shortest-route problem, construct a tabular representation like that in Fig. 6.9, and describe
a technology table corresponding to (1), (2), and (3) in Sec. 6.5. Find a shortest route
from

(a) Store 1 to Store 3. 

(b) Store 1 to Store 8. 

(c) Store 4 to Store 8. 

15 Equipment Replacement. Consider the example of the Rhode-Bloch Trucking Company,
displayed in Fig. 6.10.

(a) Show a routing through the network corresponding to replacing equipment in
Years 3 and 5. In Years 2 and 5. In Years 3 and 4. In Year 3 only. In Year 4 only.
Indicate the corresponding costs incurred over the horizon.

(b) Construct a tabular representation of the problem like that in Fig. 6.9.

(c) Describe a technology table corresponding to (1), (2), and (3) in Sec. 6.5.

(d) Suppose that the cost of running and maintaining the equipment for one, two,
three, four, and five consecutive periods is 1, 3, 6, 10, and 15, respectively. Suppose
that if a rental at the start of Period 1 extends over one, two, three, four, or five
consecutive periods, the rental cost is 5, 9, 13, 16, or 19, respectively. These
rental costs increase by one each period; for example, if a rental at the start of
Period 3 extends over one, two, or three consecutive periods, the costs are 7, 11,
or 15, respectively. Calculate each $c_{ij}$ from these data.

(e) Assume every $c_{ij} \ge 0$. Verify that the constraints in (2) can just as well be written
as “greater-than-or-equal” inequalities. Then display the associated dual maximization
problem.

16 Equipment Replacement. Consider the example of the Rhode-Bloch Trucking Company, 
displayed in Fig. 6.10. Revise the network for the case where n = 8, and each piece of 
equipment must be kept at least three years. 

*17 Critical Path Scheduling. Consider the case of the Swift Building Company, as shown in
Fig. 6.13. Suppose the immediate predecessors for Job C are Jobs A and D; for Job D
is Job B, instead of Job A; and for Job E is only Job B.

(a) Reformulate the linear programming model appropriate to the new restrictions. 

(b) Display the corresponding critical path tableau, as in Fig. 6.14. 

*(c) Display the corresponding dual problem, as in Fig. 6.15, and construct the
associated network, as in Fig. 6.16.

*18 Critical Path Scheduling. A network representation for the Swift Building Company case 
is shown in Fig. 6.13. An alternative network representation can be constructed as 
follows. Let Node S represent the starting point; all arcs out of Node S have 0 length. 
Let Node T represent the terminating point. In addition, there is a node for each job. 
There is an arc from Node i to Node j if Job i is an immediate predecessor of Job j; 
the length of the arc is the time to complete Job i. Any jobs that can start
immediately have S as an immediate predecessor. Any job not followed by other jobs has
an arc into Node T. Node T serves the same purpose as Job F in Fig. 6.13, and so 
Job F can be replaced by T. 

(a) Draw this alternative network for the case in Fig. 6.13. 

(b) Compare Fig. 6.13 with the above network representation.

(c) Draw the network for the case in exercise 17. 
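
The construction described in exercise 18 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the job names, durations, and predecessor lists below are hypothetical (they are not the data of Fig. 6.13), and the helper names `build_job_network` and `longest_path_length` are invented for this sketch.

```python
from collections import defaultdict


def build_job_network(durations, predecessors):
    """Build the job-node network: one node per job plus S and T.

    Returns a dict mapping (tail, head) to arc length.
    """
    arcs = {}
    has_successor = set()
    for job, preds in predecessors.items():
        if not preds:
            arcs[("S", job)] = 0  # arcs out of Node S have length 0
        for pred in preds:
            # Arc i -> j has length equal to the time to complete Job i.
            arcs[(pred, job)] = durations[pred]
            has_successor.add(pred)
    for job in durations:
        if job not in has_successor:
            # A job followed by no other job leads into Node T.
            arcs[(job, "T")] = durations[job]
    return arcs


def longest_path_length(arcs, start="S", end="T"):
    """Length of a longest S-to-T path in the (acyclic) network."""
    outgoing = defaultdict(list)
    for (tail, head), length in arcs.items():
        outgoing[tail].append((head, length))
    memo = {}

    def longest_from(node):
        if node == end:
            return 0
        if node not in memo:
            memo[node] = max(length + longest_from(head)
                             for head, length in outgoing[node])
        return memo[node]

    return longest_from(start)


# Hypothetical four-job project (assumed data, for illustration only).
durations = {"A": 3, "B": 2, "C": 4, "D": 1}
predecessors = {"A": [], "B": [], "C": ["A"], "D": ["A", "B"]}
arcs = build_job_network(durations, predecessors)
project_length = longest_path_length(arcs)
```

In this representation the length of a longest path from S to T is the minimum project duration, which is what a critical path calculation delivers.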

*19 Employment Scheduling. Consider the case of the Spitzen-Pollish Company discussed
in Sec. 6.7. Let n = 8, and assume that a crew must be kept for at least two periods.

(a) Display the technology as in Figs. 6.17 and 6.18. 

(b) Draw the associated network as in Fig. 6.19.

(c) Construct the associated tabular form as in Fig. 6.20. 

*20 Employment Scheduling. Consider the case of the Spitzen-Pollish Company discussed in
Sec. 6.7. Let n = 9, and assume that a crew must be kept for at least one period but
not longer than four periods. Answer the questions in exercise 19.

*21 Employment Scheduling. Assume n = 6 in the model of Sec. 6.7. Let the wage cost per
crew in Period k be designated as $w_k$, where

$w_1 = 100$, $w_2 = 150$, $w_3 = 180$, $w_4 = 200$, $w_5 = 120$.

A crew hired to start in Period k incurs the hiring cost $g_k$, where

$g_1 = 1$, $g_2 = 2$, $g_3 = 3$, $g_4 = 2$, $g_5 = 1$.

A crew released at the end of Period k incurs the severance cost $f_k$, where

$f_1 = .3$, $f_2 = .5$, $f_3 = .7$, $f_4 = .4$, $f_5 =$ [value missing in source].

If a crew works t consecutive periods, the resultant transportation, housing, recreation,
and related expenses are $h_t$, where

$h_1 = 12$, $h_2 = 13$, $h_3 = 14$, $h_4 = 18$, $h_5 = 25$.

Calculate the appropriate values for all the objective-function coefficients $c_{ij}$.

*22 (a) Randomly place six nodes on a page. Construct a connected network using only
five arcs. What happens if you try to include a loop when constructing the network? 
What happens if you add one more arc? 

(b) Repeat part (a) with seven nodes and six arcs. 

*23 Consider the problem of finding shortest routes to a terminal from every other node
in a general network with p nodes. Assume the shortest routes are unique. Then a
solution will be a set of directed chains ending at the terminal. Reduce the network by
deleting each arc not used by at least one of these chains. Explain why the result will
be a connected network, why it will have no loops, and why it will have p − 1 arcs.

*24 Multicommodity Network. Consider the example in Fig. 6.22. Write a full tableau
description of this model including the restriction that the total flow of all commodities 
on an arc cannot exceed one unit. How does the tableau structure differ significantly 
from that in Fig. 6.7? 


25 Explain your understanding of the following terms:

transportation (or distribution) problem
*capacitated (or bounded-variables) transportation problem
transshipment (or intermediate) point
source point
sink point
buffer stock
assignment problem
shortest-route problem
node
directed arc
terminal
cycle.


*26 Explain your understanding of the following terms:

longest-path problem
critical path
network (or linear graph)
node, point, or vertex
arc, link, or edge
bipartite network
node-arc incidence matrix
path
loop
connected network
directed chain
directed cycle
acyclic network
tree
flow conservation (or material balance) equations
flow gain (or loss)
generalized transportation (or weighted-distribution) problem
multicommodity network.

FORMULATION EXERCISES

27 The Nick U. Moore Razor Blade Company has announced a revolutionary product
development, and the sales response to the company’s advertising program has been
gratifying. The company has two manufacturing plants and three distributing
warehouses located in different parts of the United States. The company ships its razor
blades to the warehouses by rail in carload lots.

This month’s supplies at Plants 1 and 2 are $S_1 = 100$ and $S_2 = 200$, respectively.
The sales potentials at Warehouses 1, 2, and 3 are $D_1 = 150$, $D_2 = 200$, and $D_3 = 250$,
respectively. As you can see, potential demand vastly exceeds available supply, and
consequently some demand will go unsatisfied.

Assume the cost of shipping a carload from Plant i to Warehouse j is $c_{ij}$ and that
the sales revenue per carload at Warehouse j is $p_j$. (The company is able to charge
different prices for its razor blades in different parts of the country.)

(a) Formulate a transportation model to obtain a profit-maximizing solution. Be sure
to indicate how you would compute each objective-function coefficient $r_{ij}$.

(b) Construct the associated table like that in Fig. 6.2a. Be sure to indicate the values
for each $S_i$ and $D_j$ as well as each $r_{ij}$.

28 The Longview Manufacturing Company must meet demand commitments for
$D_1, D_2, \ldots, D_N$ units of its product in Periods 1, 2, ..., N, respectively. The
company’s production capacity in Period t is $m_t$ units, assuming that its work force is
employed for one shift a day. The associated direct cost of each unit is $p_t$. By using
overtime, the company also can produce $e_t$ more units at a direct unit cost of $q_t$, where
$q_t > p_t$. Because the demand commitments fluctuate considerably, the company
expects that it may have to build inventory in some periods to meet demand in
subsequent periods. Each unit of inventory at the end of Period t incurs a holding charge
$h_t$.

(a) Assuming $D_t$, $m_t$, and $e_t$ are integer-valued, formulate a transportation model
that will find a cost-minimizing production and inventory plan. Be sure to
indicate how to compute each objective-function coefficient.

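(b) Using your answer in part (a), construct a table like that in Fig. 6.2a for the data
(N = 4) below.

[Data table garbled in the source. Legible entries: $D_1 = 12$, $D_4 = 22$; $m_2 = 8$,
$m_3 = 10$; $e_1 = 8$, $e_2 = 0$, $e_3 = 5$, $e_4 = 4$; and $h_t = 1$ for all t. The remaining
demand, capacity, and unit-cost entries, including the isolated values 18 and 15,
cannot be assigned reliably.]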


*(c) Suggest an easy-to-apply rule for finding an optimal plan for such a model, in
which $p_t$, $q_t$, and $h_t$ may differ from period to period. Then employ your
suggestion to solve part (b).

29 The Sinbad Steamship Company operates a freighter schedule between Ports X and Y
and Ports A, B, and C. The schedule for the next 15 days is shown in the first table.

Date    Port of Origin    Port of Destination
  3           X                   A
  4           Y                   A
  6           X                   C
  6           Y                   A
  9           X                   B
  9           Y                   A
 10           X                   A
 10           X                   C
 10           Y                   B
 13           Y                   B
 15           Y                   B
 15           Y                   C

After a freighter goes from a port of origin to a port of destination, it may return to
either port of origin. The total time (in days) to sail from port to port, in either
direction, is shown in the second table.

        A    B    C
X       2    3    2
Y       [entries garbled in source: 1 1 1 2 1]


Suppose three freighters are at Port X and three at Port Y on Day 1. Let the
expense of returning a freighter to Port X from Ports A, B, and C be $p_A$, $p_B$, and $p_C$,
respectively, and similarly let the cost of returning a freighter to Port Y be $q_A$, $q_B$,
and $q_C$, respectively.

The Sinbad Company wants to find a routing for ships from Ports A, B, and C 
back to Ports X and Y that minimizes total cost. Show how this problem can be solved 
by a transportation model. 

Let a supply point correspond to (Date t, Port of Destination i), which specifies
that at Date t, one or more freighters (depending on the schedule requirements above)
arrive at the Port of Destination i (either Port A, B, or C), and are ready to be returned
to a port of origin (either Port X or Y). For example, since on Date 6 a freighter must
leave Port X and go to Port C, taking two days of travel time, there is at least one
freighter available at Supply Point (6 + 2, C).

Let a demand point correspond to (Date t, Port of Origin j), which specifies that
on Date t one or more freighters (depending on the schedule requirements above) sail
from Port of Origin j (either Port X or Y). For example, a demand point is (9, X).

Note that the shipping times make it impossible to send a freighter from some of
the supply points in sufficient time to meet the requirements at some of the demand
points. For example, Supply Point (6 + 2, C) cannot meet the requirement at Demand
Point (9, X), because it takes two days to go from Port C to Port X.

Construct a transportation table that indicates the structure of the model. Be sure
to indicate the appropriate $S_i$, $D_j$, and every $c_{ij}$. Display a feasible solution in the table,
and interpret your solution in terms of an actual schedule for the six freighters.

30 Consider the case of the Sinbad Steamship Company in exercise 29. Suppose the
company wants to find the minimum number of freighters required to meet the schedule,
regardless of shipping costs. Show how to solve this problem by a transportation
model. Construct a transportation table that indicates the structure of the model.
Be sure to indicate the appropriate $S_i$, $D_j$, and every $c_{ij}$.

Try to determine a feasible solution in the table for five freighters, and interpret
your solution in terms of an actual schedule. Try to do the same for four freighters.

(Hint: add a fictitious supply point having a large number of freighters available
that can be used to meet any requirement, and a fictitious demand point for any
excess supply of freighters. Verify that the objective is to minimize the number of
freighters that meet requirements from the fictitious supply point.)

31 The Marcus Metal Company mines ore at Locations 1 and 2, transports it to processing
mills at Locations 3 and 4, and ships the final product to Locations 5, 6, and 7, where it is
sold. A schematic map of the locations and product flow is given in Fig. 6.23.

Let $S_1$ and $S_2$ represent the maximum supply available at Locations 1 and 2,
respectively, and $D_5$, $D_6$, and $D_7$ the demand requirements that must be met at
Locations 5, 6, and 7, respectively. The total maximum supply exceeds the total demand
requirements.

Assume $m_i$ is the cost of a unit of supply at Mine i (i = 1, 2), [symbol missing in
source] is the processing cost of a unit of supply at Mill j (j = 3, 4), $s_{ij}$ is the shipping
cost of a unit of supply from Mine i to Mill j (i = 1, 2 and j = 3, 4), and $t_{jk}$ is the
shipping cost of a unit of supply from Mill j to Location k (j = 3, 4 and k = 5, 6, 7).

(a) Construct a condensed transshipment table like that in Fig. 6.4. Be sure to
indicate in detail how to calculate each objective-function coefficient $c_{ij}$.

(b) Construct an expanded transshipment table like that in Fig. 6.5. Be sure to
indicate how to calculate each objective-function coefficient.

(c) Construct a technology representation like that in Fig. 6.7. Use inequalities 
where appropriate to indicate that 
total supply exceeds total demand. 

32 Consider the transshipment network in Fig. 6.24, where a positive number at a node
indicates a supply and a negative number a requirement. Note there are two arcs,
one in each direction, between every pair of nodes. The unit cost of shipping each
item from Node i to Node j is $c_{ij}$; $c_{ij}$ need not equal $c_{ji}$.

Figure 6.24

(a) Construct a condensed transshipment table like that in Fig. 6.4.

(b) Construct an expanded transshipment table like that in Fig. 6.5. Be sure to
indicate the values for the supplies and demands.

(c) In the table of part (b), indicate the following shipment plan: five units from
Node 5 to Node 4 via Node 3; five units from Node 2 to Node 4 via Nodes 5 and 3;
eight units from Node 2 to Node 1 via Node 5; and 12 units from Node 3 directly
to Node 1.

(d) Give another routing having the same cost as that in part (c). 

(e) Construct a technology representation like that in Fig. 6.7. 

*(f) Formulate the technology in part (e) so that the supply constraints are
“less-than-or-equal” inequalities, and the demand constraints are “greater-than-or-equal”
inequalities. Then write the associated dual maximizing problem.

33 Consider the transshipment network in Fig. 6.25, where a positive number at a node
indicates a supply and a negative number a requirement; nodes with 0 indicate there
is neither a supply nor a demand. Note there are two arcs, one in each direction,
between connected nodes. The unit cost of shipping each item from Node i to Node j
is $c_{ij}$; $c_{ij}$ need not equal $c_{ji}$.

Figure 6.25

(a) Construct a condensed transshipment table like that in Fig. 6.4. Show a feasible 
solution, and indicate whether your solution implies a unique routing. 

(b) Construct an expanded transshipment table like that in Fig. 6.5. Are there any 
sources or sinks? Be sure to indicate in your table the values for the supplies and 
demands. 

(c) In the table of part (b) indicate the following shipment plan: two units from
Node 3 to Node 1 via Nodes 2 and 4; two units from Node 6 to Node 1 via Node 4;
four units from Node 6 to Node 5 via Node 4; two units from Node 8 to Node 5
via Nodes 3, 2, and 4; and five units from Node 8 to Node 7 via Nodes 3, 2, 4, and 5.

(d) Construct a technology representation like that in Fig. 6.7. 

34 The McReady Company operates a machine shop. Today’s order file consists of n
jobs, each of which can be manufactured on any of n different machines. Let [symbol
missing in source] be the amount of time (make-ready or setup plus manufacturing) to
complete Job i on Machine j. Formulate an optimization model to get all the orders
completed with a minimum total expenditure of time.

35 Joe Friendly, the Associate Dean of a large business school, must make a class
alignment schedule for the faculty. Specifically, he must assign n professors to teach n
classes. In the past, the students have filled out teacher-effectiveness rating
questionnaires, so the Associate Dean has information as to how well the students liked the
professors when they have taught these several classes. (In some cases, a professor has
never taught a particular class, so the Associate Dean must guess at a rating. In other
cases, the professor does not wish, or is not prepared, to teach a particular class, and
so the Associate Dean must rule out this assignment.)

(a) Explain how the Associate Dean might use a mathematical model to maximize 
student happiness. 

(b) In your opinion, is this any way to run a business school? 

36 Mrs. Flo Updegraph, the room scheduling clerk in a liberal arts college, has n classes
to assign to n available rooms. Certain assignments are ruled out for reasons such as
the class size exceeds the room size, the classroom is unavailable for use at the required
time, or the classroom is too long a walk from the professor’s office. Suggest a
mathematical model that might assist the room scheduler.

37 The Lieberman Supply Company stocks an item that deteriorates with time as
measured in weekly periods. Suppose that Lieberman has on hand four such items, indexed
i = 1, 2, 3, and 4, the present age of each being denoted by $A_i$. He has contracted to
sell his stock as follows: he must deliver one item $t_1$ weeks from now, one item $t_2$ weeks
from now, one item $t_3$ weeks from now, and one item $t_4$ weeks from now. The revenue
he receives for each item is a function of its age at the time of delivery; this function is
denoted as [function symbol missing in source], A being the relevant age.

Formulate an optimization model that can enable Lieberman to determine which
item he should supply at each delivery date so that he can maximize his total
revenue.

38 The Overland Airlines has many scheduled flights between New York and Los Angeles;
some are direct flights and others make intermediate stops. Assume there are an equal
number of flights, N, each day in both directions. Let those daily flights initiating in
New York be indexed i = 1, 2, ..., N, and those in Los Angeles be indexed j =
1, 2, ..., N. If a jet aircraft is repeatedly assigned to Flight i out of New York and
Flight j out of Los Angeles, then the airline can calculate the resultant number of
nonflight hours per week for planes assigned to this round-trip route. Such downtime
is due not only to the delay in Los Angeles between the arrival of Flight i and the
departure of Flight j, and in New York between the arrival of Flight j and the
departure of Flight i, but is due also to the maintenance time required by the particular
pairing. Explain how you can apply a mathematical optimization model to find
pairings for flights between New York and Los Angeles that minimize total downtime
per week of all the aircraft.

How does the formulation change if an aircraft need not be assigned to the same
flight out of New York and Los Angeles on each trip?

39 Consider the network in Fig. 6.26. The problem is to find a shortest route from Node 11
to Node 1. Let $c_{ij}$ be the length of arc (i, j).

(a) Construct a tabular representation like that in Fig. 6.9.

(b) For part (a), display the technology table corresponding to (1), (2), and (3) in
Sec. 6.5.

40 Consider the network in exercise 39. Add an arc from Node 1 to Node 11. Suppose 
the problem is to find a shortest route from Node 9 to Node 3. Answer the questions 
posed in exercise 39. 

41 Equipment Replacement. The Goode-O’Toole Company runs a machine shop
containing an expensive drill press that must be replaced periodically as it wears out. The
Vice-President of Manufacturing has just authorized installing a new model, but has
asked the foreman to devise an optimal replacement plan for the next seven years,
after which the drill press will no longer be needed.

Let $p_t$ be the cost of a new model in Period t, where

$p_1 = 100$, $p_2 = 105$, $p_3 = 110$, $p_4 = 115$, $p_5 = 120$, $p_6 = 125$, $p_7 = 130$,

so that the currently authorized installation incurs the cost 100. Let $v_k$ be the salvage
value of a press that is sold at the end of k periods of use, where

$v_1 = 50$, $v_2 = 25$, $v_3 = 10$, $v_4 = 5$, $v_5 = 2$, $v_6 = 1$, $v_7 = 0$.

Thus, if the new press is sold at the end of the first period, Goode-O’Toole realizes a
revenue of 50; if it is sold at the end of Period 2, the revenue is 25; if it is sold at the
end of Period 7, the revenue is 0.

Let $r_k$ be the operating cost of a piece of equipment during its kth consecutive
period of use, where

$r_1 = 30$, $r_2 = 40$, $r_3 = r_4 = 50$, $r_5 = 60$, $r_6 = 70$, $r_7 = 100$.

(a) Formulate a shortest-route problem to find an optimal replacement policy.

(b) Construct a tabular representation of the problem like that in Fig. 6.9. Be sure to
calculate every $c_{ij}$.

(c) Display a technology table corresponding to (1), (2), and (3).
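
As a hedged illustration of how the data in exercise 41 might feed a shortest-route calculation, the Python sketch below uses one plausible reading of the arc costs: buying a press at the start of Period i and selling it at the start of Period j costs the purchase price, less the salvage value after j − i periods, plus the operating costs of those periods. This reading, the values of $r_k$ as recovered from a damaged line, and the function names `replacement_costs` and `shortest_route` are assumptions of this sketch, not statements from the text.

```python
def replacement_costs(p, v, r):
    """Arc costs c[(i, j)] for buying at the start of Period i, selling at Period j.

    Assumed form: purchase price - salvage value + operating costs.
    """
    n = len(p)
    cost = {}
    for i in range(1, n + 1):
        for j in range(i + 1, n + 2):
            k = j - i  # number of periods the press is kept
            cost[(i, j)] = p[i - 1] - v[k - 1] + sum(r[:k])
    return cost


def shortest_route(cost, last):
    """Backward recursion over the acyclic network with nodes 1, ..., last."""
    best = {last: (0, None)}
    for i in range(last - 1, 0, -1):
        best[i] = min((cost[(i, j)] + best[j][0], j)
                      for j in range(i + 1, last + 1))
    route, node = [1], 1
    while node != last:
        node = best[node][1]
        route.append(node)
    return best[1][0], route


p = [100, 105, 110, 115, 120, 125, 130]   # purchase prices p_1, ..., p_7
v = [50, 25, 10, 5, 2, 1, 0]              # salvage values v_1, ..., v_7
r = [30, 40, 50, 50, 60, 70, 100]         # operating costs r_1, ..., r_7 (as read)
c = replacement_costs(p, v, r)
total_cost, route = shortest_route(c, last=8)
```

Here Node 8 stands for the end of the seven-year horizon, and the nodes on `route` mark the periods at whose start a press is purchased.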

*42 Utopian Construction Company wants to apply critical path scheduling to two
projects. For each, formulate the appropriate linear programming model. Display the
corresponding critical path tableau, as done in Fig. 6.14. Construct a network like
that in Fig. 6.16. Throughout, let $t_j$ denote the completion time for Job j.

(a)
Job    Immediate Predecessors
A      -
B      -
C      A
D      A
E      B, C
F      B, C
G      D, E
H      F

(b)
Job    Immediate Predecessors
A      -
B      -
C      A
D      A
E      B
F      B
G      D
H      C, F

*43 In exercise 18, an alternative technique was outlined for constructing a network 
representation for a critical path project. Apply this method to the designated part of 
exercise 42. 

(a) Part (a). 

(b) Part (b). 

*44 After finding the critical path in the project in part (a) of exercise 42, the Utopian
Company realizes that it must schedule overtime labor in order to fulfill its contractual
obligation to complete the project by time T.

The network representation of the project has six nodes, indexed i = 1, 2, ..., 6,
where Node 1 denotes the start of the project and Node 6 designates that all the tasks
are complete. Let

$d_{ij}$ = shortest possible interval of time required to complete
the task associated with arc (i, j);

each $d_{ij}$ is a known constant, and typically involves utilizing overtime to the fullest
extent possible.

Let the decision variables be

$y_{ij}$ = the interval of time exceeding $d_{ij}$ to complete the task
associated with arc (i, j),

$t_i$ = the earliest point in time to start any task leading out
from Node i.

The $y_{ij}$ and $t_i$ are the unknowns in the optimization problem. These values must
satisfy the constraints

$t_i + d_{ij} + y_{ij} \le t_j$ for every arc (i, j).

In addition, $t_6 \le T$ to meet the contractual commitment.

If full overtime is not used on the task associated with arc (i, j), then $y_{ij} > 0$;
assume the associated cost savings is $s_{ij} y_{ij}$.

(a) Write a linear programming formulation of the optimizing problem. Show all
the constraints and the entire objective function.

*(b) Write the corresponding dual problem. Is it a network problem? If so, explain
its structural components.


*45 In each part below, draw the network corresponding to the following node-arc
incidence matrix:

(a)
           x12  x14  x23  x24  x52  x53  x54
Node 1       1    1
Node 2      -1         1    1   -1
Node 3                -1             -1
Node 4           -1        -1             -1
Node 5                           1    1    1

(b)
           x12  x13  x14  x23  x24  x25  x32  x35  x43  x45
Node 1       1    1    1
Node 2      -1              1    1    1   -1
Node 3           -1        -1              1    1   -1
Node 4                -1        -1                    1    1
Node 5                               -1        -1        -1

*46 Consider the constraints

$x_{12} + x_{13} + x_{14} - x_{41} \le 6$
$x_{12} + x_{32} \ge 5$
$x_{41} + x_{42} + x_{43} - x_{14} \le 4$
$x_{14} + x_{34} \ge 2$
every $x_{ij} \ge 0$.

Draw a network corresponding to these restrictions. (Hint: convert the inequalities to
equalities by adding slack variables; then add a redundant restriction corresponding
to a fictitious node.)

*47 Explain how to convert a transshipment network with capacity restrictions on flow
through nodes into an equivalent (enlarged) transshipment problem with capacity
restrictions on the flow through arcs. (Hint: convert a transshipment node into two
nodes connected by an arc.)

*48 Caterer Problem. The Cole Food Company, a catering firm, requires fresh napkins
at the start of Day j, where j = 1, 2, ..., T. Normal laundering takes one full day at
B cents a napkin; rapid laundering takes overnight at C cents a napkin. New napkins
may be purchased at A cents a napkin. The caterer wants a plan for purchasing and
laundering napkins to minimize total costs, subject to meeting all his fresh-napkin
requirements.

Let $x_j$ be the number of napkins purchased for use on the jth day; $u_j$ the number
of clean napkins left over (at the end of the day); $y_j$ the number sent for normal
laundry service (at the end of the day); $z_j$ the number sent for rapid laundry service
(at the end of the day); and $s_j$ the number of soiled napkins on hand and not sent to
the laundry (at the end of the day). The technology matrix can be summarized by
the table.

                                              x_j   u_j   y_j   z_j   s_j
Allocation of clean napkins in Day j
Allocation of soiled napkins in Day j
Allocation of clean napkins in Day j + 1
Allocation of soiled napkins in Day j + 1
Allocation of clean napkins in Day j + 2
Allocation of soiled napkins in Day j + 2
Costs

(a) Write in full the model for T = 5.

(b) Draw the associated network. (Hint: add a redundant restriction corresponding
to a fictitious node.)

*49 Multicommodity Network. Figure 6.27 depicts a transshipment network over which two
kinds of resources, Commodity 1 and Commodity 2, can flow. Each commodity may flow
along any arc in either direction.

An unlimited supply of Commodity 1 is available at Nodes $N_1$ and $N_2$; the final
destination for this commodity is Node $N_3$. An unlimited supply of Commodity 2 is
available at Node $N_4$; the final destination for this commodity is Node $N_1$. On the
arc between Node $N_i$ and Node $N_j$, there is an upper capacity limit $b_{ij}$ on the total
amount of flow, that is, on the flow of both commodities in both directions.

Each unit of Commodity 1 which reaches its final destination is worth $v_1$, and each
unit of Commodity 2 which reaches its final destination is worth $v_2$. The problem is to
find a feasible flow of commodities through the network having maximum total value.
This problem can be viewed in two ways, as explained in each part below.

(a) Let

$x_{ijk}$ = the amount of flow of Commodity k from Node $N_i$ to Node $N_j$,

$s_{ik}$ = the amount of Commodity k originating at Node $N_i$,

where in this example you need only $s_{11}$, $s_{21}$, and $s_{42}$. Write the flow constraints for
each commodity at each node, the flow capacity restrictions along each arc, and
the objective function.

(b) An alternative way to analyze the network is to enumerate for both commodities
every chain from each source node to each destination node through any possible
intermediate node.

Let $y_t$ be the amount of flow along Chain t. Each chain at unit flow level uses
up a unit of capacity along the specific arcs comprising the chain; consequently
there is a total flow constraint for each arc. Write the flow constraints along each
arc and the objective function.

(c) Compare the two formulations. 

*50 Construct a small generalized network problem having a unique optimal solution that 
is not integer-valued. 


MIND-EXPANDING EXERCISES 

51 A branch of the armed services is considering a long-range manpower training
program. To keep the exposition simple, suppose at the beginning of the horizon, there
are $M_i$ men in each Skill-Class i, where i = 1, 2, 3. The requirements for men of
Skill-Class i in Period t are designated as $r_{it}$. It takes one period to retrain a man from
one skill-class to another. The military service may also recruit new men; a new
recruit takes two periods to train. There are limits on the number of men that can be
taught a particular skill in a period, and on the number of recruits. Discuss how
you might formulate a network optimization model to provide an effective recruiting,
training, and retraining program.

52 Consider a classical transportation problem. In each part below, further constraints are 
added. Suggest a way to reformulate the model to yield an enlarged transportation 
problem in standard format (like that in Fig. 6.2a). 

(a) Each activity has a capacity restriction $x_{ij} \le$ [bound missing in source].

[Part (b) is missing in the source.]

(c) $x_{11} + x_{12} + x_{13}$ [relation and right-hand side illegible in source], where n > 3.

53 Consider a capacitated transshipment problem where each $x_{ij} \le c_{ij}$. Suggest a way
to reformulate the model to yield an enlarged transportation problem in a standard
format (like that in Fig. 6.2a).

54 Review the case of the Kleen City Police Department in exercise 30 of Chap. 2 and
the general formulation in exercise 27 of Chap. 5.

(a) Show that when the number of periods, T, is an even number, the model is
equivalent to a network problem. Display the network for the Kleen City case.

(b) Explain why the model is not equivalent to a network problem when the number
of periods, T, is an odd number.

55 Bottleneck Production Line Assignment Problem. Suppose each of n men is to be assigned to
performing one of n jobs on a single production line. Man i assigned to Job j can
produce $a_{ij}$ items per unit of time, where $a_{ij} \ge 0$. The problem is to assign the men to
maximize the rate of production for the entire line. The bottleneck is whatever man in
the assignment produces at the slowest rate. Hence, the optimization problem is to
pick an assignment that makes the bottleneck rate as large as possible.

(a) Verify that a solution to such a problem does not depend on the actual
magnitudes of the $a_{ij}$, but only on their ordering (or ranking).

(b) Given the validity of part (a), rank the $a_{ij}$, and calculate the associated amount
$b_{ij} = 1 - 2^{-m}$, where m is the rank order of $a_{ij}$. For simplicity of exposition,
assume all the $a_{ij}$ are distinct. Thus, the smallest $a_{ij}$ yields a corresponding
$b_{ij} = 1 - 2^{-1}$; the next smallest yields a corresponding $b_{ij} = 1 - 2^{-2}$; and
the largest $a_{ij}$ yields a corresponding $b_{ij} = 1 - 2^{-n^2}$.

Show that an optimal solution to the assignment problem using the $b_{ij}$ also
solves the bottleneck problem.
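
The claim in part (b) of exercise 55 can be checked numerically on a small case. The Python sketch below is only an illustration, not a proof: the 3-by-3 production rates are hypothetical, the helper names are invented here, assignments are enumerated by brute force rather than by an assignment algorithm, and exact fractions are used so that the small terms $2^{-m}$ are not lost to rounding.

```python
from fractions import Fraction
from itertools import permutations


def bottleneck_value(a, assignment):
    """Slowest production rate when man i is assigned to job assignment[i]."""
    return min(a[i][j] for i, j in enumerate(assignment))


def transformed_scores(a):
    """b_ij = 1 - 2**(-m), where m is the rank of a_ij (smallest rate has m = 1)."""
    ranked = sorted(value for row in a for value in row)
    rank = {value: m for m, value in enumerate(ranked, start=1)}
    return [[1 - Fraction(1, 2 ** rank[value]) for value in row] for row in a]


def best_assignment_by_sum(b):
    """Assignment maximizing the sum of the b_ij, found by enumeration."""
    n = len(b)
    return max(permutations(range(n)),
               key=lambda perm: sum(b[i][j] for i, j in enumerate(perm)))


def best_bottleneck(a):
    """Largest achievable bottleneck rate, found by enumeration."""
    n = len(a)
    return max(bottleneck_value(a, perm) for perm in permutations(range(n)))


# Hypothetical production rates with distinct entries (assumed data).
a = [[4, 9, 2],
     [7, 3, 8],
     [5, 6, 1]]
chosen = best_assignment_by_sum(transformed_scores(a))
rate_from_sum = bottleneck_value(a, chosen)
rate_direct = best_bottleneck(a)
```

The two rates agree in this instance because the penalty $2^{-m}$ attached to the lowest-ranked entry of an assignment exceeds the sum of the penalties of all higher-ranked entries.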

*56 Consider the general network optimization formulation in Sec. 6.8. Assume that (1)
and (3) are stated as “greater-than-or-equal” inequalities. Show how such a problem
can be reformulated as an enlarged network problem in which once again the sum of
the $T_k$ equals 0 and the flow conservation constraints are properly stated as equalities.

CHAPTER 7

Network Algorithms


7.1 TOPICS IN FOCUS 

By now you are familiar with many applications of linear programming,
including important examples of optimization within a network. You were advised
in Chap. 4 that rarely, if ever, would you solve a linear programming model by
hand computations. Nevertheless, it was explained that you need to understand the
fundamental mathematical ideas in an optimizing algorithm to become a skillful
user of the technique. For this reason, you mastered the key concepts of the simplex
method. Chapter 5 illustrated the prime significance of the notion of duality in
linear models with numerous examples of sensitivity testing. By the end of that
chapter you had been introduced to many of the insights provided by linear
programming.

Chapter 6 indicated that the size of network models is so large in practice that 
a naive application of the simplex method would be grossly inefficient and in some 
instances not even possible, given the capabilities of available computer programs. 
The examples of Chap. 6 supported this statement, for they demonstrated how a 
large number of variables can occur in even small-scale network problems. But 
you were informed that it is possible to exploit the special structure of a network 
technology so that the required computational effort becomes reasonable. 

At this point you may question the rationale for further explaining such linear
programming algorithms, rather than proceeding swiftly to other introductory
topics in operations research. Do any worthwhile insights remain? If so, why
not leave them to the technical specialists? We offer two principal reasons for
studying the particular techniques here.

To appreciate the first reason, you must recognize that the future direction of
operations research is toward applications of staggering dimensions. High-speed
electronic computing machinery, combined with successful operations research
experience, has generated a steady growth in corporatewide centralized planning,
information, and control systems. A company need not abrogate a policy of
decentralized operating responsibilities to engage in total enterprise planning. To the
contrary, such a comprehensive approach provides the very data needed to ensure
that a divisionalized management scheme is truly efficient. Integrated planning
leads to consistent targets that reflect corporate objectives and at the same time
recognize the operating limitations of the component divisions.

To make such large-scale applications a practical reality, the design of efficient 
methods of solution will receive increasing emphasis. Consequently, you will need 
a proper perspective on these developments, and therefore you should understand 
what is meant by exploiting special structure. To do so, you must see the concept illustrated,
and this chapter serves the purpose by discussing network algorithms. 

A second reason for focusing on these techniques relates to their connection with 
dynamic and multistage models having deterministic linkage, which will be covered
in Chaps. 8 through 12. The shortest-route algorithms you study here in 
general terms can be viewed as the basis for solving such models. Thus the chapter 
sets the stage for various dynamic applications that you will learn subsequently. 

Four notions comprise the foundations from which network optimization algorithms
are constructed. One is duality, which is reviewed in the next section. 
Another is called the triangularity property (or tree structure) of a basis. This idea is 
explained in Sec. 7.3. The third concept is the determination of shortest routes, 
covered in Secs. 7.6 and 7.7. The fourth notion relates to maximizing the total 
flow in a capacitated transshipment network model with a single source and a 
single sink. Because the maximum flow model is needed primarily for the more 
sophisticated algorithms, it is discussed in Appendix I. 

7.2 SUMMARY OF THE FUNDAMENTALS 

We begin with a discussion of the standard transportation model that you 
studied in Chap. 6, namely, the linear program: 

(1)  minimize $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to 

(2)  $\sum_{j=1}^{n} x_{ij} = S_i$  for i = 1, 2, ..., m  (supply)

(3)  $\sum_{i=1}^{m} x_{ij} = D_j$  for j = 1, 2, ..., n  (demand)

(4)  $x_{ij} \ge 0$  for all i and j, 

where all the $S_i$ and $D_j$ are nonnegative integers satisfying 

(5)  $\sum_{i=1}^{m} S_i = \sum_{j=1}^{n} D_j$  (total supply = total demand). 

As you learned in Chap. 6, with some ingenuity you can transform many a network 
optimization problem into an equivalent transportation model. 

Because of the equality between total supply and total demand in (5), and the 
structure of the supply and demand equations (2) and (3), the model contains a 
redundancy, in that if any m + n - 1 of the restrictions in (2) and (3) are satisfied, 
then the remaining restriction is satisfied too. [This fact can be demonstrated 
mathematically by multiplying each restriction in (3) by -1, by then adding any 
selection of m + n - 1 of the restrictions, and by using (5) to simplify the constant 
on the right-hand side. The resulting composite equation will be identical to the 
remaining restriction.] Consequently, any one of the supply and demand equations 
in (2) and (3) can be dropped without harm. The resultant model then consists 
of m + n - 1 independent restrictions, and so any basic solution contains this number
of variables. 
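
The bracketed argument can be written out for one particular case. The sketch below assumes, purely for concreteness, that the restriction dropped is the demand equation for Column n; any other choice works the same way.

```latex
\begin{align*}
\sum_{i=1}^{m} x_{in}
  &= \sum_{i=1}^{m}\sum_{j=1}^{n} x_{ij} \;-\; \sum_{j=1}^{n-1}\sum_{i=1}^{m} x_{ij} \\
  &= \sum_{i=1}^{m} S_i \;-\; \sum_{j=1}^{n-1} D_j
     && \text{by (2) and the first } n-1 \text{ equations of (3)} \\
  &= D_n && \text{by (5).}
\end{align*}
```

So once the m supply equations and the first n - 1 demand equations hold, the last demand equation holds automatically.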

It is straightforward to show that the corresponding dual linear programming 
problem can be written as 

(6)  maximize $\sum_{i=1}^{m} S_i v_i + \sum_{j=1}^{n} D_j w_j$

subject to 

(7)  $v_i + w_j \le c_{ij}$  for all (i, j), 

where the $v_i$ and $w_j$ are unrestricted in sign. 

► A consequence of the Dual Theorem and the Theorem of Complementary Slackness 
in Chap. 5 is that if $x_{ij}^*$ for all (i, j) satisfy (2), (3), and (4), and if $v_i^*$ and $w_j^*$ satisfy (7), 
and also if 

(CS)  $x_{ij}^* (v_i^* + w_j^* - c_{ij}) = 0$  for all (i, j)  (complementary slackness), 

then $x_{ij}^*$ is an optimal solution to the transportation model (1) through (4). 

Based on this observation, a variety of network algorithms have been devised. They 
share the idea that at each iteration the trial values for $x_{ij}$, $v_i$, and $w_j$ always satisfy two 
out of the following three conditions: (i) primal feasibility (2), (3), and (4); (ii) dual 
feasibility (7); and (iii) complementary slackness (CS). They differ principally in which 
two of the three conditions are maintained. In these algorithms, the iterations terminate 
when all three conditions are simultaneously satisfied. For example, the simplex method 
in Secs. 7.3 and 7.4 maintains (i) and (iii) at every iteration, and stops as soon as (ii) 
holds. ◄
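
The reason the three conditions together imply optimality can be sketched in two lines. The derivation below is offered as an illustration, not as the proof given in Chap. 5; it uses only (2), (3), (4), (7), and (CS). For any primal-feasible $x_{ij}$ and the dual-feasible $v_i^*$, $w_j^*$:

```latex
\begin{align*}
\sum_{i}\sum_{j} c_{ij} x_{ij}
  &\ge \sum_{i}\sum_{j} (v_i^* + w_j^*)\, x_{ij}
     && \text{by (4) and (7)} \\
  &= \sum_{i} S_i v_i^* + \sum_{j} D_j w_j^*
     && \text{by (2) and (3)} \\
  &= \sum_{i}\sum_{j} (v_i^* + w_j^*)\, x_{ij}^*
     && \text{by (2) and (3) applied to } x_{ij}^* \\
  &= \sum_{i}\sum_{j} c_{ij} x_{ij}^*
     && \text{by (CS).}
\end{align*}
```

Hence no feasible solution costs less than $x_{ij}^*$.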

7.3 SIMPLEX TECHNIQUE FOR TRANSPORTATION 
PROBLEMS 

Here we apply the simplex method to solve the transportation model. We 
demonstrate that the special structure of the transportation model substantially 
reduces the computational burden of the algorithm. This section motivates the 
method with a brief review of certain essentials from Chaps. 4 and 5, formulates 
the computational rules, and illustrates them with an example. Then Sec. 7.4 
fills in several of the missing details on how to apply the method. 

Simplex steps. The standard simplex method instructions are paraphrased
below as a starting point of the discussion: 

Step 1. Select a set of m + n - 1 routes that provides an initial basic feasible 
solution. 

Step 2. Check whether the solution is improved by introducing a nonbasic 
variable. If so, go to Step 3; otherwise stop. 

Step 3. Determine which route leaves the basis when the variable that you 
selected in Step 2 enters. 

Step 4. Adjust the flows of the other basic routes. Return to Step 2. 

As you will see in the illustration below, all four steps are very easily performed 
for a transportation problem. Assume the model has been put in a classical 
transportation problem format and is arrayed in a standard m X n table, such as 
Fig. 7.1. The ensuing discussion refers to this table. We begin with a verbal description
of the algorithm. Do not be concerned if you find certain details unclear, 
since the intent here is only to motivate the overall logic of the approach. A careful 
description of the steps is given afterwards. 

Suppose you have a trial basic-feasible solution, which consists of m + n - 1 
routes. If a unit of flow is introduced on a nonbasic route, then to maintain feasibility
in both the corresponding row and column of Fig. 7.1, a unit must be withdrawn
from a basic route in the same row, and similarly in the same column. 
But these two alterations in turn lead to unit changes in the flows of other basic 
variables. For reasons that will be apparent shortly, each nonbasic variable gives 
rise to a unique pattern of alterations of the basic variables. 

Therefore you can evaluate the resultant change as follows: take the total 
increase in cost from adding a unit of flow to both the new route and the appropriate
basic routes, and deduct the total decrease in cost from removing a unit of 
flow on the other basic routes in the associated unique pattern of alterations. 
If the net amount shows an improvement and the new route is selected to enter 
the solution, then the pattern of alterations also indicates how many units of 
flow can be placed on the new route. Specifically, the various flows are increased 
and decreased according to the pattern until that level is reached where a basic 
variable is reduced to zero and thereby dropped from the basis. 

If the procedure for testing whether a nonbasic route should enter a trial 
solution were really as tedious as the above description suggests, this approach 
would be a marginal improvement at best on the way the simplex algorithm 
was applied in Chap. 4. You can evaluate a nonbasic route much more simply, 
however, by employing an insight from duality and exploiting the structure of the 
network. 

You learned in Sec. 5.5, which discussed the solution of the dual problem, 
that for a given basic solution, the value from introducing a unit of a nonbasic 
variable is the difference between the left- and right-hand sides of the dual restriction
for this variable. Thus the key idea is to solve the m + n - 1 dual restrictions 
corresponding to the current basis: 

(1)  $v_i + w_j = c_{ij}$  for each basic variable $x_{ij}$, 

and then with these values to evaluate 

(2)  $v_i + w_j - c_{ij}$  for $x_{ij}$, a nonbasic variable. 

The quantities in (2) correspond to the coefficients in Row 0 of the simplex method. 
When (2) is positive, the nonbasic variable is a candidate to enter the next 
basis. If all such quantities are nonpositive, implying dual as well as primal 
feasibility, the current basic solution is optimal. 

Algorithm illustrated. Consider the transportation problem shown 
in Fig. 7.2. (Jot down on a piece of paper your guess of an optimal solution; 
calculate its objective-function value.) 


Figure 7.2. Transportation Example. 

Unit costs $c_{ij}$:

              Column
Row         1    2    3    4    Supply
 1          2    3   11    7       6
 2          1    0    6    1       1
 3          5    8   15    9      10
Demand      7    5    3    2


Step 1 of the algorithm asks for a starting trial basic solution, which contains 
m + n - 1 = 3 + 4 - 1 = 6 active routes. Suppose you allocate the supply of 
6 units available in Row 1 to the cheapest route (1, 1). This exhausts the supply 
and leaves one more unit of demand to be filled in Column 1. Similarly, allocate 
the supply of 1 unit in Row 2 to the cheapest route (2, 2). The supply of 10 in 
Row 3 is then routed to meet all the remaining unfilled demand. The result, displayed
in Fig. 7.3, has an objective-function value of 112. (Was your guessed solution
any better? The advanced material below discusses a method for selecting a 
good initial solution.) 

Figure 7.3. Initial Basic Solution. 

Flows $x_{ij}$ on the basic routes:

              Column
Row         1    2    3    4    Supply
 1          6    .    .    .       6
 2          .    1    .    .       1
 3          1    4    3    2      10
Demand      7    5    3    2

Total Cost = 2 x 6 + 0 x 1 + 5 x 1 + 8 x 4 + 15 x 3 + 9 x 2 = 112 

Step 2 calls for an evaluation of each unused route to see whether a unit of flow 
on the route improves the objective function. Consider route (1, 2). As you can see 
from Fig. 7.3, one unit allocated to this route must be withdrawn from the flow 
on route (1, 1). Then to meet the demand requirement in Column 1, another 
unit must be shipped on route (3, 1), and this unit may be removed from route 
(3, 2). The pattern of alterations is shown in Fig. 7.4. 


Figure 7.4. Pattern of Alterations for One Unit of Flow on Route (1, 2). 

              Column
Row         1        2        3    4    Supply
 1        6 - 1     + 1       .    .       6
 2          .        1        .    .       1
 3        1 + 1    4 - 1      3    2      10
Demand      7        5        3    2

Improvement Potential = Total Decrease - Total Increase 
                      = (2 + 8) - (3 + 5) = 2 per unit 


Will introducing a unit of flow on (1, 2) reduce the value of the objective function?
A saving of 10 (= 2 + 8) results from reducing a unit of flow on routes (1, 1) 
and (3, 2). However, there is a partially offsetting increase in cost of 8 (= 3 + 5) 
due to adding a unit of flow on routes (1, 2) and (3, 1). The net improvement is 
2 (= 10 - 8). 
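
The pattern of alterations just traced by hand can also be found mechanically. The following is a minimal sketch, not part of the text's method as stated: it assumes the basic routes are stored as 0-based (row, column) pairs, and the function name `alteration_pattern` is hypothetical. It treats rows and columns as nodes and basic routes as links, and walks the unique chain of basic routes joining the entering route's row to its column, alternating decrease and increase.

```python
# Unit costs of the example (0-based indices: route (1, 2) is COSTS[0][1]).
COSTS = [[2, 3, 11, 7], [1, 0, 6, 1], [5, 8, 15, 9]]
# Basic routes of the initial solution in Fig. 7.3.
BASIS = [(0, 0), (1, 1), (2, 0), (2, 1), (2, 2), (2, 3)]


def alteration_pattern(basis, entering):
    """Return {route: +1 or -1} for one unit of flow on the entering route."""
    i0, j0 = entering
    # Rows are nodes ("r", i), columns are nodes ("c", j); basic routes link them.
    adj = {}
    for i, j in basis:
        adj.setdefault(("r", i), []).append(("c", j))
        adj.setdefault(("c", j), []).append(("r", i))

    def path(node, target, seen):
        # Depth-first search for the chain of basic routes from node to target.
        if node == target:
            return [node]
        seen.add(node)
        for nxt in adj.get(node, []):
            if nxt not in seen:
                rest = path(nxt, target, seen)
                if rest:
                    return [node] + rest
        return None

    nodes = path(("r", i0), ("c", j0), set())
    if nodes is None:
        raise ValueError("the routes do not form a basis")
    pattern = {entering: +1}
    sign = -1  # the first basic route on the chain loses a unit
    for a, b in zip(nodes, nodes[1:]):
        route = (a[1], b[1]) if a[0] == "r" else (b[1], a[1])
        pattern[route] = sign
        sign = -sign
    return pattern


pattern = alteration_pattern(BASIS, (0, 1))
# Improvement = total decrease in cost minus total increase in cost.
improvement = -sum(sign * COSTS[i][j] for (i, j), sign in pattern.items())
print(pattern, improvement)
```

For route (1, 2) the sketch reproduces the pattern of Fig. 7.4 and the net improvement of 2 per unit.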

By a similar analysis, you can obtain the improvement potentials for the other 
nonbasic routes, as summarized in Fig. 7.5. Each of these is the value that would 
appear as the coefficient of the associated $x_{ij}$ in Row 0 of the simplex method. 
To check your understanding of how these potentials might be calculated, determine
the pattern of alterations for a unit of flow added to routes (1, 3) and (2, 1), 
and determine the cost of the implied change in the objective function. 

Figure 7.5. Improvement Potentials for Nonbasic Routes - Initial Solution. 

Each cell shows $c_{ij}$ | improvement potential:

              Column
Row         1         2         3         4
 1        2 | ○     3 | 2    11 | 1     7 | -1
 2        1 | -4    0 | ○     6 | 1     1 | 0
 3        5 | ○     8 | ○    15 | ○     9 | ○

Note: The symbol ○ appears for current basic routes. 

Now you will see it is unnecessary to obtain the specific pattern of alterations 
for each nonbasic route in order to find its improvement potential. You can arrive 
quickly at the same result by evaluating the dual variables, and then by looking 
at the difference between the left- and right-hand sides of the dual constraints 
associated with the nonbasic routes. The network structure makes this process 
easy. A short-cut method that uses only an m X n table containing $c_{ij}$ (such as 
Fig. 7.5) will be described. But in order to clarify what is actually going on, the 
mathematical relations involved in the short-cut will be explained first. 

The system of linear equalities that yields trial values for the dual variables was 
shown in (1) above. Applied to the initial solution, Fig. 7.3, the six dual equalities 
for the basic routes are 

       $v_1 + w_1 = 2$     route (1, 1) 
       $v_2 + w_2 = 0$     route (2, 2) 
(3)    $v_3 + w_1 = 5$     route (3, 1) 
       $v_3 + w_2 = 8$     route (3, 2) 
       $v_3 + w_3 = 15$    route (3, 3) 
       $v_3 + w_4 = 9$     route (3, 4). 

Observe that in (3) there are six equations and seven unknowns. To obtain 
a solution, one of the variables must be selected and given an arbitrary value. 
You are at liberty to choose any variable and assign it any convenient value. 
Choose $v_3$ since it appears in four of the relations, and let $v_3 = 0$ for convenience. 
The reason (3) has an “extra” variable is the redundancy in the m + n = 7 
supply-and-demand restrictions. Giving an arbitrary value to $v_3$ is equivalent to 
dropping the Row 3 supply equation in order to remove the redundancy in the 
m + n relations. 

With $v_3 = 0$, notice how remarkably simple it is to solve for the remaining 
variables. You can proceed as follows: 

       $v_3 = 0$  (arbitrary) 
       $w_4 = 9 - v_3 = 9$ 
       $w_3 = 15 - v_3 = 15$ 
(4)    $w_2 = 8 - v_3 = 8$ 
       $w_1 = 5 - v_3 = 5$ 
       $v_2 = 0 - w_2 = 0 - (8 - v_3) = -8$ 
       $v_1 = 2 - w_1 = 2 - (5 - v_3) = -3$. 

Thus, by knowing only the values of the previously found variables, you can obtain 
each dual variable one by one. In a network technology, you will always be able to 
proceed in such a sequential fashion. 
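
The sequential calculation in (4) is easy to mechanize. The sketch below is only an illustration under stated assumptions: routes are 0-based (row, column) pairs, the function names `solve_duals` and `improvement_potentials` are hypothetical, and the dual variable fixed at zero is chosen by the caller. Each pass uses every basic route for which one of the two dual variables is already known.

```python
# Unit costs of the example (0-based indices: route (1, 1) is COSTS[0][0]).
COSTS = [[2, 3, 11, 7], [1, 0, 6, 1], [5, 8, 15, 9]]
# Basic routes of the initial solution in Fig. 7.3.
BASIS = [(0, 0), (1, 1), (2, 0), (2, 1), (2, 2), (2, 3)]


def solve_duals(costs, basis, fixed_row):
    """Solve v_i + w_j = c_ij on the basic routes with v[fixed_row] = 0."""
    v, w = [None] * len(costs), [None] * len(costs[0])
    v[fixed_row] = 0
    pending = set(basis)
    while pending:
        # Routes whose equation already has at least one known dual variable.
        usable = [(i, j) for (i, j) in pending
                  if v[i] is not None or w[j] is not None]
        if not usable:
            raise ValueError("the routes do not form a basis")
        for i, j in usable:
            if w[j] is None:
                w[j] = costs[i][j] - v[i]
            elif v[i] is None:
                v[i] = costs[i][j] - w[j]
            pending.discard((i, j))
    return v, w


def improvement_potentials(costs, basis, v, w):
    """Evaluate v_i + w_j - c_ij for every nonbasic route."""
    return {(i, j): v[i] + w[j] - costs[i][j]
            for i in range(len(costs))
            for j in range(len(costs[0]))
            if (i, j) not in basis}


v, w = solve_duals(COSTS, BASIS, fixed_row=2)   # v_3 = 0, as in the text
potentials = improvement_potentials(COSTS, BASIS, v, w)
print("v =", v, "w =", w)
for (i, j), value in sorted(potentials.items()):
    print(f"route ({i + 1}, {j + 1}): {value}")
```

With Row 3 fixed at zero the sketch returns the values in (4); the potentials it prints are the quantities $v_i + w_j - c_{ij}$ for the six nonbasic routes of the initial solution.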

The mathematical term triangularity is used to describe a structure that 
admits this sequential approach of finding solution values. You can appreciate 
the appropriateness of the term by looking at the structure of (3). When $v_3$ vanishes 
(that is, equals 0), the array has a triangular appearance. We say that a system 
of m linear equations in m unknowns has a triangular structure if, possibly
after rearranging the order of the variables and the equations, the variable 
that appears first in equation k is also the kth unknown, and it does not appear in 
any equation beyond the kth. 

The triangularity property holds for the system of dual equations (3) after you 
select any dual variable and assign it an arbitrary value. However, to see at a glance 
that the system is triangular, you would have to rearrange the sequence of variables 
and equations. For example, if you choose to let $v_2$ vanish, then the variables can 
be arranged in the sequence $w_2$, $v_3$, $w_3$, $w_4$, $w_1$, $v_1$, and the routes in the sequence 
(2, 2), (3, 2), (3, 3), (3, 4), (3, 1), and (1, 1) to exhibit the triangular structure. 
Write the system in this form to verify its triangular appearance. 

Also note in (4) that if $v_3$ is set at a value other than 0, then each $v_i$ increases 
and each $w_j$ decreases by this amount. Thus any sum $(v_i + w_j)$ remains unchanged, 
as does the test quantity $(v_i + w_j - c_{ij})$. 

You would suspect from your knowledge about duality that a similar triangularity
property also holds for the equations that determine values of the basic $x_{ij}$. 
To see this, write the conservation of flow constraints, omitting all nonbasic routes 
and dropping the redundant supply equation for Row 3 (corresponding to the 
selection of $v_3$): 

       $x_{11} = 6$              Row 1 
       $x_{22} = 1$              Row 2 
(5)    $x_{11} + x_{31} = 7$     Column 1 
       $x_{22} + x_{32} = 5$     Column 2 
       $x_{33} = 3$              Column 3 
       $x_{34} = 2$              Column 4. 

Notice that the row of coefficients in the kth equation of (5) is the column of 
coefficients for the kth variable in (3) (not counting $v_3$). Whereas (3) is upper 
triangular in shape, the system (5) is lower triangular and can be solved quickly 
starting with variable $x_{11}$. [How can you rearrange the variables and restrictions 
in (5) so that the system has the triangular property as defined above?] 

You are now ready to determine the improvement potential for each nonbasic 
route from the formula 

(6)  $v_i + w_j - c_{ij}$. 

Using the values for the dual variables that you found in (4), you can obtain 

       -3 +  8 -  3 =  2    route (1, 2) 
       -3 + 15 - 11 =  1    route (1, 3) 
(7)    -3 +  9 -  7 = -1    route (1, 4) 
       -8 +  5 -  1 = -4    route (2, 1) 
       -8 + 15 -  6 =  1    route (2, 3) 
       -8 +  9 -  1 =  0    route (2, 4). 


Each positive number signifies a possibility for reducing the value of the objective 
function. Thus three routes offer an improvement, and you should select route 
(1,2) because it is the one with the greatest potential. This is the same rule as 
Simplex Criterion I (Minimization). 

Step 3 involves the determination of how much flow to allocate to route (1, 2). 
The pattern of alterations of the basic variables in the initial solution was already 
established for this route in Fig. 7.4. Recall that the flows on routes (1, 1) and 
(3, 2) decrease, and these amounts are presently 6 and 4. Consequently, the largest 
possible increase of flow on route (1, 2) is 4, causing route (3, 2) to be dropped 
from the new basis. This calculation is the same as Simplex Criterion II. 

Step 4 performs the implied rerouting of flow, giving the second-trial basic solution
in Fig. 7.6, with an objective-function value of 104. 

Figure 7.6. Second Basic Solution. 

              Column
Row         1    2    3    4    Supply
 1          2    4    .    .       6
 2          .    1    .    .       1
 3          5    .    3    2      10
Demand      7    5    3    2

Total Cost = Previous Total Cost - Total Improvement = 112 - 2(4) = 104 


Iteration 2. Next return to Step 2 to check whether this solution is optimal 
or whether a further improvement is possible. The dual equalities for the new 
basis are 

       $v_2 + w_2 = 0$     route (2, 2) 
       $w_2 + v_1 = 3$     route (1, 2) 
(8)    $v_1 + w_1 = 2$     route (1, 1) 
       $w_1 + v_3 = 5$     route (3, 1) 
       $w_3 + v_3 = 15$    route (3, 3) 
       $w_4 + v_3 = 9$     route (3, 4). 

The variables and equations in (8) have been sequenced so that the triangularity 
of the system is apparent when you let $v_3 = 0$. Check that the solution values are 

       $v_3 = 0$ 
       $w_4 = 9$ 
       $w_3 = 15$ 
(9)    $w_1 = 5$ 
       $v_1 = 2 - w_1 = -3$ 
       $w_2 = 3 - v_1 = 6$ 
       $v_2 = 0 - w_2 = -6$, 

and the improvement potentials are 

       -3 + 15 - 11 =  1    route (1, 3) 
       -3 +  9 -  7 = -1    route (1, 4) 
(10)   -6 +  5 -  1 = -2    route (2, 1) 
       -6 + 15 -  6 =  3    route (2, 3) 
       -6 +  9 -  1 =  2    route (2, 4) 
        0 +  6 -  8 = -2    route (3, 2). 




Consequently, route (2, 3) enters the next solution. The corresponding pattern 
of alterations is shown in Fig. 7.7. Notice that all but one of the previous basic 
routes are altered. (It is possible to have a case in which every basic route is 
revised.) Flow decreases on routes (2, 2), (1, 1), and (3, 3). Route (2, 2) drops 
from the basis first, when the total change is one unit of flow. In this instance, 
therefore, the quantities in Fig. 7.7 also represent the flows for the third-trial 
basis. 

Figure 7.7. Pattern of Alterations for One Unit of Flow on Route (2, 3) - Third Solution. 

              Column
Row         1        2        3        4    Supply
 1        2 - 1    4 + 1      .        .       6
 2          .      1 - 1     + 1       .       1
 3        5 + 1      .      3 - 1      2      10
Demand      7        5        3        2

Total Cost = 104 - 1(3) = 101 


Iteration 3. A simplified procedure. Now that you have seen the 
method to develop the improvement potentials in Step 2 of the algorithm, it is no 
longer necessary to write the full dual equations for the basic variables. A short-cut
is to fill in an m X n table with the potentials. Start by drawing a table with the 
structure appearing in Fig. 7.8. Insert all the $c_{ij}$ in the small upper left boxes and 
the symbol ○ for each basic route in the current solution. Leave blank all the other 
table entries, including the $v_i$ and $w_j$. 

Next select any $v_i$ or $w_j$ and give it an arbitrary value. Suppose, as previously, 
you let $v_3 = 0$. Then enter the selected value at the edge of the corresponding row 
or column (Row 3 in the example). Look across the row or column for basic routes. 
In Row 3, the routes (3, 4), (3, 3), and (3, 1) qualify. The dual equation for each 
of these routes permits you to determine another dual variable. For the example, 
$w_4 = 9$, $w_3 = 15$, and $w_1 = 5$. Write these values along the edge of the table. 

Continue until all dual variables are evaluated. For the illustration, you can 
proceed in the sequence $v_2 = -9$, $v_1 = -3$, and $w_2 = 6$. 

Lastly, for each nonbasic route calculate $(v_i + w_j - c_{ij})$ and enter the result 
in the table. The final array appears as Fig. 7.8, where you can see that only route 
(1, 3) shows a potential improvement. 

Figure 7.8. Improvement Potentials for Nonbasic Routes - Third Solution. 

Each cell shows $c_{ij}$ | improvement potential:

              Column
Row         1         2         3         4        $v_i$
 1        2 | ○     3 | ○    11 | 1     7 | -1      -3
 2        1 | -5    0 | -3    6 | ○     1 | -1      -9
 3        5 | ○     8 | -2   15 | ○     9 | ○        0
$w_j$       5         6        15         9

To test your understanding of the steps, develop the pattern of alterations when 
route (1, 3) is introduced. Verify that the largest amount of possible flow on route 
(1, 3) is one unit, and the new flows are those in Fig. 7.9. 


Figure 7.9. Fourth and Optimal Basic Solution. 

              Column
Row         1    2    3    4    Supply
 1          .    5    1    .       6
 2          .    .    1    .       1
 3          7    .    1    2      10
Demand      7    5    3    2

Total Cost = 101 - 1(1) = 100 


Iteration 4. Perform the calculations to test for optimality of the solution, 
and check your answers with the results in Fig. 7.10. How close to optimal was the 
solution you guessed at the beginning of the illustration? 


Figure 7.10. Improvement Potentials for Optimal Solution. 

              Column
Row         1     2     3     4     $v_i$
 1         -1     ○     ○    -2      -4
 2         -5    -2     ○    -1      -9
 3          ○    -1     ○     ○       0
$w_j$       5     7    15     9
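
The four iterations above can be retraced end to end in a few dozen lines. What follows is a minimal sketch under stated assumptions, not a production routine: routes are 0-based (row, column) pairs, all function names are hypothetical, the dual variable of the last row is the one fixed at zero, the entering route is the one with the greatest positive potential, and no safeguard against cycling under degeneracy is included.

```python
COSTS = [[2, 3, 11, 7], [1, 0, 6, 1], [5, 8, 15, 9]]
# Initial basic solution of Fig. 7.3, keyed by 0-based (row, column).
START = {(0, 0): 6, (1, 1): 1, (2, 0): 1, (2, 1): 4, (2, 2): 3, (2, 3): 2}


def total_cost(costs, flows):
    return sum(costs[i][j] * x for (i, j), x in flows.items())


def solve_duals(costs, basis, fixed_row):
    """Solve v_i + w_j = c_ij on the basic routes with v[fixed_row] = 0."""
    v, w = [None] * len(costs), [None] * len(costs[0])
    v[fixed_row] = 0
    pending = set(basis)
    while pending:
        usable = [(i, j) for (i, j) in pending
                  if v[i] is not None or w[j] is not None]
        if not usable:
            raise ValueError("the routes do not form a basis")
        for i, j in usable:
            if w[j] is None:
                w[j] = costs[i][j] - v[i]
            elif v[i] is None:
                v[i] = costs[i][j] - w[j]
            pending.discard((i, j))
    return v, w


def alteration_pattern(basis, entering):
    """Return {route: +1 or -1} for one unit of flow on the entering route."""
    adj = {}
    for i, j in basis:
        adj.setdefault(("r", i), []).append(("c", j))
        adj.setdefault(("c", j), []).append(("r", i))

    def path(node, target, seen):
        if node == target:
            return [node]
        seen.add(node)
        for nxt in adj.get(node, []):
            if nxt not in seen:
                rest = path(nxt, target, seen)
                if rest:
                    return [node] + rest
        return None

    nodes = path(("r", entering[0]), ("c", entering[1]), set())
    pattern, sign = {entering: +1}, -1
    for a, b in zip(nodes, nodes[1:]):
        route = (a[1], b[1]) if a[0] == "r" else (b[1], a[1])
        pattern[route] = sign
        sign = -sign
    return pattern


def transportation_simplex(costs, start):
    m, n = len(costs), len(costs[0])
    flows = dict(start)
    history = [total_cost(costs, flows)]
    while True:
        # Step 2: evaluate v_i + w_j - c_ij for every nonbasic route.
        v, w = solve_duals(costs, flows, m - 1)
        best, entering = 0, None
        for i in range(m):
            for j in range(n):
                gain = v[i] + w[j] - costs[i][j]
                if (i, j) not in flows and gain > best:
                    best, entering = gain, (i, j)
        if entering is None:
            return flows, history
        # Step 3: the decreasing route with the smallest flow leaves the basis.
        pattern = alteration_pattern(flows, entering)
        leaving = min((r for r, s in pattern.items() if s < 0),
                      key=lambda r: flows[r])
        theta = flows[leaving]
        # Step 4: reroute theta units along the pattern of alterations.
        flows[entering] = 0
        for route, sign in pattern.items():
            flows[route] += sign * theta
        del flows[leaving]
        history.append(total_cost(costs, flows))


flows, history = transportation_simplex(COSTS, START)
print(history)
print(sorted(flows.items()))
```

Starting from the solution of Fig. 7.3, the recorded objective values are 112, 104, 101, and 100, and the final flows agree with Fig. 7.9.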


► If the original problem in fact is a transshipment model, then the resultant expanded 
transshipment tables have both a Row $i_k$ and Column $j_k$ for each transshipment Node k. 
Accordingly, there will be two dual variables $v_{i_k}$ and $w_{j_k}$. Since the buffer amount 
B is sufficiently large, each $x_{i_k j_k}$ is in every trial basis. Consequently $v_{i_k} + w_{j_k} = 0$, so 
that $v_{i_k}$ and $w_{j_k}$ will differ only in sign for these rows and columns. ◄


7.4 FURTHER COMMENTS ON THE SIMPLEX 
METHOD 

A few supplementary remarks are appropriate to conclude the description of the 
simplex method as applied to transportation problems. 

Although we have referred to the algorithm as “the simplex method applied 
to transportation problems” and have used terminology that is accordingly 
consistent, we really have not explicitly demonstrated the equivalence. We suggest 
that you take our word for it, unless you have a burning interest in this technical 
question. In that case, you can read the discussion in the advanced material below. 

Special structure. The text has mentioned repeatedly that transportation 
problems have a special structure to be exploited. The important proposition is 
the theorem below. 

Triangularity Theorem: After dropping any one of the equations in the 
redundant system of restrictions for the transportation problem, every basis 
has the triangularity property. Hence, since all the coefficients in the constraints 
are 1, no multiplication or division is needed to sequentially determine the values 
of any set of basic variables; only addition and subtraction are required. 

Thus when the simplex method is applied to a transportation problem, no division
or multiplication is required for Steps 3 and 4. The technique was explained 
in a way that recognized and consequently exploited this fact. 

Optimality. The following line of reasoning demonstrates that when 
the algorithm terminates, an optimal solution in fact has been obtained. 

First, observe that since a solution for a transportation model must allocate the 
entire supply $S_i$ among the routes in Row i, you can subtract the same constant from 
each $c_{ij}$ in that row without affecting which solutions are optimal. Similarly, you 
can diminish each $c_{ij}$ in Column j. In this subtraction process, let the final value 
of $v_i$ be the constant used for Row i and the final value of $w_j$ the constant for 
Column j. Then suppose you replace the original objective-function values $c_{ij}$ 
by the new values $[c_{ij} - (v_i + w_j)]$. As we have reasoned, the model with the 
original $c_{ij}$ and the model with the new coefficients have the very same optimal 
solutions. 

Second, recall that, at termination of the simplex algorithm, 

(1)  $0 \le c_{ij} - (v_i + w_j)$  for all (i, j) in network, 

since the quantities in (1) are merely the negative of the entries in the final improvement
potential table. Therefore, the minimum value of the new objective 
function cannot be smaller than 0, because the new objective-function coefficients 
are all nonnegative. 

Finally, notice that since equality holds in (1) for the basic routes, the terminating
solution uses only routes where the new cost coefficients are 0. Consequently 
no better solution can exist. 
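
The three observations can be checked numerically for the worked example. The sketch below is illustrative only; it takes the optimal flows of Fig. 7.9 and the dual values of Fig. 7.10 as given data, with 0-based (row, column) indices, and the variable names are hypothetical.

```python
COSTS = [[2, 3, 11, 7], [1, 0, 6, 1], [5, 8, 15, 9]]
SUPPLY = [6, 1, 10]
DEMAND = [7, 5, 3, 2]
# Optimal flows (Fig. 7.9) and final dual values (Fig. 7.10).
FLOWS = {(0, 1): 5, (0, 2): 1, (1, 2): 1, (2, 0): 7, (2, 2): 1, (2, 3): 2}
V = [-4, -9, 0]
W = [5, 7, 15, 9]

# New cost coefficients c_ij - (v_i + w_j): the negatives of the potentials.
reduced = [[COSTS[i][j] - (V[i] + W[j]) for j in range(len(W))]
           for i in range(len(V))]

# Every new coefficient is nonnegative, and it is zero on each route in use.
all_nonnegative = all(c >= 0 for row in reduced for c in row)
zero_on_basis = all(reduced[i][j] == 0 for (i, j) in FLOWS)

# The original cost equals the constant removed by the row and column
# subtractions, which is also the value of the dual objective (6).
primal = sum(COSTS[i][j] * x for (i, j), x in FLOWS.items())
dual = (sum(s * v for s, v in zip(SUPPLY, V))
        + sum(d * w for d, w in zip(DEMAND, W)))
print(reduced, all_nonnegative, zero_on_basis, primal, dual)
```

Both objective values equal 100, so the terminating solution attains the lower bound of 0 on the new objective function.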

If the improvement potential for a nonbasic route has zero value, then an alternative
optimal solution can be found by introducing the route into the basis according
to Steps 3 and 4. 

Integrality. Several times previously, we emphasized that if the transportation
model is further restricted by the condition that the variables must be 
integers, 

(2)  every $x_{ij} = 0, 1, 2, \ldots,$ 

the optimal value of the objective function is not increased. The following terse argument
demonstrates the assertion: the simplex algorithm always finds an optimal 
solution to a linear programming model, and when the technique is applied to 
a transportation problem, the method looks only at solutions satisfying (2). 

The underpinnings of the above argument are 

(i) There exists an optimal basic solution, since the transportation model has 
a finite optimal solution. 

(ii) Every basis is triangular. 

(iii) Each technological coefficient is either 1 or —1. 

The combined effect of these three points is that only additions and subtractions 
of integers occur in the sequential determination of each basic $x_{ij}$. 

Completeness. In Step 2, if more than one route shows the greatest potential 
improvement, you can select one arbitrarily. In Step 3, if the flow on more than one 
basic route becomes zero when the new route enters, then the new basis will be 
degenerate. Only one route should be dropped, and the others should remain in 
the new basis at zero level. For practical purposes, a particular “tie-breaking” 
rule to select the route that drops is of no consequence. However, as you already 
saw in Chap. 4, the degeneracy condition must be dealt with to prove that the 
method always terminates in a finite number of trials. 

Domain of applicability. The method applies to all network problems 
that can be converted to a transportation model format. Notice that the $c_{ij}$ do 
not have to be nonnegative to perform the steps. 

► Convergence. By the very nature of the transportation problem, an optimal 
solution must have a finite objective-function value. If the trial solutions make a strict 
improvement at each iteration, then the method must converge, since the objective 
function is reduced by at least one unit each time. But when degeneracy occurs, the 
objective function can stall for several iterations, and the possibility of cycling arises. 

It is always possible to rescale the units and slightly perturb the supplies and demands, 
so that degeneracy never arises during the calculations. The following scheme can be 
shown to achieve this result. Let the new available supplies be $nS_i$, for i = 
1, 2, ..., m - 1, and $nS_m + n$, and the new requirements be $nD_j + 1$, for j = 
1, 2, ..., n. Then the objective function will improve at every iteration. The final 
answer is scaled back to the original units by dividing each $x_{ij}$ by n and rounding so as 
to preserve the supply and demand restrictions, which always can be done. Since the 
rounded answer is feasible and uses the same basic routes, it remains optimal. 
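
The effect of the rescaling can be examined on the example. The sketch below is an illustration under stated assumptions: supplies and demands are positive, and the function names `perturb` and `partial_totals_coincide` are hypothetical. It relies on one fact: a basic flow can be zero only if the supplies of some proper group of rows total exactly the demands of some proper group of columns. The check simply enumerates such groups.

```python
from itertools import combinations


def perturb(supply, demand):
    """Rescale by n and perturb: nS_i (plus n in the last row), nD_j + 1."""
    n = len(demand)
    new_supply = [n * s for s in supply]
    new_supply[-1] += n          # the last row receives the extra n units
    new_demand = [n * d + 1 for d in demand]
    return new_supply, new_demand


def partial_totals_coincide(supply, demand):
    """True if some rows and some columns (not all of both) have equal totals."""
    m, n = len(supply), len(demand)
    for a in range(1, m + 1):
        for rows in combinations(range(m), a):
            for b in range(1, n + 1):
                if a == m and b == n:
                    continue     # total supply = total demand is not a defect
                for cols in combinations(range(n), b):
                    if (sum(supply[i] for i in rows)
                            == sum(demand[j] for j in cols)):
                        return True
    return False


SUPPLY, DEMAND = [6, 1, 10], [7, 5, 3, 2]
new_supply, new_demand = perturb(SUPPLY, DEMAND)
print(new_supply, new_demand)
print(partial_totals_coincide(SUPPLY, DEMAND))          # Rows 1 and 2 total 7 = D_1
print(partial_totals_coincide(new_supply, new_demand))
```

In the perturbed problem every group of rows totals a multiple of n, while any group of fewer than n columns does not, which is why no partial totals can coincide.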

Relation to the standard simplex method. Figure 7.11 depicts a 
simplex tableau tracing the iterations. It employs the condensed format introduced in 
Fig. 4.11 on p. 120. The redundancy has been removed. 

Notice at every iteration that each coefficient in Rows 1 through 6 is either +1 or -1, 
a condition that arises because of the network structure. The pattern of coefficients in 
each column corresponds to the pattern of alterations occurring when the nonbasic 
route is introduced. You can see that pivoting requires only additions and subtractions 
for this structure. 

Figure 7.11. Simplex Tableau. 

Capacitated transportation problem. When capacity constraints $x_{ij} \le u_{ij}$ 
are also imposed, the upper-bounded-variables technique of Sec. 5.10 is 
easily specialized to a network structure. Assume each $u_{ij}$ is an integer. 

In Step 3, suppose flow on a nonbasic route is to be increased. Then to determine the 
largest amount of flow possible, take into consideration the capacity constraints on the 
new route and those current basic routes with increased flow. If any one of these bounds 
is reached before the flow on some other basic route becomes zero, then put the associated 
flow at its capacity level and let it be the variable left out of the basis in the next trial 
solution. 

In Step 2, also consider each nonbasic route with flow at its capacity level. If the 
improvement potential is negative, then the route is a candidate for reduced flow. If it is 
selected, the procedure in Step 3 is reversed to reflect a diminution of flow. A trial solution
is optimal when every nonbasic route is at zero flow if it has a nonpositive improvement
potential and at capacity flow if it has a positive improvement potential. 

The bounded-variables network algorithm yields optimal values for $x_{ij}$ that are also 
integers. We caution that the values of $u_{ij}$ may be such that there is no feasible solution, 
even though the total supply equals total demand. Further, some care must be exercised 
in selecting an initial trial solution, in order that it contain a basic set of routes with 
flow not exceeding the stated capacity constraints. This is not really hard to do, but the 
details go beyond the intended coverage of this text. 

Initial basic solution. In selecting a starting solution for Step 1, you must 
be careful to pick a basis, that is, m + n - 1 routes with flows uniquely determined by 
the $S_i$ and $D_j$. It is usually worthwhile to perform a preliminary analysis of the $c_{ij}$ to 
assist in selecting a good initial basis. 

In the discussion of optimality, you learned that a constant may be subtracted from 
every $c_{ij}$ in a row or in a column without affecting the selection of an optimal solution. 
Furthermore, when the optimal dual variables $v_i$ and $w_j$ are the constants for Row i 
and Column j, the revised costs are nonnegative and flow occurs only on zero-cost 
routes. These observations motivate the suggestion: 

(i) Subtract the smallest Cij in Row i from every cost coefficient in that row so as 
to form a new array. 

(ii) Subtract the smallest of the numbers in Column j of the new array from every 
other entry in that column so as to yield a relative cost table. 

(iii) Select an initial basis with reference to the relative costs. 
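
Instructions (i) and (ii) amount to two passes over the cost table. A minimal sketch follows, applied to the unit costs of the example; the function name `relative_costs` is hypothetical.

```python
# Unit costs of the example (one list per row).
COSTS = [[2, 3, 11, 7], [1, 0, 6, 1], [5, 8, 15, 9]]


def relative_costs(costs):
    """Row reduction followed by column reduction of a cost table."""
    # (i) Subtract the smallest entry of each row from that row.
    row_reduced = [[c - min(row) for c in row] for row in costs]
    # (ii) Subtract the smallest entry of each column of the new array.
    col_min = [min(col) for col in zip(*row_reduced)]
    return [[c - col_min[j] for j, c in enumerate(row)] for row in row_reduced]


relative = relative_costs(COSTS)
for row in relative:
    print(row)
```

Every row and every column of the resulting table contains at least one zero, and these zero-cost routes are the natural first candidates for instruction (iii).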

The procedure is illustrated for the example in Figs. 7.12 and 7.13. To ensure that you 
pick a basis, follow the procedure: 

(i) Select route (i, j) from those permitted and let the flow $x_{ij}$ be the smaller of the 
amounts available, $a_i \ge 0$, and required, $r_j \ge 0$. For the first route chosen, 
$a_i = S_i$ and $r_j = D_j$. 

(ii) If $a_i < r_j$, check off Row i and permit no more routes to be selected in Row i. 
Decrease by $x_{ij}$ the amount required in Column j, and let $r_j$ now be this new 
amount. Return to Instruction (i). 

(iii) If $a_i > r_j$, check off Column j and permit no more routes to be selected in 
Column j. Decrease by $x_{ij}$ the amount available in Row i, and let $a_i$ now be this 
new amount. Return to Instruction (i). 

(iv) If $a_i = r_j$, stop if $m + n - 1$ routes have been selected. Otherwise, perform either 
Instruction (ii) or (iii), but not both. However, if only Row i remains, perform 
Instruction (iii), or if only Column j remains, perform Instruction (ii). The initial 
solution will be degenerate. 

Figure 7.12. Row Reduction Calculation. 

Figure 7.13. Column Reduction Calculation and Relative Costs. 
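
The four instructions can be sketched as a short Python routine. This is a minimal sketch, assuming total supply equals total demand; the function name `initial_basis` and the `order` argument (the caller's preferred sequence of routes, for instance sorted by relative cost) are assumptions made for illustration. Here `a[i]` is the amount still available in Row i and `r[j]` the amount still required in Column j. The data in the usage lines are the supplies, demands, and selection sequence as they can be read from the calculation in Fig. 7.14, with rows and columns indexed from 0.

```python
def initial_basis(supply, demand, order):
    """Select m + n - 1 basic routes following Instructions (i)-(iv)."""
    m, n = len(supply), len(demand)
    a = list(supply)   # amounts still available in each row
    r = list(demand)   # amounts still required in each column
    rows, cols = set(range(m)), set(range(n))
    basis = {}
    while len(basis) < m + n - 1:
        # (i) Take the first route in `order` whose row and column are still permitted.
        i, j = next((i, j) for (i, j) in order if i in rows and j in cols)
        x = min(a[i], r[j])
        basis[(i, j)] = x
        if len(basis) == m + n - 1:
            break                      # (iv) stop once m + n - 1 routes are chosen
        if a[i] < r[j]:
            check_row = True           # (ii)
        elif a[i] > r[j]:
            check_row = False          # (iii)
        else:
            # (iv) Tie: check off only one line; if Row i is the last row
            # remaining, the column must be the one checked off.
            check_row = len(rows) > 1
        if check_row:
            rows.discard(i)
            r[j] -= x
        else:
            cols.discard(j)
            a[i] -= x
    return basis


# Supplies (7, 2, 3), demands (1, 6, 2, 3), and the selection sequence of Fig. 7.14.
order = [(0, 0), (0, 1), (1, 2), (2, 1), (2, 3), (2, 2)]
print(initial_basis([7, 2, 3], [1, 6, 2, 3], order))
```

Two of the six routes carry zero flow, which is exactly the degeneracy that Instruction (iv) warns about.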

The procedure is illustrated in Figs. 7.14 and 7.15 by an example having a degenerate 
initial solution. To check your understanding, perform the operations indicated in Fig. 
7.14 on the table in Fig. 7.15. 


Instruction    Calculation 

i              Pick $x_{11} = r_1 = 1$ 
iii            Check Column 1. Set $a_1 = 7 - 1 = 6$ 
i              Pick $x_{12} = a_1 = r_2 = 6$ 
iv(ii)         Check Row 1. Set $r_2 = 6 - 6 = 0$ 
i              Pick $x_{23} = a_2 = r_3 = 2$ 
iv(ii)         Check Row 2. Set $r_3 = 2 - 2 = 0$ 
i              Pick $x_{32} = r_2 = 0$ 
iii            Check Column 2. Set $a_3 = 3 - 0 = 3$ 
i              Pick $x_{34} = a_3 = r_4 = 3$ 
iv(iii)        Check Column 4. Set $a_3 = 3 - 3 = 0$ 
i              Pick $x_{33} = a_3 = 0$ 
iv             Stop ($m + n - 1 = 3 + 4 - 1 = 6$ routes) 

Figure 7.14. Selection of an Initial Basis. 

Figure 7.15. Initial Basis. 


General network. The two important theoretical properties of the triangularity 
of all basic solutions and the existence of an optimal solution that is also 
integer-valued are valid for a fairly general network model: 

(i) minimize $\sum_{(i,j)\ \text{in network}} c_{ij} x_{ij}$ 

subject to 

(ii) $\sum_{(k,j)\ \text{in network}} x_{kj} - \sum_{(i,k)\ \text{in network}} x_{ik} = T_k$ for $k = 1, 2, \ldots, p$ (node balance equations) 

(iii) $x_{ij} \ge 0$ for all (i, j) in network, 

where 

(iv) $\sum_{k=1}^{p} T_k = 0$ and each $T_k$ is integer-valued. 

Given (iv), there is a redundancy in restrictions (ii), but by dropping any one of the 
balance equations in (ii) you obtain a set of $p - 1$ independent restrictions. It can be 
shown that every basis for such a set of $p - 1$ restrictions has the triangularity property. 
Hence, since all the coefficients in (ii) are either $+1$ or $-1$, the corresponding solution 
is integer-valued. 

The associated dual problem is 

(v) maximize $\sum_{k=1}^{p} T_k y_k$ 

subject to 

(vi) $y_i - y_j \le c_{ij}$ for all (i, j) in network (arc constraints), 

where each $y_k$ is unrestricted in sign. Again because of triangularity, given any basic 
solution, the associated values for the dual variables can be found sequentially (after 
selecting a variable arbitrarily and assigning it a value). 

Finally, we point out that given $p - 1$ independent restrictions from (ii), a set of 
$p - 1$ basic variables corresponds to a tree (Sec. 6.8) in the network representation of (ii). 
The simplex method applied to this network can be interpreted as follows. Select a 
beneficial arc to add to a trial solution, which destroys the associated tree structure by 
creating a loop. Subsequently, allocate as much flow as possible to the new arc by 
rerouting flow on the other arcs of the loop. This maximal allocation in turn causes one 
of the other arcs on the loop to drop, thereby restoring the tree structure. ◄ 
7.5 SENSITIVITY TESTING 

You found in Chap. 5 that the notion of duality is central in examining how 
a change in model formulation affects an optimal solution. We review a few of 
these ideas here in the context of the transportation problem. 

Objective function. Given the values for $v_i$ and $w_j$ at the final iteration, 
the test quantities $(v_i + w_j - c_{ij})$ are all nonpositive. This implies that each $c_{ij}$ 
can be as small as $(v_i + w_j)$ and the current solution remains optimal. In the 
numerical example of Sec. 7.3, we had at the final iteration that $v_1 = -4$ and 
$w_4 = 9$, as you can see in Fig. 7.10. Therefore the coefficient $c_{14}$, which currently 
is 7, can be as small as 5 $(= -4 + 9)$, and the present solution remains optimal; 
in other words, the test quantity $-2$ $(= -4 + 9 - 7)$ represents the largest 
decrease in $c_{14}$ that leaves the current solution optimal. 

Supplies and demands. Assume you have a nondegenerate optimal 
solution, that is, assume every basic route has at least one unit of flow. Suppose 
$S_p$ is increased by a unit, so that an oversupply is created. It can be shown that 
the new optimal allocation will assign the excess to that source k that has maximum 
$v_k$, and that $[v_p - \text{maximum}\,(v_k)] \le 0$ will be added to the objective function. 

For example, in Fig. 7.10, maximum $(v_k) = v_3 = 0$. So if $S_1 = 7$, then the flow 
on route (1, 3) will increase by a unit and the flow on route (3, 3) will decrease by 
a unit, leaving the oversupply in Row 3. The net change in cost is $-4$ $(= 11 
- 15 = v_1)$, assuming the oversupply is costless. Verify that if, instead, $S_2 = 2$, 
the flows can be altered so that the net change in cost is $v_2 = -9$. 

Still assuming the optimal solution is nondegenerate, suppose next that there 
is a unit increase in both $S_i$ and $D_j$. It can be shown that $v_i + w_j$ will be added to 
the objective function, as would be suggested by the dual objective function. 

For example, suppose $S_2 = 2$ and $D_1 = 8$. Then the objective function becomes 
$[100 + (-9 + 5) = 96]$. At first it seems surprising that adding one more unit 
of flow to a transportation schedule can actually reduce total cost; but this happens 
because the cost of a unit increase in flow on routes (3, 1) and (2, 3) is offset by 
the cost of a unit decrease in flow on route (3, 3). The net change in cost is 
$-4$ $(= 5 + 6 - 15)$. The pattern of alterations for the basic variables is established 
by considering what would happen if a unit of flow were to be removed 
from route (2, 1). Verify that if, instead, $S_1 = 7$ and $D_4 = 3$, the flows can be altered 
so that 5 $(= v_1 + w_4)$ is the net increase in the objective function. 
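
The arithmetic of this section can be checked with a few lines of Python. This is only a sketch: the dictionaries hold just the dual values quoted in the text for Fig. 7.10 ($v_1 = -4$, $v_2 = -9$, $v_3 = 0$, $w_1 = 5$, $w_4 = 9$), the remaining column duals are omitted, and the function names are hypothetical.

```python
# Dual values quoted in the text for the final iteration (Fig. 7.10).
v = {1: -4, 2: -9, 3: 0}   # row (supply) duals
w = {1: 5, 4: 9}           # only the column (demand) duals quoted in the text
c14 = 7                    # current cost on route (1, 4)
optimal_cost = 100         # objective value of the current optimal solution


def smallest_cost(i, j):
    """Smallest c_ij on a nonbasic route that keeps the solution optimal."""
    return v[i] + w[j]


def change_extra_supply(p):
    """Change in the objective when S_p alone rises by one unit (costless oversupply)."""
    return v[p] - max(v.values())


def change_extra_supply_and_demand(i, j):
    """Change in the objective when S_i and D_j both rise by one unit."""
    return v[i] + w[j]


print(smallest_cost(1, 4))                       # lower limit for c_14
print(smallest_cost(1, 4) - c14)                 # test quantity for route (1, 4)
print(change_extra_supply(1))                    # S_1 = 7
print(optimal_cost + change_extra_supply_and_demand(2, 1))   # S_2 = 2, D_1 = 8
```

These shortcuts are valid only under the assumption stated in the text, namely a nondegenerate optimal solution and a unit change.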

7.6 SHORTEST ROUTE FOR A GENERAL NETWORK 

In Sec. 6.5 you saw several examples of optimization models that proved 
equivalent to finding a shortest route in a network. Review the general discussion 
on pp. 177-180 so that you recall the context and assumptions of the problem. 
Here you will study a method of solution which, like the simplex algorithm, relies 
heavily on duality. 

Suppose a network contains several routes from the Source Node s to the Sink 
Node r, and you want to obtain a shortest one. The mathematical model for the 
problem was given in Sec. 6.5. What we need now is a statement of the dual problem: 

(1) maximize $-y_r + y_s$ 

subject to 

(2) $y_i - y_j \le c_{ij}$ for all (i, j) in network, 

where each $y_k$ is unrestricted in sign. 


Method. The algorithm proceeds to solve for the dual variables: 

(i) Start by letting $y_r = 0$, and all other $y_k = \infty$. 

(ii) If there continues to be any arc (i, j) such that $y_i > c_{ij} + y_j$, then change the 
corresponding value of $y_i$ to $c_{ij} + y_j$. Otherwise stop. 

Thus the values of $y_k$ are successively decreased until (2) holds for all arcs. The 
technique will converge in a finite number of calculations, provided that the sum 
of $c_{ij}$ around every loop of the network is nonnegative. [The speed of convergence 
can be accelerated by using a systematic approach in applying Instruction (ii). But 
this topic is too technical to be discussed here.] 

The method actually obtains a shortest route to the Terminal Node r from 
every other node. Specifically, to find the path for any Node s, determine the arc 
(s, t) for which $y_s - y_t = c_{st}$. The algorithm guarantees there is at least one such 
arc. Similarly, at Node t find the arc (t, u) such that $y_t - y_u = c_{tu}$. Continue in 
the same fashion to trace a route that eventually leads to Node r. Notice that the 
value of $y_k$ is the length of a shortest route from Node k to Node r. Although each 
$y_k$ is unique, there can be more than one shortest path from Node k. 
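
A minimal Python sketch of Instructions (i) and (ii), together with the route-tracing rule, is given here. The function names are hypothetical, and the arc list contains only the ten arcs that can be read from the numerical example worked in this section, so it is not necessarily the complete network of Fig. 7.16. The tracing routine additionally assumes every node can reach the terminal and every arc length is positive.

```python
import math


def shortest_to_terminal(arcs, nodes, r):
    """Lower each y_i until y_i <= c_ij + y_j holds for every arc (i, j)."""
    y = {k: math.inf for k in nodes}   # (i) all y_k start at infinity ...
    y[r] = 0                           # ... except the terminal
    changed = True
    while changed:                     # (ii) repeat while some arc violates (2)
        changed = False
        for (i, j), c in arcs.items():
            if y[i] > c + y[j]:
                y[i] = c + y[j]
                changed = True
    return y


def trace_route(arcs, y, s, r):
    """Follow arcs (i, j) with y_i - y_j = c_ij from Node s to Node r."""
    route = [s]
    while route[-1] != r:
        i = route[-1]
        route.append(next(j for (a, j), c in arcs.items()
                          if a == i and y[i] == c + y[j]))
    return route


# Arc lengths c_ij read from the worked example (keys are (i, j)).
arcs = {(4, 1): 2, (3, 4): 1, (6, 4): 5, (5, 4): 1, (6, 5): 3,
        (8, 5): 6, (2, 3): 3, (7, 3): 5, (7, 2): 1, (8, 7): 1}
y = shortest_to_terminal(arcs, range(1, 9), 1)
print(y)
print(trace_route(arcs, y, 8, 1))
```

The loop simply sweeps all arcs until nothing changes, which is one of the many possible orders for applying Instruction (ii); it terminates under the same condition as in the text, namely that no loop has negative total length.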


Example. The technique is illustrated for the example in Fig. 7.16, which 
is the same network structure as Fig. 6.8. You seek the shortest paths to Node 1 
from every other node. The procedure is best followed by making the calculations 
directly on the network diagram. Copy Fig. 7.16 on a piece of paper, or write 
lightly with pencil on the page. 

Start by putting a 0 adjacent to Node 1 and the symbol $\infty$ next to all the other 
nodes. There are many ways to proceed; here is one: 

(3) 
$c_{41} + y_1 = 2 + 0 < \infty = y_4$, therefore let $y_4 = 2$ 
$c_{34} + y_4 = 1 + 2 < \infty = y_3$, therefore let $y_3 = 3$ 
$c_{64} + y_4 = 5 + 2 < \infty = y_6$, therefore let $y_6 = 7$ 
$c_{54} + y_4 = 1 + 2 < \infty = y_5$, therefore let $y_5 = 3$ 
$c_{65} + y_5 = 3 + 3 < 7 = y_6$, therefore let $y_6 = 6$ 
$c_{85} + y_5 = 6 + 3 < \infty = y_8$, therefore let $y_8 = 9$ 
$c_{23} + y_3 = 3 + 3 < \infty = y_2$, therefore let $y_2 = 6$ 
$c_{73} + y_3 = 5 + 3 < \infty = y_7$, therefore let $y_7 = 8$ 
$c_{72} + y_2 = 1 + 6 < 8 = y_7$, therefore let $y_7 = 7$ 
$c_{87} + y_7 = 1 + 7 < 9 = y_8$, therefore let $y_8 = 8$. 

As soon as the new value is calculated in (3) for each $y_k$, be sure to put it next to 
Node k in your diagram. At this point 

(4) 
$y_1 = 0 \quad y_2 = 6 \quad y_3 = 3 \quad y_4 = 2$ 
$y_5 = 3 \quad y_6 = 6 \quad y_7 = 7 \quad y_8 = 8$. 

Check that the values in (4) now satisfy (2) for all the arcs in the figure, so the 
calculations terminate. In Fig. 7.17, the values for $y_k$ appear next to each node, 
and the arcs in the shortest paths have been drawn with heavier lines. 

► If every optimal path is unique, then the set of all arcs on the shortest paths forms 
a tree, defined in Sec. 6.8. 

A proof that the final $y_s$ is indeed the length of a shortest path from Node s to Node r 
is practically immediate. By construction, all the dual constraints (2) have been satisfied. 
The associated value of the dual objective function is 

(i) $y_r T_r + y_s T_s = 0(-1) + y_s(1) = y_s$, 

which is the value of the length of an attainable path from Node s to Node r. Therefore, 
feasible solutions to both the primal and dual problems have been found. Since the two 
solutions have the same objective-function value, the path must be optimal. 

Figure 7.17. Shortest Routes to Node 1. 

A succinct mathematical statement of the conditions the $y_i$ must satisfy is 

(ii) $y_i = \min_{(i,j)\ \text{in network}} (c_{ij} + y_j)$ for all $i \ne r$, 

where $y_r = 0$. 

Sometimes you may want to calculate the shortest routes between every pair of nodes. 
The following approach is an efficient way of doing so. Define 

(iii) $c_{ij}$ = length of arc (i, j) if (i, j) is in network, 
      $c_{ij} = \infty$ if (i, j) is not in network and $i \ne j$, 
      $c_{ij} = 0$ if $i = j$. 

Consider the network comprised of only Nodes 1, 2, ..., $q - 1$ $(< p)$ and for this 
abbreviated network, let 

(iv) $d_{ij}$ = shortest distance between Nodes i and j, when 
only Nodes 1, 2, ..., $q - 1$ are in the network. 

Similarly, let 

(v) $d^*_{ij}$ = shortest distance between Nodes i and j, when 
Nodes 1, 2, ..., q are in the network. 

Then 

(vi) $d^*_{qj} = \min_{k = 1, \ldots, q-1} (c_{qk} + d_{kj})$ for $j = 1, 2, \ldots, q - 1$ 

(vii) $d^*_{iq} = \min_{k = 1, \ldots, q-1} (d_{ik} + c_{kq})$ for $i = 1, 2, \ldots, q - 1$ 

(viii) $d^*_{qq} = 0$ 

(ix) $d^*_{ij} = \min (d_{ij},\ d^*_{iq} + d^*_{qj})$ for $i, j = 1, 2, \ldots, q - 1$. 

Therefore, starting with $q = 2$, and proceeding to successively larger values of q 
up to $q = p$, you apply, in turn, (vi) through (ix). (For $q = 2$, $d^*_{ij} = c_{ij}$ for $i, j = 1, 2$.) 
The algorithm involves a total of $p(p - 1)(p - 2)$ additions. ◄ 
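
The node-by-node construction of all shortest distances can be sketched in Python as follows. This is a sketch under stated assumptions: the function name is hypothetical, nodes are indexed from 0 rather than 1, the three-node cost table is invented for illustration, and the distance from the newly added node to itself is taken to be 0, which presumes that no loop has negative total length.

```python
import math


def all_pairs_shortest(c):
    """Shortest distances between every pair of nodes, adding one node at a time.

    c[i][j] is the arc length, math.inf if the arc is absent, and 0 if i == j.
    """
    p = len(c)
    d = [[0]]                                   # network containing only the first node
    for q in range(1, p):                       # bring in node q
        new = [[math.inf] * (q + 1) for _ in range(q + 1)]
        for j in range(q):                      # from the new node to each old node
            new[q][j] = min(c[q][k] + d[k][j] for k in range(q))
        for i in range(q):                      # from each old node to the new node
            new[i][q] = min(d[i][k] + c[k][q] for k in range(q))
        new[q][q] = 0                           # assumed: no negative loops
        for i in range(q):                      # old pairs may now route via node q
            for j in range(q):
                new[i][j] = min(d[i][j], new[i][q] + new[q][j])
        d = new
    return d


INF = math.inf
# Hypothetical network: arcs 0->1 (length 4), 1->2 (length 1), 2->0 (length 2).
c = [[0, 4, INF],
     [INF, 0, 1],
     [2, INF, 0]]
for row in all_pairs_shortest(c):
    print(row)
```

Each pass touches only the distances involving the nodes admitted so far, which is where the modest operation count comes from.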

7.7 SHORTEST ROUTE IN AN ACYCLIC NETWORK 

The shortest-route algorithm can be made even simpler when the network is 
acyclic, for then the dual variables can be determined recursively, that is, 
sequentially. 

As a preliminary step, number the nodes from 1 to p such that if the network 
contains an arc (i, j), then $i > j$. To do this, designate the sink or terminal as 
Node 1, which has only inward-pointing arcs. Check off this node and all its arcs, 
and consider them no further in the labeling process. Look for any other node that 
has only inward-pointing arcs. Designate it as Node 2. Check off this node and all 
its arcs, and consider them no further in the labeling process. Continue in the 
same manner until all nodes have been numbered. The $y_i$ will be determined in 
the same order as the nodes are numbered. 
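
The labeling process can be sketched in Python as shown here. The function name and the four-node network with lettered nodes are hypothetical; when several nodes qualify at the same step, the sketch simply takes the first in sorted order, which is one of several valid choices.

```python
def number_nodes(nodes, arcs, terminal):
    """Number nodes 1, 2, ... so that every arc (i, j) runs from a higher to a lower number."""
    remaining = set(nodes)
    live = set(arcs)                   # arcs not yet checked off
    label = {}
    while remaining:
        # Nodes whose unchecked arcs all point inward (no unchecked outgoing arc).
        free = sorted(k for k in remaining if all(i != k for (i, j) in live))
        if not free:
            raise ValueError("network contains a loop")
        k = terminal if not label else free[0]
        if k not in free:
            raise ValueError("terminal has an outward-pointing arc")
        label[k] = len(label) + 1
        remaining.discard(k)
        # Check off the node together with all its arcs.
        live = {(i, j) for (i, j) in live if i != k and j != k}
    return label


# Hypothetical acyclic network; each pair (i, j) is an arc from i to j.
arcs = [("a", "t"), ("b", "a"), ("b", "t"), ("c", "b")]
print(number_nodes("tabc", arcs, "t"))
```

If the loop ever finds no qualifying node, the network is not acyclic and the recursive method of this section does not apply.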

The algorithm is 

(1) $y_1 = 0$ (terminal) 

(2) $y_i = \min_{(i,j)\ \text{in network}} (c_{ij} + y_j)$ for $i = 2, 3, \ldots, p$. 

To see how the recursion (2) works, consider the network in Fig. 7.18, where 
the $c_{ij}$ are indicated next to each arc. (Carry out the node-numbering process to 
verify that the labels have been correctly assigned.) The algorithm proceeds as 
follows: 

(3) 
$y_1 = 0$ 
$y_2 = \text{minimum}\,(c_{21} + y_1) = (1 + 0) = 1$ 
$y_3 = \text{minimum}\,(c_{31} + y_1,\ c_{32} + y_2) = \text{minimum}\,(4 + 0,\ 1 + 1) = 2$ 
$y_4 = \text{minimum}\,(c_{41} + y_1,\ c_{43} + y_3) = \text{minimum}\,(4 + 0,\ 1 + 2) = 3$ 
$y_5 = \text{minimum}\,(c_{51} + y_1,\ c_{54} + y_4) = \text{minimum}\,(6 + 0,\ 2 + 3) = 5$ 
$y_6 = \text{minimum}\,(c_{64} + y_4,\ c_{65} + y_5) = \text{minimum}\,(3 + 3,\ 2 + 5) = 6$ 
$y_7 = \text{minimum}\,(c_{72} + y_2,\ c_{73} + y_3,\ c_{76} + y_6) = \text{minimum}\,(8 + 1,\ 6 + 2,\ 1 + 6) = 7$ 
$y_8 = \text{minimum}\,(c_{85} + y_5,\ c_{86} + y_6,\ c_{87} + y_7) = \text{minimum}\,(6 + 5,\ 4 + 6,\ 1 + 7) = 8$. 

As in the preceding section, the calculations may be done on the diagram 
itself. The routes are determined as before, and the solution here is shown in 
Fig. 7.19. 
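
The recursion for an acyclic network is short enough to sketch directly in Python. The function name is hypothetical, and the arc lengths are the ones that can be read from the numerical calculation for Fig. 7.18 in this section; the nodes are assumed to be numbered already so that every arc (i, j) has $i > j$.

```python
def acyclic_shortest(p, arcs):
    """Shortest distance y_i from each node i to Node 1, in order i = 2, ..., p."""
    outgoing = {}
    for (i, j), c in arcs.items():
        outgoing.setdefault(i, []).append((j, c))
    y = {1: 0}                                   # terminal
    for i in range(2, p + 1):
        # Every arc out of Node i leads to a lower-numbered node,
        # so each y_j needed here has already been computed.
        y[i] = min(c + y[j] for j, c in outgoing[i])
    return y


# Arc lengths c_ij read from the calculation for Fig. 7.18 (keys are (i, j)).
arcs = {(2, 1): 1, (3, 1): 4, (3, 2): 1, (4, 1): 4, (4, 3): 1,
        (5, 1): 6, (5, 4): 2, (6, 4): 3, (6, 5): 2,
        (7, 2): 8, (7, 3): 6, (7, 6): 1,
        (8, 5): 6, (8, 6): 4, (8, 7): 1}
print(acyclic_shortest(8, arcs))
```

Unlike the method for a general network, each $y_i$ is computed exactly once, so no repeated sweeps over the arcs are needed.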



REVIEW EXERCISES 


1 Write the full equations for a standard transportation model with m = 2 and n = 3, 
where total supply equals total demand [use the notation in (2), (3), and (5) of 
Sec. 7.2]. Show that if you add the supply constraints in (2) for i = 1, 2, and from 
this sum subtract the demand constraints in (3) for j = 1, 2, you obtain the demand 
constraint for j = 3. Indicate where you use the assumption that total supply equals 
total demand. 
2 Write the full equations for a standard transportation model with m = 4 and n = 3. 
Also write the full corresponding dual linear programming problem; use the notation 
in (6) and (7) of Sec. 7.2. 

*3 Given the primal model for the transportation problem stated as (1) through (5) in 
Sec. 7.2, verify that the associated dual problem is as stated in (6) and (7). 

4 Arrange the dual variables and dual equalities (3) in Sec. 7.3 to display at a glance 
the triangularity property when you set equal to zero the dual variable indicated in 
each part below. 

(a) 

(b) 

(c) ^ 2 . 

5 (a) Arrange the shipment variables and equalities (5) in Sec. 7.3 to display at a glance 

the upper triangularity property. 

(b) Do the same where the equality for Row 3 is made explicit and the equality for 
Column 4 is dropped as the redundant relation. 

(c) Do the same where the equality for Row 3 is made explicit and the equality for 
Column 1 is dropped as the redundant relation. 

(d) Do the same where the equality in Row 3 is made explicit and the equality for 
Row 2 is dropped as the redundant relation. 

6 In each part below, find the pattern of alterations, like that in Fig. 7.4, when the 
indicated nonbasic route is selected in Fig. 7.3. Also calculate the associated improve¬ 
ment potential and verify your answer in Fig. 7.5. Finally, show the values for the 
new basic solution if the route indicated is selected. 

(a) Route (1,3). (d) Route (2, 3). 

(b) Route (1,4). (e) Route (2, 4), 

(c) Route (2, 1). 

7 In each part below, solve the dual equalities (3) in Sec. 7.3 using the indicated value 
for the selected dual variable. 

(a) Bj = 1. (d) Wt = 1. 

(b) B, = -l. (e)«;, = 0. 

(c) 0^4 = 0. (f) a;i = — I. 

(g) Verify that $v_i + w_j$, for every i and j, has the same value in each part above. 
Explain why. 

8 Northwest Corner Rule. In Sec. 2.7, the following rule was described for finding an 
initial solution to a transportation model: start at the upper-left corner, or, as it is 
sometimes called, the northwest corner. Allocate the available supply $S_1$ to each demand 
requirement, beginning with $D_1$, until the supply is exhausted. Then continue the 
process with $S_2$ and the remaining unfilled demand requirements. Etc. 

(a) Consider the example in Fig. 7.2. Let $S_2 = 2$ and $D_2 = 6$. Apply the northwest 
corner rule to obtain an initial solution. Calculate the associated trial value for 
the objective function. 

(b) Revise the trial solution in part (a) for the case $S_2 = 1$ and $D_2 = 5$, which are 
the values in Fig. 7.2. Be sure to maintain a basic solution having $m + n - 1$ 
routes. 

(c) Indicate how to amend the above statement of the northwest corner rule to ensure 
that the process yields a basic solution containing m + n — 1 routes. 

(d) In Fig. 7.2, let = 7, .^2 = 5, and JD 4 = 7. Is a feasible solution using routes 

^is> ^ 22 > ^ 3 i> ^S 3 > 21 *^^ X 34 a basic solution? If not, explain why, and apply your 
answer to part (c). (Hint: ascertain whether feasible values for the $x_{ij}$ are uniquely 
determined.) 

9 In each part below, find the pattern of alterations, like that in Fig. 7.7, when the 
indicated nonbasic route in Fig. 7.6 is selected. Also, calculate the associated improvement 
potential and compare your answer to (10) in Sec. 7.3. Finally, show the values 
for the new basic solution when the route indicated is actually selected. 

(a) Route (3, 2). 

(b) Route (1, 4). 

(c) Route (2, 4). 

*10 Construct a basic solution for an example with m > 2 and n > 2 such that every basic 
route is revised when a particular nonbasic route is introduced. (Hint: let m = n, 
and let the basic solution be found by using the northwest corner rule in exercise 8.) 

11 In each part below, assume that the indicated nonbasic route is selected in Fig. 7.6 
at iteration 3. Continue the transportation simplex process to find the optimal solution. 
Use the tabular method, like that in Fig. 7.8, to calculate the improvement potentials. 

(a) Route (1, 3). 

(b) Route (2,4). 

12 (a) Explain why subtracting the same constant from each $c_{ij}$ in a specified Row i 
yields a new problem that has the same optimal solutions as the original problem. 

(b) Consider the transportation model applied to the distribution of an item that is 
manufactured at m plants. Let $p_i$ be the production cost of each item at Plant i 
and $t_{ij}$ the cost of transporting each item from Plant i to Warehouse j. Suppose 
total potential production at all the plants equals total demand at all the warehouses. 
Given your answer in part (a), explain why the actual values for the $p_i$ 
do not influence an optimal distribution plan. Would your explanation be modified 
if total potential production exceeded total demand? If so, how? 

13 In each part below, determine whether an alternative optimal solution would be 
indicated in Fig. 7.10, and if it would, find such a basic solution. 

(a) Cii = 3. (d) ^32 = 9. 

(b) Cii = 1. (e) Cs 2 = 7. 

(c) C 8 I = 8 . 

*14 Degeneracy. In each part below, apply the transportation simplex method to find an 
optimal solution and use the starting solution given in exercise 8. 

(a) Part (a). 

(b) Part (b). 

(c) Part (d). 

*15 Degeneracy. In each part below, apply the rescaling procedure suggested in the special 
material of Sec. 7.4 for removing degeneracy. Solve the perturbed model by the transportation 
simplex method, and indicate how to find the optimal solution in the 
original problem. Use the data and starting solution in exercise 8. 

(a) Part (a). 

(b) Part (b). 

(c) Part (d). 

16 Consider the example in Fig. 7.2. Apply the transportation simplex method to find 
the optimal solution starting with the basis: 

$x_{11} = 2 \quad x_{12} = 4 \quad x_{22} = 1 \quad x_{31} = 5 \quad x_{33} = 3 \quad x_{34} = 2$. 

*17 Solve the problem in exercise 16 using the ordinary simplex method as presented in 
Chap. 4. (If you prefer, use either of the tabular formats presented in Sec. 4.9.) 
Check that at every iteration each variable corresponds to the pattern of alterations 
occurring when the associated nonbasic route is introduced. (Suggestion: to get the 
calculations started, eliminate any one of the equations as the redundant relation, 
and solve for the basic $x_{ij}$ indicated in exercise 16.) 

18 Let the initial basic solution in Fig. 7.3 employ the route (2, 3), instead of (2, 2). 
Find the optimal solution using the transportation simplex method. (Note that this 
initial starting basis is suggested by an analysis of the relative costs in Fig. 7.13.) 

*19 (a) Consider the technique in Sec. 7.4 for deriving the relative costs displayed in 
Fig. 7.13. Apply the technique where you first perform the column calculations 
(ii) and then the row calculations (i). Are your answers the same as those in Fig. 
7.13? 

(b) Show that the solution in Fig. 7.9 is optimal for a problem having Cij as indicated 
in Fig. 7.13. Show the relation between the value of the optimal solution in Fig. 
7.9 and the process generating the Cij in Fig. 7.13. 

(c) Show that the solution in Fig. 7.9 is optimal for a problem having $c_{ij}$ as indicated 
in your answer to part (a). 

*20 Initial Basic Solution 

(a) Perform the operations indicated in Fig. 7.14 on the table in Fig. 7.15. 

(b) Use the same selection sequence of $x_{ij}$ shown in Fig. 7.14 to find an initial basis 
for the supplies and demands: 

$S_1 = 3 \quad S_2 = 4 \quad S_3 = 5 \quad D_1 = 3 \quad D_2 = 1 \quad D_3 = 4 \quad D_4 = 4$. 

21 Consider the problem in Fig. 7.2. In each part below, calculate how small the indicated 
objective-function coefficient can be (leave the other costs unchanged) such that the 
solution in Fig. 7.9 remains optimal. 

(a) c„. (d) Cj,. 

(b) Cij. (e) « 3 j. 

(c) fj 3 . (f) Cn. 
*22 Suppose you have an optimal solution in which every basic route has at least one 
unit of flow, as in Fig. 7.9. Suppose $S_p$ is increased by a unit, so that an oversupply is 
created. Show that the new optimal allocation will assign the excess to that source k 
with maximum $v_k$, and that $[v_p - \text{maximum}\,(v_k)]$ will be added to the objective 
function. 

23 Consider the example in Sec. 7.3. Suppose $S_2 = 2$. Find a new optimal solution (using 
sensitivity analysis on the previous optimal solution), and verify that total cost is 
changed by $v_2 = -9$. 

24 Consider the optimal solution shown in Fig. 7.9. In each part below, use sensitivity 
analysis to find a revised optimal solution and the associated value of the objective 
function. 

(a) Si ^1, 8 . (d) S 2 = 2, = 3. 

(b) Si^= 7, = 3. (e) ^3 = 10, = 6 . 

(c) ^2 = 2 , 2)2 = 6 . 

25 Consider the solution in Fig. 7.9. In each part below, suppose the supply at Point i 
is $S_i - \delta$ and the demand at Point j is $D_j - \delta$. State how large $\delta$ can be such that the 
basis in Fig. 7.9 remains optimal. Also, indicate an optimal solution for this value of $\delta$. 
(In the parts below, the symbol printed as 8 after each minus sign stands for $\delta$.) 

(a) 51 — 8 and D 2 — 8 . (d) ^3 — 8 and — 8 . 

(b) 5i — 8 and 2)3 — 8 . (e) ^3 — 8 and 2)3 — 8 . 

(c) 52 — 8 and 2)3 — 8 . (f) 53 — 8 and 2)4 — 8 . 

26 Answer questions in exercise 25 for each part below. [Hint: the pattern of alterations 
for the basic variables is the same as if you were going to add a unit of flow on route 
(i, j).] 

(a) 5i — 8 and — 8 . (c) 52 — 8 and — 8 . 

(b) 5i — 8 and 2)4 — 8 . (d) 53 — 8 and D 2 — 8 . 

27 Consider the network in Fig. 7.16. Delete the arcs from Node 4 to Node 3, from 
Node 6 to Node 5, and from Node 7 to Node 2. Add arcs corresponding to ^ 47 = 1, 

= 3, and ^ 1 . 

(a) Apply the algorithm in Sec. 7.6 to find the shortest routes to Node 1 from every 
other node, and indicate their corresponding lengths. 

(b) How small can $c_{34}$ be such that your answer in part (a) remains optimal? 

(c) How large can $c_{34}$ be such that your answer in part (a) remains optimal? 

(d) How large can $c_{41}$ be such that your answer in part (a) remains optimal? 

28 Consider the network in Fig. 7.16. Delete the arc from Node 5 to Node 4. Add arcs 

corresponding to ^ 13 = 1, = 2, Cjo = 3, and let ^72 = 4. 

(a) Apply the algorithm in Sec. 7.6 to find the shortest routes to Node 2 from every 
other node, and indicate their corresponding lengths. If there are several optimal 
solutions, indicate the alternatives. 
(b) Suppose you add an arc from Node 8 to Node 3. What is the smallest value of 
$c_{83}$ that will allow your answer in part (a) to remain optimal? 

(c) Suppose you add an arc from Node 8 to Node 1. What is the smallest value of 
$c_{81}$ that will allow your answer in part (a) to remain optimal? 

(d) Suppose you add an arc from Node 5 to Node 2. What is the smallest value of 
$c_{52}$ that will allow your answer in part (a) to remain optimal? 

29 Carry out the node-numbering process given at the beginning of Sec. 7.7 to verify 
that the labels for the network shown in Fig. 7.18 have been correctly assigned. 

30 Consider the network in Fig. 7.18. Delete arcs ^32 and ^ 7 ^. Add an arc corresponding 
to C 74 = 5. 

(a) Apply the algorithm in Sec. 7.7 to find the shortest routes, and their respective 
lengths, from each node to Node 1. If there are several optimal solutions, indicate 
the alternatives. 

(b) What is the smallest value of rae will allow your answer in part (a) to remain 
optimal? 

(c) Suppose you add an arc from Node 8 to Node 3. What is the smallest value of 
$c_{83}$ that will allow your answer in part (a) to remain optimal? 

(d) Suppose you add an arc from Node 8 to Node 2. What is the smallest value of $c_{82}$ 
that will allow your answer in part (a) to remain optimal? 

31 Explain your understanding of the terms triangularity (or triangular structure, or 
triangular property) and relative cost. 

Exercises 32 through 42 refer to material contained in Appendix I. 

*32 In each part apply the algorithm in Sec. I.1 to find the maximum flow in Fig. I.1. 
Start the process with the initial flow indicated below. 

(a) A unit of flow from Node 0 through Nodes 4, 5, and 6. 

(b) A unit of flow from Node 0 through Nodes 1, 2, 3, 5, and 6. 

*33 Consider the example in Fig. I.1. In each part below, determine whether the maximum 
flow can be increased if 

(a) You permit flow in any direction between every pair of nodes already connected 
by an arc. 

(b) You add an arc from Node 1 to Node 6 . 

(c) You make the changes in both parts (a) and (b). 

*34 Consider the example in Fig. I.1. Suppose the capacity limit $u_{ij}$ on arc (i, j) is $i + j$. 
(Thus, $u_{04} = 4$, $u_{36} = 9$, $u_{23} = 5$, etc.) Find the maximum flow, and indicate an 
optimal flow pattern. 

*35 Cut. Consider the example in Fig. I.1. Let Cq be all the spanned nodes associated with 
Fig. I.3, and let Cp+i be the remaining nodes. 

(a) What nodes are in Cq ? 

(b) What is the associated cut capacity? 




*36 Answer the questions in exercise 35 for the maximum flow solution in exercise 34. 

♦37 Apply the transportation simplex method to solve the assignment problem in Fig. 1.4, 
where you let the initial basic solution consist of X 14 = X 23 = JC 32 = X 41 = 1 and 
Xfi = X 34 = X 44 = 0. (Let Oj = 0 for the initial solution.) 

*38 Apply the maximal flow approach given in Sec. I.2 to solve the assignment problem 
in Fig. I.4, where you let the initial values for $v_i$ and $w_j$ correspond to those values 
associated with the initial basic solution given in exercise 37 (with $v_1 = 0$ initially). 
At each iteration, show the scanning process, as in Fig. I.16, and the resultant network 
solution, as in Fig. I.17. 

*39 Consider the assignment problem in Fig. I.4, and revise the costs to be $c_{ij} - v_i - w_j$, 
where $v_i$ and $w_j$ correspond to those values associated with the initial basic solution 
given in exercise 37 (with $v_1 = 0$ initially). Then apply the minimal cost/maximal 
flow approach described in Sec. I.2 with $v_i = w_j = 0$ and zero flow at the initial 
iteration. At each iteration draw the relevant network, like that in Fig. I.19, and a 
shortest route, like that in Fig. I.20. 

*40 Consider the assignment problem in Fig. I.4. In each part, apply the shortest-route 
approach in Sec. I.2, where you consider the rows and columns in the order indicated 
below. At each iteration, draw the network and associated shortest route, as in Figs. 
I.24 and I.25. 

(a) Row 1 and Column 1; Row 2 and Column 2; Row 3 and Column 3; Row 4 and 
Column 4. 

(b) Row 4 and Column 4; Row 3 and Column 3; Row 2 and Column 2; Row 1 and 
Column 1. 

(c) Row 1 and Column 2; Row 2 and Column 3; Row 3 and Column 4; Row 4 and 
Column 1. 

*41 Solve the transportation problem in Sec. 7.3 using the maximal flow approach as 
outlined in Sec. I.3. 

♦42 Explain your understanding of the following terms: 

maximum flow flow-augmenting path 

labeled node cut 

spanned node cut capacity, 

breakthrough 


COMPUTATIONAL EXERCISES 

43 Consider a transportation problem with m == 3 and n = 3, where 
Cii =1 ^12 = 3 ^13 = 8 

^21 16 ^21 ~ ^ ~ ^ 

C^l = 2 = 4 ^33 = 10. 
(a) Suppose Si = 4, 5*2 = 4, and .$'3 = 8 ; and Di = 2, Z >2 = Apply 

the transportation simplex method to find an optimal solution. 

(b) Use sensitivity analysis to determine how your solution in part (a) changes if 
$S_1 = 5$ and $D_1 = 3$. What is the new value of the objective function? 

(c) Answer the questions in part (b) for $S_2 = 5$ and $D_2 = 5$. 

(d) What is the smallest value of $c_{22}$ that will allow your solution in part (a) to remain 
optimal? Similarly, how small can $c_{12}$ be? 

(e) Give a range of values for Coj that allows the solution in part (a) to remain optimal. 
Do the same for ^ 33 . Do the same for ^ 32 * 

44 Consider a transportation problem with m = 4 and n = 4, where 


II 

Cio — 9 

^13 — 1 

II 

f,, = 22 

^22 = 25 

f23 = 16 

Cj 4 = 25 

C31 = 25 

0^ 

C^i 

II 

^33 = 21 

00 

cs 

II 

II 

= 14 

^43 = 6 

«44 = 15. 


(a) Suppose $S_1 = 7$, $S_2 = 3$, $S_3 = 7$, and $S_4 = 8$; and $D_1 = 5$, $D_2 = 5$, $D_3 = 7$, and 
$D_4 = 8$. Apply the transportation simplex method to find an optimal solution. 

(b) Use sensitivity analysis to determine how your solution in part (a) changes if 
$S_2 = 4$ and $D_4 = 9$. What is the new value of the objective function? 

(c) Rework part (b) for $S_3 = 8$ and $D_3 = 8$. 

(d) What is the smallest value ofcn that will allow the solution in part (a) to remain 
optimal? Similarly, what is the smallest value of ^ 21 ? Similarly, how small can 

be? 

(e) Give a range of values for C 23 such that the solution in part (a) remains optimal. 
Do the same for C 43 . Do the same for C 42 . 

45 Consider a transportation problem with m = 4 and n = 4, where 


II 

Ci 2 = 7 

II 

0 

II 

tj, = 4 

^22 = 5 

00 

II 

^24 — 

fji = 13 

C 32 = 15 

C , = 14 

C34 - 21 

00 

II 

C42 = 20 

« 4 , = 24 

C 44 = 31. 


(a) Suppose $S_1 = 3$, $S_2 = 4$, $S_3 = 9$, and $S_4 = 18$; and $D_1 = 15$, $D_2 = 5$, $D_3 = 7$, 
and $D_4 = 7$. Apply the transportation simplex method to find an optimal solution. 

(b) Use sensitivity analysis to determine how your solution in part (a) changes if 
$S_1 = 4$ and $D_1 = 16$. What is the new value of the objective function? 

(c) Rework part (b) for $S_2 = 5$ and $D_1 = 16$. 

(d) Rework part (b) for $S_1 = 4$ and $D_2 = 6$. 

(e) What is the smallest value of $c_{44}$ that will allow the solution in part (a) to remain 
optimal? 

(f) Give a range of values for such that the solution in part (a) remains optimal. 
Do the same for ^ 23 - Do the same for ^ 34 . 

46 Fly-by-Night Airlines 

(a) Solve the case given in exercise 28 of Chap. 2. 
(b) What is the optimal value of the objective function and what are the revisions to 
the purchase plan in part (a) if one more gallon is required at Airport 1? At 
Airport 2? At Airport 3? At Airport 4? 

(c) What is the optimal value of the objective function and what are the revisions to 
the purchase plan in part (a) if one more gallon is offered by Oil Company 1? 
By Oil Company 2? By Oil Company 3? 

(d) What is the optimal value of the objective function and what are the revisions to 
the purchase plan in part (a) if one more gallon is offered by Company 1 and 
required at Airport 1? At Airport 2? At Airport 3? At Airport 4? 

47 Hayes Manufacturing Company 

(a) Solve the case given in exercise 29 of Chap. 2. 

(b) What is the value of an extra unit of capacity at each plant? 

(c) What is the cost associated with an extra unit of demand by each wholesaler? 

(d) Give a range for the direct cost of production at each plant such that the solution 
in part (a) remains optimal. 

48 Longview Manufacturing Company 

(a) Apply the transportation simplex algorithm to solve the case in exercise 28 of 
Chap. 6 . Be sure to indicate the amount produced in each period and the level of 
inventory at the end of each period. 

*(b) Did you obtain the same solution as you did in exercise 28, part (c), of Chap. 6 ? 
If not, do the two solutions have the same total cost? 

49 Sinbad Steamship Company. Apply the transportation simplex method to solve the case
in exercise 29 of Chap. 6, where

Pa = ^ = 2 ^(7 = 4 

= 4 2 = 1. 

Be sure to indicate an optimal routing for each ship. 

50 Sinbad Steamship Company. Apply the transportation simplex method to solve the case in
exercise 30 of Chap. 6. Be sure to indicate an optimal routing for each ship.

51 Consider the network in exercise 32 of Chap. 6, where c_ij = c_ji and

c_12 = 14   c_13 = 4   c_14 = 1   c_15 = 6
c_23 = 10   c_24 = 13   c_25 = 5
c_34 = 3   c_35 = 2
c_45 = 8.

(a) Find an optimal solution by applying the transportation simplex method to the 
appropriate condensed transshipment table, like that in Fig. 6.4. Be sure to indi¬ 
cate an optimal routing and state whether it is unique. 

(b) Answer part (a) using an expanded transshipment table, like that in Fig. 6.5. 

(c) What is the smallest value for C 45 such that the current solution remains optimal? 
For c ,4 ? For C 34 ? For C 35 ? 

52 Rock-Bottom Discount Store Example. Consider the network in Fig. 6.3. Assume that

Ci2 =1 C2Z = 7 C2i = S =1 ^45 = 3 C47 = 4 

C 54 = 2 Cjg = 5 = 3 Cjs = 1. 

(a) Apply the transportation simplex method to a condensed transshipment table, 
like that in Fig. 6.4. Be sure to indicate the implied optimal routing. 

(b) Apply the transportation simplex method to an expanded transshipment table, 
like that in Fig. 6.5. Be sure to indicate the implied optimal routing. 

(c) Does your solution change if the route from Store 6 to Store 7 is removed? If so, 
how? 

(d) Suppose you add a route from Node 1 directly to Node 3. How small can the 
associated value ^13 be such that the solution in part (a) remains optimal? Simi¬ 
larly, for a direct route from Node 2 to Node 7, what is the smallest possible 
value of C27 ? 

53 Rework exercise 52 using the supply and demand data given in exercise 6 , part (a), 
of Chap, 6 . 

54 Rework exercise 52 using the supply and demand data given in exercise 6 , part (b), 
of Chap. 6 . 

55 Rework exercise 52 using the supply and demand data given in exercise 6 , part (c), 
of Chap. 6 . 

56 Marcus Metal Company, Consider the case in exercise 31 of Chap. 6 , where 


0 

II 

^2 = 15 

II 

00 

/),= 10 

D, = 2 

mi = 1 

TTlj = 2 

II 

oa 

= 5 


S12 = 1 

Sn = 1 

-^23 — 6 

J,4 = 3 


<35= 1 

< 3 . = 2 

^37 = 3 

<45 = 3 

I40 = 2 /47 = 1 . 


(a) Find an optimal solution by applying the transportation simplex method to the 
appropriate condensed transshipment table, like that in Fig. 6.4. Be sure to 
indicate the optimal routing and flow of goods. 

(b) Rework part (a) using an expanded transshipment table, like that in Fig. 6.5. 

(c) What is the optimal value of the objective function and what are the revisions in

the solution if = ll?If5i = 9 ?If 52 = 16?If.S'2 = 14? If/)* = 9?IfDe =11? 
If D 7 = 3? = 11 and Z >7 = 3? If .S 2 = 16 and = 9? 

(d) How large can m^ be such that the solution in part (a) remains optimal ? Similarly, 
how large can m 2 be? Similarly, how large can p^ be? Similarly, how large can 
Pi be? 

(e) Suppose it is possible to transship the product through any of the Locations 5, 6 , 
and 7 to any other of these locations at the cost How small can each value of 

be (where k and h equal 5, 6 , and 7, and k ^ A), such that the solution in part 
(a) remains optimal ? 

57 Consider the network in exercise 33 of Chap. 6 , where Cij = and 
^12 = 15 ^14 = 8 Cjj = 55 C 21 = 28 ^24 ~ 5 C 25 — 12 
^38 — ® ^45 “* 20 Cig — 48 C 57 — 20 C 58 — 7 Cq 7 — 12 C 73 — 10.

(a) Find an optimal solution by applying the transportation simplex method to the 
appropriate condensed transshipment table, like that in Fig. 6.4. Be sure to indi¬ 
cate the optimal routings. 

(b) Rework part (a) using an expanded transshipment table, like that in Fig. 6.5. 

(c) What is the smallest cost for the arc between Nodes 1 and 6 such that the solution 
in part (a) remains optimal ? For the arc between Nodes 4 and 6 ? 

(d) Suppose you add a direct route between Nodes 1 and 3. How small can the value 
be for ^13 = C 31 such that the solution in part (a) remains optimal? Similarly, for 
a route between Nodes 4 and 7, what is the smallest possible value for ^47 = ^74 ? 

♦58 Employment Scheduling, Consider the example in Fig. 6.19, with the cost data as given 

in exercise 21 of Chap. 6 . Let 

Ri =10 = 15 = 9 724 = 20 R^ = 12. 

(a) Apply the transportation simplex method to the tabular form in Fig. 6.20. Be sure 
to indicate how many crews start each period in an optimal plan. 

(b) What is the optimal value of the objective function, and what are the revisions 
in the plan from part (a) if /2i = 11 ? If = 14? If i2i = 11 and R 2 = 14? If 
R^ = 8? If 724 = 21? If 723 = 8 and 724 = 21? 

59 (a) Show that if every objective-function coefficient is integer-valued in a
transportation model, then there are optimal values for the dual variables that
are also integer-valued.

*(b) Prove the same proposition for the general network problem given by (i) through
(iv) in Sec. 7.4.

60 Consider the network in Fig. 7.20. The number on each arc represents the distance
between the nodes, and c_ij = c_ji.

Figure 7.20


(a) Find the shortest routes, and their lengths, from each node to Node 1. 

(b) Find the shortest routes, and their lengths, from each node to Node 2. 


61 Consider the network in Fig. 7.21. The number on each arc represents the distance
between the nodes, and c_ij = c_ji.

(a) Find the shortest routes, and their lengths, from each node to Node 1.

(b) Find the shortest routes, and their lengths, from each node to Node 3. 

62 Consider the network in Fig. 7.22. The number on each arc represents the distance
between the nodes, and c_ij = c_ji.



(a) Find the shortest routes, and their lengths, from each node to Node 1. 

(b) Find the shortest routes, and their lengths, from each node to Node 3. 

63 Consider a network consisting of six nodes. Assume Cij = Cji and that there are arcs 
in each direction corresponding to 

Cj2 = 15 = 5 ^14 = 8 Ci5 = 3 

^13= 10 ^23 = 3 ^33 = 5 ^43 = 2 

^23 = 9 C34 = 3 ^45 = 4 C33 = 7. 

(a) Find the shortest routes, and their lengths, from each node to Node 1. 

(b) Find the shortest routes, and their lengths, from each node to Node 2. 

*64 In each part below, use the algorithm in the advanced material of Sec. 7.6 to find the
shortest routes from every node to every other node and indicate the corresponding 
distances. 

(a) Network in exercise 60. (c) Network in exercise 62. 

(b) Network in exercise 61. (d) Network in exercise 63. 

65 Consider the network in Fig. 7.23. The number on each arc represents the associated
length c_ij. Where an arrowhead appears on both ends of an arc, the length shown
applies to each of the two directions.



(a) Find the shortest routes, and their lengths, from each node to Node 1. 

♦(b) Discuss the problem of finding the longest routes. 

66 Consider an acyclic network with five nodes and an arc from each Node i (i = 
2, 3,4, 5) to every Node j, where j < i. Let Ctj = i + (i — Find the shortest 
routes, and their lengths, from each Node i to Node 1. 

67 Consider the acyclic network in Fig. 7.24. The number on each arc is the associated 
length cij. 



Figure 7.24 

(a) Label the nodes by the technique given at the beginning of Sec. 7.7.

(b) Find the shortest routes from each node to Node 1 and indicate their lengths. 

(c) Find the longest routes from each node to Node 1 and indicate their lengths. 

68 Equipment Replacement. Apply the shortest-route algorithm in Sec. 7.7 to solve the 
Rhode-Bloch Trucking Company case in exercise 15, part (d), of Chap. 6. 

69 Equipment Replacement. Consider the Goode-O’Toole Company case in exercise 41 of 
Chap. 6. 

(a) Apply the shortest-route algorithm in Sec. 7.7 to find an optimal replacement 
policy. 

(b) Determine a range for p 2 , the cost of a new model in Period 2, such that the 
policy in part (a) remains optimal. Do the same for p^. For p^. For p^. 

(c) Determine a range for the cost of operating a piece of equipment during its 
second consecutive period of use, such that the policy in part (a) remains optimal. 
Do the same for r^. For For r^. 

*70 Equipment Replacement. Consider the network in Fig. 6.10. Interpret the amounts that
are sequentially calculated by the shortest-route algorithm (1) and (2) in Sec. 7.7 in
the context of this network.

*71 Consider the problem in exercise 43, but let S_i = D_j = 1 for all i and j.

(a) Solve by the transportation simplex method. 

(b) Solve by the maximal flow approach in Sec. 1.2. 

(c) Solve by the minimal cost/maximal flow approach in Sec. 1.2. 

(d) Solve by the shortest-route approach in Sec. 1.2. 

*72 Consider the problem in exercise 44, but let S_i = D_j = 1 for all i and j.

(a) Solve by the transportation simplex method.

(b) Solve by the maximal flow approach in Sec. 1.2.

(c) Solve by the minimal cost/maximal flow approach in Sec. 1.2.

(d) Solve by the shortest-route approach in Sec. 1.2.

*73 Consider the problem in exercise 45, but let S_i = D_j = 1 for all i and j.

(a) Solve by the transportation simplex method.

(b) Solve by the maximal flow approach in Sec. 1.2.

(c) Solve by the minimal cost/maximal flow approach in Sec. 1.2.

(d) Solve by the shortest-route approach in Sec. 1.2. 

*74 Solve the transportation problem in exercise 43 using the maximal flow approach 
outlined in Sec. 1.3. 

*75 Solve the transportation problem in exercise 44 using the maximal flow approach 
outlined in Sec. 1.3. 

*76 Solve the transportation problem in exercise 45 using the maximal flow approach 
outlined in Sec. 1.3. 

MIND-EXPANDING AND
PENCIL-PUSHING EXERCISES 

Exercises 77 through 81 refer to transshipment problems. Consider each example
formulated as a general network, according to (i) through (iv) at the end of Sec.
7.4, and draw the network representation. [In some problems, you may have to
add a fictitious node so that equality in (iv) holds.] You can carry out the simplex
method directly on the network diagram without having to resort to either a
condensed or an expanded table.

To start, obtain an initial feasible solution; be sure you use p - 1 arcs, where p
is the number of nodes in the network, and that you permit no loops among the
selected arcs. (In other words, select a tree, corresponding to a basic feasible
solution. Some of the selected arcs may have zero flow in the trial solution.)

Then calculate the associated values of the dual variables y_k, k = 1, 2, ..., p,
by the p - 1 equations one equality for each route (i, j) in the trial
basic solution. (Let = 0.) Test for dual feasibility, that is, check (iv) in Sec. 7.4.
Stop if all the dual restrictions are satisfied; your solution is optimal. Otherwise, 
introduce a new route corresponding to a dual restriction that is violated most. 
Reroute the flow along this new route until a route in the previous trial solution 
has no more flow and hence drops. Repeat the test for optimality. Continue if 
necessary. 

77 Solve the problem in exercise 51. 

78 Solve the problem in exercise 52. 

79 Solve the problem in exercise 56. 

80 Solve the problem in exercise 57. 

81 Solve the problem in exercise 58. 

82 Explain how you would modify the above algorithm to take account of arc capacity
constraints x_ij ≤ u_ij, where u_ij is a positive integer.

Solve each exercise below using the simplex algorithm for a capacitated trans¬ 
portation problem, as outlined in Sec. 7.4. 

83 The problem in exercise 43 with u_ij = 2 for all i and j.

84 The problem in exercise 44 with u_ij = 3 for all i and j.

85 The problem in exercise 45 with u_ij = 5 for all i and j.

CHAPTER 8


Introduction to Dynamic
Optimization Models

8.1 ANALYSIS OF DYNAMIC PHENOMENA 

Dynamic elements (time considerations) were foremost in several of the operations
research applications you examined in previous chapters, such as the integrated
production planning example in Sec. 2.6. But the emphasis so far has been
on presenting algorithmic techniques for coming to grips with a host of simultaneous
constraints. You saw, for example, the way the simplex method provides
a feasible solution to all the restrictions at each iteration. You undertook sensitivity
analysis of linear programming solutions in the same context. Chapters 6 and 7
explored the special structure of network models, and focused on efficient methods
for obtaining numerical solutions to large-scale problems. One important departure
from this orientation is the solution method for the shortest-route problem in an
acyclic network, Sec. 7.7. In that method you took advantage of the “one-way”
characteristic of the network. You should carefully review that material, pp.
235-236, before starting the next section.

In Chaps. 8 through 12, the simultaneity aspect still remains crucial, but the 
stress will now shift to dynamic structural relationships in optimization models. 
These chapters provide only deterministic examples, in that each decision leads 
to a uniquely determined outcome. Models with probabilistic outcomes are left 
for Chaps. 17 and 18. You will find, however, that the analytic solutions to most of
the probabilistic problems are direct generalizations of the concepts you will learn
in Chaps. 8 through 12. Consequently, studying the deterministic methods is also
very useful preparation for the chapters dealing with probabilistic models.

Here you will concentrate on the form and properties of optimal solutions. You 
will study the conditions that must be satisfied by an optimal time-staged decision 
process, and discover how to exploit these conditions to determine the best action. 
Frequently, the term dynamic programming is applied to this sort of analysis. 

The main topic explored in Chap. 8 is sensitivity analysis on time as a variable.
For example, you will see how lengthening the span of a planning horizon can 
profoundly affect your choice of a correct immediate decision. A related subject 
studied is the influence of starting conditions, such as the level of resources on hand, 
and of constraints, such as capacity limitations. 

All the models treated in Chaps. 8 through 10 have a finite planning horizon, 
such as a year or a decade. Chapters 11 and 12 discuss the special considerations 
peculiar to optimization over an unbounded time horizon. (The concept of time 
discounting and the approach of present-value analysis are also postponed until 
Chap. 11.) 

A price must be paid to garner these insights about dynamic phenomena.
Most important, the size of an optimization model, in terms of separate constraints,
must be reduced drastically as compared to typical linear programming
models. There are practical advantages to the approach, however, in addition to
the value of the new results. One already mentioned is that the span of the planning
horizon can be made unbounded. Another is that dynamic programming opens
the way toward solving certain small-scale, yet important, cases of nonlinear
programming models. These include problems having nonlinear objective functions
and a few nonlinear constraints, as well as integer-value restrictions on the
variables. In Chap. 10, you will find specific examples demonstrating how such
problems, though simple in appearance, can be intricate and subtle. Finally, the
numerical techniques exhibited are applicable to a number of decision problems
that are not essentially dynamic, but nevertheless can be viewed as dynamic or
staged for algorithmic purposes.

So that you keep the proper perspective, we emphasize that numerical solutions 
to all the models in Chaps. 8 through 10 can be obtained by applying algorithms 
you have previously studied (in particular, the method of obtaining a shortest 
route in an acyclic network). The salient goal is learning how to characterize the 
models in such a way as to clarify their dynamic properties. Part of this task is 
accomplished merely by augmenting your operations research vocabulary with 
new terms, and introducing you to a helpful mathematical notation. But, as you 
will come to appreciate, there are no simple rules that can be applied mechanically 
to all problems so as to expose their dynamic properties. Experience is the best 
teacher, and therefore the text contains a wide variety of examples. Many specific 
optimization problems can be formulated in several apparently different ways, 
with each formulation highlighting a certain structural relationship, as you will 
see. 

Importance of dynamic programming. These introductory comments
are meant to help you make a transition from the point of view of the
previous chapters. But nothing has been said about the practical importance of
the models you are to study next. Are such problems of major economic consequence?

Whereas most industrial applications of the linear programming models you
have seen are oriented to planning decisions in the face of large-scale complex
situations, dynamic programming models are typically applied to much smaller-scale
phenomena. The following illustrations typify dynamic programming decision
models:

• Inventory reordering rules indicating when to replenish an item and by what 
amount 

• Production-scheduling and employment-smoothing doctrines applicable to 
an environment with fluctuating demand requirements 

• Spare-parts level determination to guarantee high-efficiency utilization of 
expensive equipment 

• Capital-budgeting procedures for allocating scarce resources to new ventures 

• Selection of advertising media to promote wide public exposure to a company’s
product

• Systematic plan of search to discover the whereabouts of a valuable resource

• Scheduling methods for routine and major overhauls on complex machinery

• Long-range strategy for replacing depreciating assets.

Most of these applications are treated in detail in this chapter and the chapters
to follow, and so no further explanation is supplied here.

Frequently, the decision processes embraced by several of the above models 
may themselves be micro. But many real operating systems call for thousands 
of such decisions each week. These models are valuable, then, because they make 
it possible to take a myriad of actions through a routine (often computerized) 
approach with a modicum of human intervention. Needless to say, even if these 
decisions are inconsequential taken singly, in aggregate they can exert a major 
effect on a business’s profits. A reduction of 25% or more in maintenance costs or 
in aggregate dollar inventory levels, with no degradation of service, has been the 
solid reward for many companies employing dynamic programming models.

A word of guidance. The common characteristic of all dynamic programming
models is expressing the decision problem by means of a recursive
formulation. If you have never applied this type of formal reasoning to solve a
problem, then you will probably find the associated mathematical notation strange,
and perhaps even puzzling. The advice below is aimed at helping you overcome
these difficulties.

Plan to read the sections at least twice. First concentrate on getting a feeling for
the decision problem and on becoming familiar with the symbols. Then, on the
second reading, pay more attention to the details of the presentation, including
the nature of the mathematical expressions. Carefully follow the numerical examples,
and check the calculations wherever the text advises you to do so. Finally,
be patient and give yourself plenty of time to study the sections. Reading about
dynamic programming can be slow going, even if you have previously studied
recursive relations. Rest assured, if you proceed as suggested, that after mastering
a few examples you will suddenly find it much easier to understand subsequent
recursive formulations; this threshold effect in learning is what psychologists
refer to as the “aha!” phenomenon.

8.2 STAGECOACH PROBLEM: AN ALLEGORY

An allegorical example will explain several important concepts in dynamic 
programming and establish a symbolic way to view time-oriented models. There 
will be nothing new to you in the nature of the problem—you are simply to find 
a shortest route through an acyclic network. However, the particular illustration 
will contain more contextual structure than that appearing in Sec. 7.7, where you 
learned the shortest-route algorithm. 

The scenario. Once upon a time there lived a Mr. Mark Off who decided
to seek his fortune in San Francisco. In his day, the stagecoach was the only means
of public transportation from the East, where he lived, to the West. His travel
agent showed him a United States map, Fig. 8.1, depicting the various stagecoach
routes then available. Each block on the map represents a state; each state is
numbered for convenience. Notice that the entire trip from East to West requires
Mark Off to take four stages, regardless of the particular routing.

Figure 8.1. Stagecoach Problem.

Since the traveler knew the trip presented serious hazards to life and limb, he
decided to take out an insurance policy before leaving. The cost of the policy
depended on which routes he selected, since the greater the danger, the higher
the cost. Let c_ij denote the policy cost of traveling from State i to State j.
Illustrative values of c_ij appear in Fig. 8.1. Mark Off’s objective was to pick a route
from East to West that minimized the total policy cost. (See if you can find an
optimal route.)

Mark Off analyzed the problem as follows. He perceived as significant the following
principle.

Principle of Optimality: An optimal policy must have the property that
regardless of the route taken to enter a particular state, the remaining decisions
must constitute an optimal policy for leaving that state.

Thus he knew that an optimal routing out of State 6, say, did not depend on
the particular routing that resulted in his entering State 6. (How would you calculate
an optimal routing from State 6?) Pressing his logic further, he reasoned
that if he knew optimal routings out of States 5, 6, and 7, then he could fairly
easily determine an optimal route out of State 3, in case he decided to enter there.
Why? Because he need only compare the immediate cost when he leaves State
3 (either c_35, c_36, or c_37) plus the known costs of optimal routes out of States 5, 6,
and 7, respectively, and then pick a state giving the minimum combined cost.
By the same token, as soon as he had found optimal policies for States 2, 3, and 4,
he could find an optimal routing from State 1. Explain how.

With the principle of optimality and its computational implications in mind,
he defined the symbols

f_n(s) = minimum policy cost when he is in State s with n more stages to go
to his final destination,

j_n(s) = a decision yielding f_n(s).

Most beginning students are driven up the wall by this sort of notation, which
is always used in specifying dynamic programming models. It looks even more
complicated than the formulas employed in statistics texts, and those are bad
enough. But like it or not, each of the letters is essential to the meaning. The letter
f signals to you that the number being represented is a value for the objective
function. The letter s tells you that the objective-function value actually depends
on the state of the system. And the subscript n gives you the dynamic information
that there are n stages left to go when the system is in State s. By the same token,
the Decision j also depends on both the Stage n and the State s and signifies his
journey.

As you read this and other dynamic programming examples, you may find it
helpful to repeat to yourself the definition of the symbols, just as if you were
learning a new language. The reason we did not need such horrendous notation to
explain linear programming models is that those problems are solved in one fell
swoop. But here we creep up on a solution by stages. Hopefully, we have managed
at least to ease your anxiety by this justification.

Returning to Mark Off’s problem, he realized that he knew

(1) f_0(10) = 0 for j_0(10) = stop,

since when he was in State 10 with no more stages to go, his journey had indeed
ended. But then he saw that with hardly any effort he could also calculate f_1(8)
and f_1(9), because they were simply f_0(10) plus c_{8,10} and c_{9,10}, respectively. Heady
with success, he examined how he could compute f_2(6), the minimum policy cost
when he is in State 6 with two more stages to go to his final destination. He noticed
that he had only two ways to leave State 6, should he decide to enter there. One
way is to go to State 8; the associated policy cost is c_68 plus f_1(8), which he already
has computed. The other is to go to State 9; the associated policy cost is c_69 plus
f_1(9), which he also has computed. Aha! The value of f_2(6) must be the smaller
of these two sums. (Is this how you calculated an optimal routing from State 6?)

Mark Off suspected that there must be a method in his madness, and, of course,
he was right. The method (or madness) can be succinctly stated by a so-called
dynamic recursive relationship:

(2) f_n(s) = minimum [c_sj + f_{n-1}(j)] for n = 1, 2, 3, 4,

where the minimum is taken over all routes (s, j) in the network.

This formula, expressed in words, states the fact that Mark Off should compare
each possible sum of the policy cost for the immediate stage, going from State s
to State j, and the optimal policy cost when he arrives at State j with only n - 1
more stages to go to his final destination. He should decide on a j that gives the
smallest value of these sums. Mark Off’s reasoning was to apply (2) by calculating
first all the values for n = 1, namely, f_1(8) and f_1(9); then, in turn, for n = 2, he
would calculate the values for f_2(5), f_2(6), f_2(7), followed by the values for f_3(2),
f_3(3), and f_3(4). His entire problem was solved once he found the value of f_4(1),
because this put him at the beginning of his journey, starting at the initial State 1
with four stages to go.

The above approach for solving the problem might almost be termed a method
of lifting yourself by your own bootstraps. First you perform a small, even trivial,
computation. You use that result to perform a subsequent computation, and you
continue in the same fashion. In symbolic terms, the computation in (2) states that
you can find the values of f_1(s) when you know the values of f_0(j). You then can
find the values of f_2(s), having calculated the values of f_1(j), and so on. This sort
of procedure is referred to as a recursive algorithm, and the formula is
known as a recursion.
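
The recursion (2) can be written almost verbatim as a short program. The sketch below is only an illustration, not part of the original presentation. The arc costs are those that can be read from, or inferred from, the relations and tables of this section; the five costs c_35, c_36, c_59, c_69, and c_78 cannot be recovered that way and are assumed values, chosen so that the results agree with the figures quoted in the text. The names COST and f are likewise just labels for this sketch.

```python
from functools import lru_cache

# COST[s][j] is the policy cost c_sj of traveling from State s to State j.
# Entries marked "assumed" are hypothetical stand-ins, not data from the text.
COST = {
    1: {2: 2, 3: 5, 4: 1},
    2: {5: 10, 6: 12},
    3: {5: 8, 6: 10, 7: 7},   # c_35 and c_36 assumed
    4: {6: 15, 7: 13},
    5: {8: 7, 9: 5},          # c_59 assumed
    6: {8: 3, 9: 4},          # c_69 assumed
    7: {8: 7, 9: 1},          # c_78 assumed
    8: {10: 1},
    9: {10: 4},
    10: {},
}


@lru_cache(maxsize=None)
def f(n, s):
    """Minimum policy cost from State s with n more stages to go."""
    if n == 0:
        return 0  # relation (1): the journey has ended at State 10
    # Recursion (2): immediate cost plus the best cost with n - 1 stages left.
    return min(c + f(n - 1, j) for j, c in COST[s].items())


print(f(1, 8), f(1, 9))  # 1 4
print(f(2, 6))           # 4
print(f(4, 1))           # 17
```

Each value f(n, s) is computed once and then reused, which is the bootstrap idea described above: the small results for n = 1 feed the computations for n = 2, and so on.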

Computational process. We review the idea of employing a recursive
algorithm by doing the actual calculations for the numerical values of c_ij appearing
in Fig. 8.1.

For n = 1, the calculations implied by (2) and (1) are trivial:

(3) f_1(8) = c_{8,10} + 0 = 1 for j_1(8) = 10 (s = 8)
    f_1(9) = c_{9,10} + 0 = 4 for j_1(9) = 10 (s = 9).

To keep the rest of the calculations straight, use the tables shown in Figs. 8.2 
through 8.5. 

There is one table for each possible Stage n, namely, n = 1, 2, 3, and 4. The
format for a table is to have a row for each possible entering state, given that there
are n stages to go, and a column for each possible state at the next stage. Thus for
n = 1, shown in Fig. 8.2, there are two rows for States 8 and 9, because Mark
Off may choose to enter either of these states. But there is only one column for
State 10, since that is the only destination from both States 8 and 9. For n = 2,
shown in Fig. 8.3, Mark Off may choose to enter either State 5, 6, or 7, and so
three rows are needed; he then may travel to either State 8 or 9, and so two columns
are required.

Figure 8.2. Stagecoach Problem.

Figure 8.3. Stagecoach Problem.

The entries in a table are the sum of the immediate cost c_sj to go from State s
to State j and the subsequent policy cost f_{n-1}(j) associated with an optimal route
out of State j. In each row, you examine these sums to find the smallest. That minimum
is labeled f_n(s), and an associated optimal decision is designated as j_n(s);
both are shown on the right of each table.

The computations for n = 1 are repeated in Fig. 8.2. In this particular example,
when n = 1, the only feasible action is j = 10. Therefore, j_1(8) = j_1(9) = 10,
as you already saw in (3).

When n = 2, recall that j can be either 8 or 9. To complete the calculations when
there are two more stages to go, it is necessary to have the values f_1(j), and only
those values. The computations are shown in Fig. 8.3. Observe that f_1(8) = 1 is added to
each c_sj in the j = 8 column and f_1(9) = 4 is added to each c_sj in the j = 9 column.
The table shows that with two stages left it is optimal to go to State 8 from States
5 and 6, and to State 9 from State 7.

The analysis for n = 3 appears in Fig. 8.4. Notice here that two entries are
blanked out, since it is not possible to go from State 2 to State 7 or from State 4
to State 5. Once again observe that the only data required from the previous Fig.
8.3 are the values of f_2(j). This is a key point in all dynamic programming applications:
the value of an optimal policy with n stages left to go depends on the economic
consequence of the immediate action and the corresponding value of an optimal policy with
n - 1 stages remaining.

n = 3                  c_sj + f_2(j)
                        Decision j
Entering
State s        5           6           7        f_3(s)   j_3(s)
   2        10 + 8      12 + 4         -          16        6
   3     [illegible]  [illegible]    7 + 5        12        7
   4           -        15 + 4      13 + 5        18        7

Figure 8.4. Stagecoach Problem.

n = 4                  c_sj + f_3(j)
                        Decision j
Entering
State s        2           3           4        f_4(s)   j_4(s)
   1         2 + 16      5 + 12      1 + 18       17        3

Figure 8.5. Stagecoach Problem.


The computations terminate in Fig. 8.5 with n = 4. There you see that the 
minimum policy cost is 

(4) f_4(1) = 17 for j_4(1) = 3.

What is the corresponding optimal policy? To answer this question, you must 
trace through the tables in the following fashion. Starting with the table for n = 4, 
shown in Fig. 8.5, you find that an optimal decision is to go from State 1 to State 
3. Progressing to the table for n = 3, shown in Fig. 8.4, you see that when Mark 
Off enters State 3 (third row of table), an optimal decision is to go to State 7. 
Continuing to the table for n = 2, shown in Fig. 8.3, you find that when he enters 
State 7, an optimal decision is to go to State 9. And from State 9 he ends up at 
State 10. In summary, an optimal policy is the route from State 1 to 3 to 7 to 9 
to 10, which, as $f_4(1)$ indicates, has a policy cost of 5 + 7 + 1 + 4 = 17.
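The backward pass and the trace through the tables can be sketched in a few lines of Python. This is only a sketch under stated assumptions: the arc costs below are those that can be read off the text and Figs. 8.4 and 8.5 (for example, the cost 7 from State 5 to State 8 is inferred from $f_2(5) = 8$ and $f_1(8) = 1$), and arcs whose costs are not legible here are simply left out. The names COST, STAGES, solve, and route are illustrative.

```python
# Arc costs c[s][j] recoverable from the text; other arcs are omitted (assumption).
COST = {
    1: {2: 2, 3: 5, 4: 1},
    2: {5: 10, 6: 12},
    3: {7: 7},
    4: {6: 15, 7: 13},
    5: {8: 7},
    6: {8: 3},
    7: {9: 1},
    8: {10: 1},
    9: {10: 4},
}
# States grouped by the number of stages n left to go.
STAGES = {1: [8, 9], 2: [5, 6, 7], 3: [2, 3, 4], 4: [1]}


def solve():
    """Backward recursion f_n(s) = min over j of [c_sj + f_{n-1}(j)], f_0(10) = 0."""
    f = {10: 0}      # value of an optimal policy from each state
    best = {}        # an optimal next state j_n(s)
    for n in sorted(STAGES):
        for s in STAGES[n]:
            j = min(COST[s], key=lambda k: COST[s][k] + f[k])
            f[s], best[s] = COST[s][j] + f[j], j
    return f, best


def route(best, start=1, end=10):
    """Trace the tables forward from the start state."""
    path = [start]
    while path[-1] != end:
        path.append(best[path[-1]])
    return path


f, best = solve()
print(f[1], route(best))   # 17 [1, 3, 7, 9, 10]
```

Because the omitted arcs do not lie on the optimal route described above, the sketch still reproduces the policy cost of 17.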

You should understand that the dynamic programming method is more efficient
than enumerating and evaluating every possible policy. In this particular problem,
there are 14 distinct routes from East to West. To evaluate the policy cost of any
route, it is necessary to add the four (one for each stage) appropriate $c_{ij}$. Therefore,
a naive enumeration approach would have required 42 (= 14 × 3) additions,
as compared to a total of 16 additions in Figs. 8.3 through 8.5. The relative
advantage of the recursive method is overwhelming in typical applications, where
complete enumeration is usually out of the question.

A study questionnaire. The stagecoach allegory contains several concepts
and approaches that recur in subsequent applications. To master these
ideas, when you study each new model ask yourself:

(i) What are the policy or decision variables?

(ii) What is the criterion or objective function for determining an optimal
policy?

(iii) How is the problem characterized and then analyzed in terms of stages? 

(iv) What characterizes the state of the problem at each stage? 

(v) How do the constraints influence the states of the problem and the feasible 
values of the policy variables? 

Once you are able to formulate a model in multistage terms, you have taken the 
first step toward analyzing the problem’s dynamic characteristics. 

8.3 ELEMENTARY INVENTORY MODEL 

Inventory decisions included in previous planning models—for example, 
the integrated production planning model in Sec. 2.6—were imbedded in a large 
and complex system of constraints. Now the scope is narrowed considerably so 
as to concentrate on several essentials of dynamic inventory decision processes. 
The model you are about to study plays the same role in the field of operations
research as do Newton’s elementary laws in the field of physics (well, nearly the
same role). Even though the situation considered by the model is idealized, it
encompasses many of the important considerations for choosing an inventory 
policy. 

Keep in mind that the primary purpose of the discussion below is to examine 
the dynamic phenomena of inventory processes. For this reason, the description 
of the model does not comment on the severity or realism of the assumptions; 
these are taken up in Sec. 8.5. For the same reason, the economic concepts 
introduced are only described in brief. They will be elaborated in Chap. 19 
and Appendix II, which are devoted entirely to inventory models. 

By way of encouragement and motivation, however, let it be said that manufacturing
firms have implemented versions of this model with demonstrable economic
benefits. As you would imagine, in such instances all the calculations involved 
have been computerized. 

Dependable Manufacturing Company Example. This firm wishes
to establish a production schedule for an item during the next N time periods.
Assume the company has an accurate forecast of the amount it requires to meet
demand for each of the N periods.

The manufacturing time to produce a batch of these items is negligible, so that
production in Period t can be used to fill, entirely or partially, the demand in that
period. Since the demand requirements do vary from one period to another and
there are certain economies of batch production, it is often economical for the
firm to produce more than is needed in one period and store the excess until it is
required later. However, there is a cost of holding the resultant inventory. Depending
on the circumstances, this expense is attributable to such factors as interest on
capital borrowed for financing the inventory buildup, storage rental fees, insurance,
and maintenance. Such inventory holding cost must be taken into account
in determining a production schedule.

The objective of the Dependable Manufacturing Company is to devise a schedule 
that minimizes the total production and inventory holding costs subject to the 
restriction that it meets all the demand requirements on time. (Would you believe 
that the company uses the slogan “Dependable is dependable”?) We begin the
analysis by translating this qualitative statement of the problem into a
mathematical model.

Model formulation. Define the policy variables

$x_t$ = production quantity in Period t
$i_t$ = inventory at the end of Period t.

Assume that $D_t$ is the demand requirement in Period t, where each $D_t$ is a
nonnegative integer known at the beginning of the planning horizon.

Suppose that in each Period t the cost incurred depends only on the production
quantity $x_t$ and the ending inventory level $i_t$, and possibly on Period t itself. Let
the function $C_t(x_t, i_t)$ denote the appropriate cost relationship for Period t. Then
the objective function can be written as

(1) minimize $\sum_{t=1}^{N} C_t(x_t, i_t)$.

Several constraints are imposed on the policy variables $x_t$ and $i_t$. We restrict
production to be integer-valued:

(2) $x_t = 0, 1, 2, 3, \ldots$ for each Period t (integer-valued production levels).

We assume that the management desires a policy in which the inventory level is
zero at the end of Period N:

(3) $i_N = 0$ (no ending inventory).

Finally, we stipulate that each period’s demand must be entirely satisfied on
time. This condition can be imposed by means of two constraints. The first might
be called an “accounting identity” since it states that

    inventory at the end of Period t = inventory entering Period t
                                       plus production in Period t
                                       less demand in Period t,

or, symbolically,

$i_t = i_{t-1} + x_t - D_t$.

We find it more convenient to rearrange the terms in this relationship, and express
the constraint as

(4) $i_{t-1} + x_t - i_t = D_t$ at each Period t, for t = 1, 2, ..., N,

where $i_0$ is a specified level of initial inventory at the beginning of the planning
horizon.

The second constraint we impose to ensure that Dependable meets its requirements
on time is that each period’s entering inventory and production must
always be large enough to make ending inventory a nonnegative quantity. Actually,
we also want to restrict inventory levels to be integer-valued (which is a harmless
assumption given that demands and production levels are integer-valued).
Thus, we require

(5) $i_t = 0, 1, 2, 3, \ldots$ at each Period t, for t = 1, 2, ..., N - 1.

Observe that (4) is a linear restriction. If each cost function $C_t(x_t, i_t)$ is linear,
then (as shown in the advanced material below) the model is equivalent to a
network problem and is easily solved by the methods in the preceding chapter.
But in most actual applications of production models, the cost functions are
nonlinear. For example, a costly setup may be required to produce a batch of items,
so that the cost of producing the first unit is greater than the incremental cost of
producing subsequent units. And when production exceeds the normal capacity
level during a period, then the incremental cost may again increase due to the use
of overtime.

To cope with such nonlinearities in each $C_t(x_t, i_t)$, we turn to a dynamic
programming version of the problem.
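Before turning to the recursion, it may help to see constraints (2) through (5) checked mechanically for a proposed schedule. The following is a minimal sketch, assuming production and demand are given as Python lists in calendar order; the function name ending_inventories and the sample numbers are illustrative and do not come from the text.

```python
def ending_inventories(production, demand, initial_inventory=0):
    """Apply i_t = i_{t-1} + x_t - D_t period by period.

    Returns the list [i_1, ..., i_N], or raises ValueError if some demand
    is not met on time (negative inventory) or if i_N is not zero.
    """
    if len(production) != len(demand):
        raise ValueError("need one production quantity per period")
    levels, inventory = [], initial_inventory
    for t, (x, d) in enumerate(zip(production, demand), start=1):
        if x < 0 or x != int(x):
            raise ValueError(f"production in Period {t} must be a nonnegative integer")
        inventory = inventory + x - d          # accounting identity (4)
        if inventory < 0:
            raise ValueError(f"demand in Period {t} is not met on time")
        levels.append(inventory)
    if levels and levels[-1] != 0:
        raise ValueError("ending inventory must be zero, as in (3)")
    return levels


# Hypothetical data: a demand of 3 in each of four periods.
print(ending_inventories([4, 5, 0, 3], [3, 3, 3, 3]))   # [1, 3, 0, 0]
```

A schedule that passes this check is feasible; whether it is optimal is the question the dynamic programming recursion answers.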

▶ Let N = 4, and write out the equations (4) for t = 1, 2, 3, and 4. You should get the
inventory constraints technology table shown in Fig. 8.6. Now on scratch paper, add
these four equations to form a fifth equation, and then rewrite each equation in Fig. 8.6
after multiplying through by -1. Verify that the resulting system of five equations
yields a network structure, as shown in Fig. 8.7. ◀

    Period    x1   i1   x2   i2   x3   i3   x4
      1        1   -1                              = D1 - i0
      2             1    1   -1                    = D2
      3                       1    1   -1          = D3
      4                                 1    1     = D4

Figure 8.6. Inventory Constraints.



Dynamic characterization. Recall in the stagecoach problem that
the computational idea was to start at the end (no more stages to go) and work
back to the beginning. We use the same approach for this problem. Here, the end
is when there is only one period left in the planning horizon, and the beginning is
when there are N periods to go.

In writing the mathematical formulas, we will find it convenient to use an
indexing system in which the subscript 1 denotes the end of the horizon and the
subscript N the beginning. Specifically, we define

$d_n$ = the demand requirement in that period when there are n more
periods to go,

$c_n(x, j)$ = cost of producing x and having j items of ending inventory in that
period when there are n more periods to go.

In this notation, $d_1 = D_N$ and $d_N = D_1$. Similarly, $c_1(x, j) = C_N(x, j)$.

For example, if the periods are months, N = 4, and the beginning of the horizon
is January, then $D_1$ is January’s demand and $D_4$ is April’s. In the formulas, we
want to use the reverse numbering system so that $d_4$ is January’s demand and $d_1$
is April’s. Thus $d_2$, the demand when there are two months to go until the end of
the horizon, refers to March’s requirements.
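The reverse numbering amounts to the rule $d_n = D_{N+1-n}$. A small sketch makes the bookkeeping concrete; the monthly figures are hypothetical, and stages_to_go is an illustrative name.

```python
def stages_to_go(demand_by_period):
    """Map D_1..D_N (calendar order) to {n: d_n}, where d_n = D_{N+1-n}."""
    N = len(demand_by_period)
    return {n: demand_by_period[N - n] for n in range(1, N + 1)}


# Hypothetical monthly demands, January through April.
D = {"January": 10, "February": 20, "March": 30, "April": 40}
d = stages_to_go(list(D.values()))
print(d[2])   # 30, March's requirement
print(d[4])   # 10, January's requirement
```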

Can you guess what determines the state of the production system at the start
of any period? The answer is entering inventory. Knowing how you arrived at this
inventory level has no relevance to your current production decision. With this in
mind, we define

$f_n(i)$ = minimum policy cost when entering inventory is at level i with n more
periods to go,

$x_n(i)$ = a production level yielding $f_n(i)$.

Since inventory at the end of the horizon is 0, according to (3), you can write

(6) $f_0(0) = 0$ (n = 0).

Next look at n = 1. Entering inventory, i, can be any integer amount between the
limits of 0 and $d_1$; but regardless of the specific level, the production amount must
be $d_1 - i$ so that all of the final period’s demand is met. It follows then that

(7) $f_1(i) = c_1(d_1 - i, 0)$ for i = 0, 1, ..., $d_1$.

Going on to n = 2, observe that if entering inventory is designated by i and the
production level by x, then the associated cost is

$c_2(x, i + x - d_2) + f_1(i + x - d_2)$,

assuming that you act optimally for n = 1. Note that the quantity $i + x - d_2$
is simply inventory at the end of the period. The value for i can be any integer
amount between 0 and $d_1 + d_2$. Explain why. Given i, the integer value of x
must be at least as large as $d_2 - i$ in order to meet the period’s demand
requirement, but no larger than $d_1 + d_2 - i$ because inventory at the end of the
horizon must be 0. An optimal x is one that minimizes the above sum. This analysis
of n = 2 can be summarized by the computation

$f_2(i) = \min_x [\, c_2(x, i + x - d_2) + f_1(i + x - d_2) \,]$,

where i = 0, 1, ..., $d_1 + d_2$, and the minimization is over only nonnegative
integer values of x in the range $d_2 - i \le x \le d_1 + d_2 - i$.

As in the stagecoach problem, once you know the values for $f_2(i)$, you can find
the values for $f_3(i)$, and so on, until you eventually calculate $f_N(i_0)$, where $i_0$ is
initial inventory, as before. The general recursion is written as

(8) $f_n(i) = \min_x [\, c_n(x, i + x - d_n) + f_{n-1}(i + x - d_n) \,]$ for n = 1, 2, ..., N,

where i = 0, 1, ..., $d_1 + \cdots + d_n$ and the minimization is over only nonnegative
integer values of x in the range $d_n - i \le x \le d_1 + d_2 + \cdots + d_n - i$.

Observe that by letting entering inventory, i, be the state variable, the only
independent decision variable in the recursion (8) is x, for ending inventory is
merely $(i + x - d_n)$. Note that since $f_0(0)$ and $f_1(i)$ are easily computed in (6)
and (7), it is straightforward to calculate, in turn, $f_2(0), f_2(1), \ldots, f_2(d_1 + d_2)$, then
$f_3(0), \ldots, f_3(d_1 + d_2 + d_3)$, continuing, for successively larger values of n,
eventually to $f_{N-1}(0), \ldots, f_{N-1}(d_1 + d_2 + \cdots + d_{N-1})$, and finally to $f_N(i_0)$.

To find an optimal schedule, you then check what production level $x_N(i_0)$
yielded the value for $f_N(i_0)$; this is an optimal decision at the start of the horizon.
At the next stage, the entering inventory level will be $i_0 + x_N(i_0) - d_N$. Find a
production level that yields the value for $f_{N-1}(i_0 + x_N(i_0) - d_N)$, and so on. The
process will be clear when you study the example in the next section.
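The recursion (8) and the subsequent trace of an optimal schedule can be sketched directly in Python. This is a sketch under assumptions, not the text's tabular layout: demands are passed as a list d with d[n] the demand when n periods remain (d[0] is unused), the cost function c(n, x, j) is supplied by the caller, initial inventory is assumed not to exceed total demand, and ties are broken in favor of the smaller production level. The sample cost function is hypothetical.

```python
from functools import lru_cache


def solve_inventory(d, c, i0=0):
    """Backward recursion (8).

    d[n] is the demand with n periods to go (d[0] is unused) and
    c(n, x, j) is the cost of producing x and ending with j units.
    Returns (f_N(i0), schedule in calendar order).
    """
    N = len(d) - 1

    @lru_cache(maxsize=None)
    def f(n, i):
        if n == 0:
            return (0, None) if i == 0 else (float("inf"), None)
        remaining = sum(d[1:n + 1])            # d_1 + ... + d_n
        best = (float("inf"), None)
        for x in range(max(0, d[n] - i), remaining - i + 1):
            j = i + x - d[n]                   # ending inventory
            total = c(n, x, j) + f(n - 1, j)[0]
            if total < best[0]:
                best = (total, x)
        return best

    schedule, i = [], i0
    for n in range(N, 0, -1):                  # trace from the first period
        x = f(n, i)[1]
        schedule.append(x)
        i = i + x - d[n]
    return f(N, i0)[0], schedule


def cost(n, x, j):
    # Hypothetical cost: setup 13, unit cost 2, holding cost 1 per unit.
    return (13 + 2 * x if x > 0 else 0) + j


print(solve_inventory([0, 3, 3], cost))   # (28, [6, 0])
```

With no limit on production, the two-period example produces everything in the first period, because one setup plus three units of holding cost is cheaper than two setups.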

You should pause here to make certain you understand what has been done to
characterize the model in dynamic programming terms. The problem is being
viewed in stages, where n denotes the number of stages (here periods) until the
end of the final period. To illustrate, suppose again that N = 4 and the periods
are January, February, March, and April, so that n = 1 refers to April and n = 4
to January. The January requirements are denoted by $d_4$ in the dynamic
programming recursion (8). Similar notation is used for the cost functions. There is
nothing really subtle so far.

What is novel is letting the level of entering inventory describe the state when
there are n periods left to go. Continuing with the four-month illustration, observe
that given the amount of inventory at the beginning of April and that month’s
demand requirement, you must produce exactly the difference between these
two amounts. This fact is recognized in (7). Thus the optimization in April is
trivial, given entering inventory.

By the same token, given the amount of inventory at the beginning of March 
and that month’s demand requirement, you must produce at least the difference 
between these two amounts. 

Your production decision x in March, in turn, affects the amount of entering
inventory in April. Specifically, what enters April is $(i + x - d_2)$. Given this
amount, you act optimally in April. But you already completed April’s optimization
analysis at the previous stage. Therefore, in deciding the optimal March
production, you need compare only March costs plus the corresponding costs of
acting optimally after March. The entirety of these considerations is expressed by
the right-hand side of the dynamic programming recursion (8). The same line of
reasoning can then be repeated for February, and finally for January.

▶ The recursion (8) is equivalent to the method of finding a shortest route in an acyclic
network. You might wonder, “What does such a network look like for this model?”
Because of the nonlinear cost functions, it is not one like Fig. 8.7. A diagrammatic network
equivalent of (8) for an actual problem can be dense with nodes and arcs. The underlying
structure can be illustrated adequately, however, by the following simple case:

(i) $d_n = 1$ for n = 1, 2, 3, 4 (stationary demand)

(ii) x = 0, 1, 2 (restricted production levels).

The network is pictured in Fig. 8.8. Each node, designated by the symbol (i, n),
corresponds to a possible level of the state variable i when there are n periods remaining.
The five nodes on the left designate the different possibilities for initial inventory $i_0$ =
0, 1, 2, 3, 4. The single node on the right reflects the restriction that ending inventory
is 0.

Consider an arc between (i, n) and (j, n - 1). Since $j = i + x - d_n$, then $x =
j - i + d_n$ at the period when there are n periods to go. One arc appears for each feasible
action. The associated cost $c_n(x, j)$ appears with the abbreviated symbol nxj; for
example, $c_4(0, 1)$ is indicated by 401.

The diagram certainly appears more complicated than Fig. 8.7. This is because the
problem is more complicated when the $C_t(x_t, i_t)$ are nonlinear.
The recursion (8) is backward in time, in that it commences calculations with
the last period of the horizon and moves toward the first period. In the illustration for
N = 4, the computation of $f_1(i)$ refers to April. It is also possible to construct a forward
algorithm that starts with the first period and moves toward the last period of the
horizon. Here, an assumption about initial inventory $i_0$ is needed. Suppose $i_0 = 0$. The
approach is to let

(iii) $g_n(i)$ = minimum policy cost for Periods 1, 2, ..., n when inventory
at the end of Period n is at level i.

Figure 8.8. Inventory and Production Network. (Columns of nodes, left to right:
January, February, March, April, Terminal. Node designation: (i, n). Arc cost
designation: $c_n(x, j)$ = nxj.)

Then

(iv) $g_0(0) = 0$
     $g_1(i) = C_1(D_1 + i, i)$ for i = 0, 1, ..., $D_2 + D_3 + \cdots + D_N + i_N$

and

(v) $g_n(i) = \min_x [\, g_{n-1}(i - x + D_n) + C_n(x, i) \,]$ for n = 1, 2, ..., N,

where i = 0, 1, ..., $D_{n+1} + \cdots + D_N + i_N$ and the minimization is over the values
x = 0, 1, ..., $D_n + i$. What is being sought is the value of $g_N(i_N)$, where $i_N$ is a
specified level for inventory at the end of the horizon.

Observe in (v) that $i - x + D_n$ is the amount of inventory entering Period n. Whereas
an optimal value of x giving rise to $f_n(i)$ in (8) referred to production in the period with n
periods left to go, an optimal value of x giving rise to $g_n(i)$ in (v) refers to production in
Period n. Thus, once the values of $g_n(i)$ are tabulated, a solution is found by starting in
Period N with the value of $i_N$, determining a corresponding best $x_N$, noting what the
resulting entering inventory must be, finding a corresponding optimal $x_{N-1}$, etc. ◀
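The forward algorithm (iii) through (v) can be sketched in the same spirit. Assumptions of this sketch: initial inventory is 0, demands D are listed in calendar order, the cost function C(t, x, i) is supplied by the caller, and states that cannot be reached from zero initial inventory are treated as having infinite cost. The function name forward and the sample cost function are illustrative.

```python
INF = float("inf")


def forward(D, C, i_end=0):
    """Forward recursion (v) with i_0 = 0.

    D[t-1] is the demand in Period t and C(t, x, i) the cost in Period t
    of producing x and ending with i units.
    Returns (g_N(i_end), schedule in calendar order).
    """
    N = len(D)
    tables = [{0: (0, None)}]                  # g_0: only inventory 0 is possible
    for n in range(1, N + 1):
        later = sum(D[n:]) + i_end             # D_{n+1} + ... + D_N + i_N
        new = {}
        for i in range(later + 1):
            best = (INF, None)
            for x in range(D[n - 1] + i + 1):
                entering = i - x + D[n - 1]    # inventory entering Period n
                prev = tables[-1].get(entering, (INF, None))[0]
                if prev + C(n, x, i) < best[0]:
                    best = (prev + C(n, x, i), x)
            new[i] = best
        tables.append(new)
    schedule, i = [], i_end
    for n in range(N, 0, -1):                  # trace back from Period N
        x = tables[n][i][1]
        schedule.append(x)
        i = i - x + D[n - 1]
    return tables[N][i_end][0], schedule[::-1]


def cost(t, x, i):
    # Hypothetical cost: setup 13, unit cost 2, holding cost 1 per unit.
    return (13 + 2 * x if x > 0 else 0) + i


print(forward([3, 3], cost))   # (28, [6, 0])
```

For the same hypothetical data, a backward pass gives the same minimum cost; the two algorithms differ only in the direction of calculation.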

8.4 NUMERICAL SOLUTION 

Now that you have a formulation of the inventory model, you are ready to solve
the Dependable Manufacturing Company’s specific problem. We perform the
numerical calculations in this section. In subsequent sections, we analyze the effect
on an optimal policy of lengthening the time horizon and show how an optimal
policy can be drastically influenced by the imposition of a constraint. This example
will demonstrate just how perplexing so-called combinatorial problems can be,
even when they are small scale.

In fairness we must explain that the particular numbers below have been carefully
chosen to produce the dynamic phenomena we wanted to exhibit; you should
not conclude that the dynamic behavior illustrated is typical of inventory processes. 
You also should realize, however, that it would have been difficult to forecast the 
exact nature of the results prior to the dynamic programming analysis. This 
means that in a real situation you may not be able to judge beforehand whether 
an optimal policy is extremely sensitive to the length of the planning horizon. 

To keep the analysis simple, we assume stationarity over time in the demand
requirements and the cost functions. Specifically, let

(1) $D_t = 3$ for all periods (stationary demand).

Assume that the cost function is simply the sum of a term due to production and
a linear inventory holding cost:

(2) $C_t(x_t, i_t) = C(x_t) + h i_t$ for all periods, where

(3) C(0) = 0  C(1) = 15  C(2) = 17  C(3) = 19  C(4) = 21  C(5) = 23

(4) h = 1.

Thus production cost can be viewed as consisting of a setup cost of 13 plus a
variable unit cost of 2 per item produced. Holding cost is just 1 times the level of
ending inventory.
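The cost data in (2) through (4) can be written as two small functions. This is a minimal sketch; the function names production_cost and period_cost are illustrative.

```python
def production_cost(x, setup=13, unit=2):
    """C(x): no cost when nothing is produced, else setup plus unit cost."""
    return 0 if x == 0 else setup + unit * x


def period_cost(x, ending_inventory, holding=1):
    """C_t(x_t, i_t) = C(x_t) + h * i_t, as in (2)."""
    return production_cost(x) + holding * ending_inventory


print([production_cost(x) for x in range(6)])   # [0, 15, 17, 19, 21, 23]
print(period_cost(5, 2))                        # 25
```

The jump from C(0) = 0 to C(1) = 15 is the setup cost that makes the function nonlinear.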

An added complication is that the Dependable Manufacturing Company has
limited production capacity and storage space. In particular, it cannot produce
more than five units in a period and hold more than four units at the end of the
period:

(5) $x_t = 0, 1, \ldots, 5$ and $i_t = 0, 1, \ldots, 4$ for all periods.

Notice that because the setup cost is high relative to the other costs, an optimal
schedule will attempt to avoid frequent production. But since production $x_t$
cannot exceed 5 and demand is 3, a schedule cannot increase inventory by more
than 2 each period. Thus if initial inventory is 0, two setups are required in the
first two periods. It is not at all obvious what is the best schedule of setups and
production for longer time horizons. The dynamic programming analysis will provide
the answers.

Dynamic formulation. Given the above data for the Dependable Manufacturing
Company, you can write the appropriate dynamic recursion to reflect
the specifics of the problem. Remember that

$f_n(i)$ = minimum policy cost when entering inventory is at level i with n more
periods to go,

$x_n(i)$ = a production level yielding $f_n(i)$.

For n = 1,

(6) $f_1(i) = C(3 - i)$ and $x_1(i) = 3 - i$ for i = 0, 1, 2, 3,

since the level of inventory at the end of the horizon is 0. The general recursion is

(7) $f_n(i) = \min_x [\, C(x) + 1 \cdot (i + x - 3) + f_{n-1}(i + x - 3) \,]$ for n = 2, 3, ...,

where i = 0, 1, 2, 3, 4 and the minimization is over only nonnegative integer
values in the range $3 - i \le x \le$ minimum (5, 7 - i). The production constraint
in (5) keeps x from exceeding 5; and the end of period inventory constraint in (5)
keeps x from exceeding 7 - i.
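The recursion for the Dependable data is small enough to tabulate by program. The sketch below is one way to do so, under the assumption that an entering inventory from which the horizon cannot end at zero (for example, four units with one period to go) is given infinite cost. The names tables, f, and xs are illustrative; xs records every optimal production level so that ties remain visible.

```python
INF = float("inf")
DEMAND, MAX_X, MAX_I, HOLD = 3, 5, 4, 1


def C(x):
    """Production cost from (3): setup of 13 plus 2 per unit."""
    return 0 if x == 0 else 13 + 2 * x


def tables(horizon):
    """Return f and xs with f[n][i] = f_n(i) and xs[n][i] = all optimal x_n(i)."""
    f = [[0] + [INF] * MAX_I]          # f_0: inventory must end at 0
    xs = [[[] for _ in range(MAX_I + 1)]]
    for n in range(1, horizon + 1):
        row, arg = [], []
        for i in range(MAX_I + 1):
            sums = {}
            for x in range(max(0, DEMAND - i), min(MAX_X, MAX_I + DEMAND - i) + 1):
                j = i + x - DEMAND     # ending inventory
                if f[n - 1][j] < INF:
                    sums[x] = C(x) + HOLD * j + f[n - 1][j]
            low = min(sums.values(), default=INF)
            row.append(low)
            arg.append([x for x in sums if sums[x] == low])
        f.append(row)
        xs.append(arg)
    return f, xs


f, xs = tables(6)
for n in range(1, 7):
    print(n, f[n], xs[n])
```

Each printed line gives, for one value of n, the five values $f_n(0), \ldots, f_n(4)$ followed by the corresponding optimal production levels.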

In order to perform the analyses of interest, it is necessary to have the values of
$f_n(i)$ available; therefore this task is completed next. The format of the numerical
solution tables is very similar to that used in the stagecoach problem. There is one
table for each Stage n. A row in a table corresponds to a value of entering inventory
i, and a column to a production level x. Because demand has to be met each period
and the inventory at the end of a period cannot exceed 4, certain entries in the
tables are eliminated from consideration; they represent infeasible combinations.
The entries appearing in the body of a table are the sum of costs for the immediate
period and cost of an optimal policy in the subsequent periods. For each row, the
minimum of these sums is shown at the right under the column designated by
$f_n(i)$ along with an associated optimal production level $x_n(i)$.

The $f_1(i)$, given by (6), are tabled in Fig. 8.9. The function $f_2(i)$ is computed in
Fig. 8.10. Note the detailed construction of the table. There are five rows, one for
each feasible value of i. Several of the possibilities are blocked out. For example,
if i = 1, then $x \ge 2$ in order that all demand be met. If i = 4, then $x \le 2$ in order
that inventory at the end of the horizon is zero. The first entry in each column x is
the value C(x) from (3). The second entry is the holding cost h = 1 times the level
of ending inventory. For example, if i = 3 and x = 0, then ending inventory is 0,
and 0 appears as the second term in the sum for this case. If i = 3 and x = 1, then
ending inventory is 1, so 1 appears as the second term in the sum for this case.
And so forth along the i = 3 row. Finally, the third term is the value of
$f_1(i + x - 3)$ calculated previously in Fig. 8.9.

Given a level i, $f_2(i)$ is the minimum sum in the body of the table for that row,
and $x_2(i)$ is a corresponding production level. Thus if i = 1 with two periods
left to go, the best production level is 5, which yields a cost of 26 for these two
periods. Any other value for x is more costly.
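The three-term construction of a row can be reproduced in a few lines. The sketch below builds the feasible entries for one value of entering inventory i when n = 2, using $f_1(i) = C(3 - i)$; the function name row is illustrative.

```python
def C(x):
    return 0 if x == 0 else 13 + 2 * x


f1 = {i: C(3 - i) for i in range(4)}       # f_1(i) = C(3 - i) for i = 0, 1, 2, 3


def row(i):
    """Feasible entries (C(x), holding cost, f_1) for entering inventory i, n = 2."""
    entries = {}
    for x in range(max(0, 3 - i), 6):
        j = i + x - 3                       # ending inventory
        if j in f1:                         # j = 4 cannot be run down to 0
            entries[x] = (C(x), 1 * j, f1[j])
    return entries


for x, terms in row(1).items():
    print(x, terms, sum(terms))
# 2 (17, 0, 19) 36
# 3 (19, 1, 17) 37
# 4 (21, 2, 15) 38
# 5 (23, 3, 0) 26
```

The minimum of the four sums is 26 at x = 5, in agreement with the value quoted for i = 1.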


Figure 8.9. The Dependable Manufacturing Company Model (n = 1): $f_1(i) = C(3 - i)$.

Figure 8.10. Dependable Manufacturing Company Model (n = 2).

Figure 8.11. Dependable Manufacturing Company Model (n = 3).

Figure 8.12. Dependable Manufacturing Company Model.

The calculations yielding $f_3(i)$ are shown in Fig. 8.11. Here $C(x) + 1 \cdot (i + x - 3)$
is the first term, and $f_2(i + x - 3)$ from Fig. 8.10 is the second. The remaining
values of $f_n(i)$, for n = 4, 5, 6, are summarized in Fig. 8.12. You should test
your understanding of the dynamic programming recursive calculations by
constructing a complete table, laid out like Fig. 8.11, to produce $f_4(i)$. Compare your
results with those in Fig. 8.12. Observe that the production levels 3 and 4 are both
optimal when n = 4 and entering inventory is 0.

8.5 SENSITIVITY ANALYSIS 

The tabulated numerical results needed to obtain an optimal production policy 
also provide considerable information about the sensitivity of the solution to 
assumed values of the model’s parameters, such as the length of the planning 
horizon and the level of entering inventory. We examine sensitivity questions of 
this sort using the data for the Dependable Manufacturing Company contained 
in Figs. 8.9 through 8.12. 

Length of the planning horizon. For the sake of definiteness, suppose
each period in the model represents a month and that the first period is
January. We want to know how the optimal monthly figures change as the horizon
N increases, and, in particular, what happens to January production. The results,
based on Fig. 8.12, are shown in Fig. 8.13, under the assumption that the inventory
level at the beginning of January is 0.

Figure 8.13. Dependable Manufacturing Company Production Schedule
When $i_0 = 0$.

Figure 8.13 is constructed as follows. When N = 1, January production, 3,
is found from the first row of Fig. 8.12 under n = 1. When N = 2, January
production, 3, is found from the first row of Fig. 8.12 under n = 2. And so on. When
N = 6, January production is 4 and inventory entering February will then be
1 (= i + x - d = 0 + 4 - 3). Consequently, the February production level, 5,
is found in Fig. 8.12 for n = 5 with the new entering i = 1. This in turn means
that inventory entering March will be 3 (= i + x - d = 1 + 5 - 3), so that March
production is 0, as shown in Fig. 8.12 for n = 4 with the new entering i = 3.
The same line of reasoning establishes that April production is 4, since n = 3
with entering inventory of 0 (= i + x - d = 3 + 0 - 3). Given the April decision,
inventory entering May is 0 + 4 - 3 = 1, so that May production is 5 (n = 2).
Therefore June production 0 is optimal, since entering inventory is 1 + 5 - 3 = 3
with n = 1. Verify that minimum total cost when N = 6 is (21 + 1) +
(23 + 3) + (0 + 0) + (21 + 1) + (23 + 3) + (0 + 0) = 96, which appears
as $f_6(0)$ in Fig. 8.12.
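The month-by-month lookup just described can be automated. The sketch below recomputes the values $f_n(i)$ for the Dependable data and then traces a schedule from the first period; it assumes ties are broken in favor of the smaller production level, so where alternative optima exist it reports only one of them. The names solve and schedule are illustrative.

```python
INF = float("inf")
DEMAND = 3


def C(x):
    return 0 if x == 0 else 13 + 2 * x


def solve(N, max_x=5, max_i=4, hold=1):
    """f[n][i] and one optimal x[n][i] (smallest on ties) for n = 0..N."""
    f, best = [[0] + [INF] * max_i], [[None] * (max_i + 1)]
    for n in range(1, N + 1):
        f.append([INF] * (max_i + 1))
        best.append([None] * (max_i + 1))
        for i in range(max_i + 1):
            for x in range(max(0, DEMAND - i), min(max_x, max_i + DEMAND - i) + 1):
                j = i + x - DEMAND
                total = C(x) + hold * j + f[n - 1][j]
                if total < f[n][i]:
                    f[n][i], best[n][i] = total, x
    return f, best


def schedule(N, i0=0):
    """Return (production by month, total cost) for a horizon of N periods."""
    f, best = solve(N)
    plan, i = [], i0
    for n in range(N, 0, -1):        # n periods to go
        x = best[n][i]
        plan.append(x)
        i = i + x - DEMAND
    return plan, f[N][i0]


print(schedule(6))   # ([4, 5, 0, 4, 5, 0], 96)
```

Calling schedule with other horizons or starting inventories repeats the same trace for those cases.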

The policies in Fig. 8.13 show how the best production amount for January
depends on the length of the planning horizon. As the horizon length N increases
from 1 to 5, there exist optimal policies such that January production increases.
For N = 6, however, the best policy calls for January production of 4 as compared
to the amount 5 when N = 5. Thus initial production can either increase or
decrease as the planning horizon lengthens. For N = 4, there are two alternative
optimal policies. Figure 8.13 also exhibits how the cost per period depends on N.
Note that the cost per period does not steadily decrease, but fluctuates as N
increases from 2 to 6.

The policy for N = 5 deserves special attention. In this case, inventory is 
increased in January, February, and April. The May demand is thus filled by 
two units of April production as well as a unit produced in February. In this 
situation, it turns out to be optimal to bring in inventory to both February and 
April even though a setup cost is incurred in these two months. 

Initial inventory. Here we study how an optimal policy depends on the
amount of starting inventory. Examine Fig. 8.14 to see how the January production
figure varies with different levels of initial inventory. When the horizon
length N = 1, each additional unit of initial inventory brings about a unit reduction
in January production. When N = 2, 3, 4, and 6, however, the first unit
of initial inventory actually causes January production to increase. Depending on
the horizon length N, the second unit may cause January production to fall
(when N = 2, production does not fall to the previous level) or remain the same
(N = 4).

The final two columns of Fig. 8.14 tabulate the reduction in total cost when
initial inventory increases. For example, consider the horizon N = 2. If initial
inventory is 0, the total cost is 38, shown in Fig. 8.12. One unit of initial inventory
brings the total cost down to 26 and another unit to 24. Thus the value of the first
unit of inventory is 12 and the next 2, as exhibited in Fig. 8.14. Notice that the
value of initial inventory depends considerably on the length of the time horizon,
and whether the item is the first or second unit. Using the information in Fig.
8.12, trace the optimal policies for horizons N = 4 and 6 and initial inventory
levels $i_0$ = 1 and 2 to see why the incremental values of inventory differ as they do.
(Incidentally, you will find, when N = 4 and i = 2, that four units of inventory
enter February, so that if inventory at the end of each period, $i_t$, were constrained
to be strictly less than four, this solution would be ruled out, and total cost would
increase.)
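The incremental value of a unit of initial inventory is simply a difference of two minimum policy costs, $f_N(i) - f_N(i + 1)$. The sketch below computes these differences for the first and second unit using a memoized form of the recursion; it is an illustration for the Dependable data, and the function names f and unit_values are illustrative.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def f(n, i):
    """Minimum policy cost f_n(i) for the Dependable data."""
    if n == 0:
        return 0 if i == 0 else float("inf")
    return min(
        (0 if x == 0 else 13 + 2 * x) + (i + x - 3) + f(n - 1, i + x - 3)
        for x in range(max(0, 3 - i), min(5, 7 - i) + 1)
    )


def unit_values(N):
    """Cost reduction from the first and from the second unit of initial inventory."""
    return f(N, 0) - f(N, 1), f(N, 1) - f(N, 2)


for N in range(1, 7):
    print(N, unit_values(N))
# 1 (2, 2)
# 2 (12, 2)
# 3 (3, 2)
# 4 (3, 10)
# 5 (5, 2)
# 6 (3, 2)
```

The line for N = 2 reproduces the values 12 and 2 worked out in the text; the large second-unit value at N = 4 corresponds to the policy in which four units of inventory enter February.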

*Long-run analysis of short-term policies. So far you have
investigated what happens when the planning horizon N is the same as the total
span over which the item is demanded. That is, the policies in Figs. 8.12 and 8.13
refer to the situation in which demand $D_t = 0$ for t > N.

Now suppose $D_t = 3$ beyond the planning horizon N, but the current monthly
production amount is determined by looking only N periods ahead. For example,
if N = 4, then January production is determined by taking into account inventory
at the beginning of January and selecting the corresponding optimal policy as if
the total span were merely January through April. Then, in February, the process
is repeated. Given entering inventory, February production is determined as if
the horizon were N = 4 (February through May). This procedure, sometimes
called a rolling schedule, is often employed in industrial situations. The
question examined is how well the procedure performs for the example here.

To begin, suppose that the true span for which $D_t = 3$ is 10 periods, and $D_t = 0$
for t > 10. For each planning horizon N, Fig. 8.15 shows the resulting schedule
and cost, assuming initial inventory in January is 0. The truly optimal policy
(N = 10) is to repeat twice the policy for N = 5 in Fig. 8.13. However, no rolling
schedule produces this optimal schedule. When N = 4, several possible rolling
schedules arise, since if entering inventory is 0, production of either 3 or 4 is indicated.
Only two schedules are shown, and they produce different total costs. Finally,
note that lengthening the planning horizon for a rolling schedule can actually
be harmful. The schedule for N = 3 is less costly than that for N = 5, although
neither is optimal.
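A rolling schedule over the 10-period span can be simulated with the same recursion. The sketch below rests on two assumptions that are not spelled out in the text: because demand is 0 after period 10, a look-ahead of N periods is treated as a look-ahead over the periods that remain whenever fewer than N are left; and ties are broken in favor of the smaller production level, so for horizons with alternative optima only one of the possible rolling schedules is produced. The names f and rolling are illustrative.

```python
from functools import lru_cache

DEMAND, SPAN = 3, 10


def C(x):
    return 0 if x == 0 else 13 + 2 * x


@lru_cache(maxsize=None)
def f(n, i):
    """(f_n(i), smallest optimal x) with n periods to go."""
    if n == 0:
        return (0, None) if i == 0 else (float("inf"), None)
    return min(
        (C(x) + (i + x - DEMAND) + f(n - 1, i + x - DEMAND)[0], x)
        for x in range(max(0, DEMAND - i), min(5, 7 - i) + 1)
    )


def rolling(N, span=SPAN):
    """Simulate a rolling schedule that looks N periods ahead."""
    plan, cost, i = [], 0, 0
    for t in range(1, span + 1):
        n = min(N, span - t + 1)      # no demand beyond the span
        x = f(n, i)[1]
        j = i + x - DEMAND            # inventory carried into the next period
        cost += C(x) + j
        plan.append(x)
        i = j
    return plan, cost


print({N: rolling(N)[1] for N in (3, 5, 10)})   # {3: 163, 5: 165, 10: 158}
```

Under these assumptions the look-ahead of 3 periods costs less than the look-ahead of 5, and both cost more than the 10-period optimum, which is the pattern described above.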

Figure 8.15. Dependable Manufacturing Company Rolling Schedule
Operating Over a 10-Period Span ($i_0 = 0$).

Suppose that in fact $D_t$ has the same value of 3, and a rolling schedule procedure
is perpetually employed. Then assuming that initial inventory $i_0$ is equal to 0, the
resulting production patterns are the ever-repeating cycles exhibited in Fig. 8.16.
Given that $i_0 = 0$, the cost functions and various constraints imply that ending
inventory will never exceed the level 2. Therefore, if for two different planning
horizons the optimal production policies are identical for entering inventory levels
i = 0, 1, 2, they will lead to the same repeating production cycle. As a consequence,
we tabulate only these three policy values in Fig. 8.16. You can see
that only seven distinct optimal policies occur for N = 1, 2, ..., and these lead
to only four different repeating production cycles. (For certain horizons, there are
several optimal policies, so these values of N are repeated accordingly.)

Figure 8.16. Dependable Manufacturing Company Rolling Schedule
Operating Over an Unbounded Horizon ($i_0 = 0$). Note: Policies for $N \le 6$ are
found in Figure 8.12, and for N > 6 are found by extending that analysis. For all N,
$x_N(i) = 0$ when $i \ge 3$.

Notice that not until N = 10 does a rolling schedule yield an optimal solution, 
which is defined as a policy giving minimum cost per period. Even at N = 17
the resultant repeating production cycles are not optimal, and it can be shown that
one alternative policy for N = 22 incurs a cost of 16 per period. However, provided
the planning horizon is large enough (namely, N > 18), the optimal policy
for the unbounded horizon is also optimal for the finite horizon situation. This
illustration demonstrates that without careful analysis there is ample opportunity 
to make an error in judging the required horizon for planning. 
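
The rolling schedule procedure can be sketched in a few lines of Python. This is an illustration only, not the method used to prepare Figs. 8.15 and 8.16. It assumes the Dependable Manufacturing data quoted in this chapter: demand of 3 each period, production cost of 13 + 2x for x = 1, ..., 5, holding cost of 1 per unit of ending inventory, ending inventory of at most 4, and zero inventory at the end of each planning horizon. The function names are hypothetical, and ties among optimal decisions are broken by taking the smallest production level, which is an arbitrary choice.

```python
from functools import lru_cache

DEMAND = 3      # units required every period
MAX_PROD = 5    # production capacity per period
MAX_INV = 4     # upper bound on ending inventory
HOLD = 1        # holding cost per unit of ending inventory


def prod_cost(x):
    """Setup cost of 13 plus 2 per unit; no cost when nothing is produced."""
    return 0 if x == 0 else 13 + 2 * x


@lru_cache(maxsize=None)
def f(n, i):
    """Return (minimum cost, first-period production) with n periods to go and
    entering inventory i, requiring zero inventory at the end of the horizon."""
    if n == 0:
        return (0, None) if i == 0 else (float("inf"), None)
    best = (float("inf"), None)
    for x in range(MAX_PROD + 1):       # ascending order: ties keep the smallest x
        j = i + x - DEMAND              # ending inventory for this period
        if 0 <= j <= MAX_INV:
            cost = prod_cost(x) + HOLD * j + f(n - 1, j)[0]
            if cost < best[0]:
                best = (cost, x)
    return best


def rolling_schedule(horizon, periods, i0=0):
    """Each period, solve the problem for the given horizon and implement
    only its first decision; return the decisions and the average cost."""
    inv, plan, total = i0, [], 0
    for _ in range(periods):
        x = f(horizon, inv)[1]
        inv = inv + x - DEMAND
        total += prod_cost(x) + HOLD * inv
        plan.append(x)
    return plan, total / periods
```

For example, rolling_schedule(5, 10) returns the ten decisions generated with a planning horizon of N = 5, together with the average cost per period. Because of the tie-breaking assumption, the results for horizons that have several optimal policies need not match the particular schedules shown in the figures.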

In Chaps. 11 and 12 you will study how to determine optimal policies when 
the time span of operation actually is unbounded. 

Commentary. Although the particular numerical values for the example 
have been selected with some care, the general description of the situation is 
reasonable: production expense consists of a setup cost plus a unit cost; inventory 
holding cost is linear (1 per unit of ending inventory); and simple upper bounds 
are imposed on production and inventory. Nevertheless, an optimal policy changes 
drastically with various alterations in the planning horizon. It is impossible to 
suggest how often you will encounter such extreme planning horizon sensitivity, 
and how economically serious it is to mistakenly adopt a nonoptimal policy. 
But you ought to realize from the example that it is difficult to recognize the degree 
and significance of the sensitivity analysis without undertaking a precise dynamic 
analysis of the particular application. The operations research approach you have 
just learned is a fundamental tool for this type of study. 

As you know, a mathematical model often is general enough to cover a multitude
of real situations; a meaningful evaluation of the model in this chapter must
therefore be carried out in terms of what is being assumed rather than in terms of
the one particular environment described for the Dependable Manufacturing 
Company. The crucial assumptions are listed below. 

1. The demand forecast is accurate. Although a company can rarely forecast several 
months’ demand without error, the margin of error is often small enough for the 
deterministic model to yield a good approximation. (In such circumstances, a 
rolling schedule discussed in the advanced material above is usually followed; 
but sometimes several periods are allowed to elapse before making a schedule 
revision.) When the forecasting errors are substantial, models of the type described 
in Chap. 19 and Appendix II must be used. 
2. The manufacturing time is negligible. The assumption actually required is that
manufacturing time can be predetermined with negligible error. To illustrate,
suppose it always takes two weeks to produce a batch of items. Then if a schedule
found by the recursion formulas in this chapter indicates that February production
is to meet February's demand, the batch really would be started two weeks
earlier, in the second half of January.

Another facet of this assumption is that manufacturing time can be determined 
independently of other orders being processed. If several items are each produced 
on a single piece of equipment having limited capacity, then an agglomeration of 
schedules, each found independently by a dynamic programming model, may 
not be feasible. 

The models in this chapter are often useful when an item is simply ordered from 
an outside vendor, who keeps an inventory on hand. The time delay then becomes 
the delivery lag, and the cost function includes the purchase cost, instead of the 
production expense. 

3. Each period's cost depends on the amount produced and on the ending inventory; each
periodic demand is entirely satisfied. Without much trouble these two assumptions can
be altered to cover a much wider variety of situations. We discuss how in the 
advanced material below. 


►Without introducing any essentially new complications, the dynamic programming
recursion (8) in Sec. 8.3 can be written as

(i)    $$f_n(i) = \min_x \, [\, c_n(x, i, i + x - d_n) + f_{n-1}(i + x - d_n) \,].$$

This formulation then is able to encompass the situation in which inventory holding
cost is levied against average inventory:

(ii)    $$\frac{i + (i + x - d_n)}{2}.$$

It is also possible to let demand be fully backlogged until later periods. To do that, let

(iii)    $$i_t = i_{t-1} + x_t - D_t$$

as before, but now i_t is permitted to take on negative values to represent the amount
backlogged. Or, as another alternative, it is possible to let unfilled demand be fully lost
by defining

(iv)    $$i_t = \max(i_{t-1} + x_t - D_t, \, 0).$$

Then

(v)    $$f_n(i) = \min_x \, \{\, c_n(x, i, i + x - d_n) + f_{n-1}[\max(i + x - d_n, \, 0)] \,\}.$$

For both (iii) and (iv) the cost function c_n(x, i, i + x − d_n) would include a penalty
for the backlogging or lost demand condition. For example, one such cost function is

(vi)    $$c_n(x, i, i + x - d_n) = C(x) + h[\max(i + x - d_n, \, 0)] - p[\min(i + x - d_n, \, 0)],$$

where p > 0 represents the cost per unit of demand not met in a period. In the lost-sales
case, presumably p would include the drop in sales revenue. These possibilities
are examined in greater detail in Chap. 19 and Appendix II.
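
To make the shortage cost function and the two inventory transitions concrete, here is a minimal Python sketch. It is illustrative only: the function names are hypothetical, the production cost is taken to be the Dependable Manufacturing function of Sec. 8.4 (13 + 2x for x > 0), and the default values h = 1 and p = 10 are assumptions (p = 10 is the penalty suggested in exercise 33).

```python
def prod_cost(x):
    """Assumed production cost C(x): setup of 13 plus 2 per unit produced."""
    return 0 if x == 0 else 13 + 2 * x


def period_cost(x, i, d, h=1, p=10):
    """Period cost with a shortage penalty: production cost, plus h per unit
    of surplus, plus p per unit of demand not met in the period."""
    net = i + x - d                     # positive: surplus; negative: shortfall
    return prod_cost(x) + h * max(net, 0) - p * min(net, 0)


def next_inventory(x, i, d, lost_sales):
    """Inventory carried into the next period.
    Backlogging keeps a negative balance; lost sales truncate it at zero."""
    net = i + x - d
    return max(net, 0) if lost_sales else net
```

With entering inventory 1, no production, and demand 3, two units of demand go unmet. The period cost is then 2p = 20 under either convention, but the next period starts at −2 when demand is backlogged and at 0 when it is lost.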

In some instances, a firm may incur a cost due to changing production from one period 
to the next. This situation can also be handled by a dynamic programming recursion, 
but is more complex; it is discussed in Sec. 9.7. 

In truth, if demand can be forecast with sufficient accuracy so that a deterministic
inventory model is useful, then the models in this chapter usually turn out to be too
general. Typically, there is enough particular information about c_n(x, i, i + x − d_n)
that much stronger results concerning planning horizons and the form of optimal policies
can be derived. As a result, the computational burden of finding such policies is
considerably lighter than that indicated by the example in Sec. 8.4. Two important special
cases are considered in the next chapter to illustrate this point. ◄


*8.6 STRATEGIC PLANNING 


If you were the Vice-President in charge of production in the Dependable Manufacturing
Company, you might well be disturbed by the results of the preceding
analysis. In trying to pin down the causes for the peculiar sensitivity of optimal 
production policies, you might turn attention, on one hand, to the incentive for 
avoiding a high setup cost, and, on the other hand, to the restriction of staying 
below the maximum production level 5. You might consider what would happen if 
the production and inventory restrictions were dropped completely, and 


(1)    $$C(x_t) = \begin{cases} 0 & \text{for } x_t = 0 \\ 13 + 2x_t & \text{for } x_t > 0, \end{cases}$$

which gives the same cost-function values as before for x_t = 0, 1, ..., 5. Then
if initial inventory is 0 and demand D_t has the same value of 3, the optimal January
production can be shown to be 9, and this batch size should be manufactured
repeatedly whenever entering inventory is 0. The corresponding minimal cost per
period is 13 1/3, as compared to 15 4/5 in Fig. 8.16.
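
The batch size of 9 can be checked with a short calculation, sketched here in Python. The sketch assumes the data used throughout the example (setup cost 13, unit cost 2, holding cost 1 per unit of ending inventory, demand 3 per period) and considers only policies that produce when inventory is zero and cover a whole number k of periods. The names are hypothetical.

```python
from fractions import Fraction

SETUP, UNIT, HOLD, DEMAND = 13, 2, 1, 3


def cost_per_period(k):
    """Average cost when a batch of DEMAND * k units is produced every k
    periods, starting each cycle with zero inventory."""
    batch = DEMAND * k
    # Ending inventory after t periods of the cycle is batch - DEMAND * t.
    holding = sum(HOLD * (batch - DEMAND * t) for t in range(1, k + 1))
    return Fraction(SETUP + UNIT * batch + holding, k)


# Search a modest range of cycle lengths for the cheapest one.
best_k = min(range(1, 11), key=cost_per_period)
```

Under these assumptions best_k is 3, that is, a batch of 9 units, and cost_per_period(3) equals 40/3. Producing every period (k = 1) costs 19 per period, and producing every other period (k = 2) costs 14.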

In an effort to relieve the cost impact of the production constraint, suppose you 
discover there is a way to produce up to six units each period. However, the 
corresponding cost of six units is 


(2)    C(6) = 28.5,


which is incrementally large (5.5 = 28.5 — 23). Obviously, adding another possible 
production level cannot worsen the situation, because all the previous policies 
are still feasible. But does this new alternative make much of a difference? 

The revised values of f_n(i) and the corresponding x_n(i) appear in Fig. 8.17.
You should check your understanding of the recursive calculation process by performing
the computations for finding f_n(i) and x_n(i). Figure 8.18 exhibits the production
schedules implied by the new values, assuming that initial inventory in
January is 0. Comparing these figures with the corresponding amounts in Fig.
8.13, you see that the schedules for horizons N = 2, 4, and 6 are revised and
utilize the sixth unit of production. As a consequence, now the lowest cost per
period is for horizons N = 2, 4, and 6. For N = 1, 3, and 5, the optimal January
production is less than 6. Observe that for horizon N = 5 the previous solution
is still optimal.

Figure 8.17. Expanded Production Capability of Dependable Manufacturing
Company.

Figure 8.18. Dependable Manufacturing Company Production Schedule
When i_0 = 0.

The influence of initial inventory in January is shown in Fig. 8.19. If initial
inventory is either one or two units, the optimal January production is the same
as it was in Fig. 8.14. The incremental values of initial inventory have changed, 
however. 

If demand Dt = 3 for every t and a rolling schedule procedure is perpetually 
employed, then assuming that initial inventory is equal to 0, the patterns appear 
as in Fig. 8.20. Now if the planning horizon is an even number, an optimal 
schedule occurs. Otherwise the previous nonoptimal pattern in Fig. 8.16 results. 
Figure 8.19. Dependable Manufacturing Company Inventory Valuation.

Figure 8.20. Dependable Manufacturing Company Revised Rolling Schedule
Operating Over an Unbounded Horizon (i_0 = 0).

Take a closer look at the difference between the optimal repeating schedule for
the new situation and the previous one in Fig. 8.16. In the previous situation, there 
are six setups in a ten-period span, maximum inventory is 4, and inventory 
exists at the end of eight of the ten periods. Here a batch of 6 is produced every 
other period, and inventory fluctuates between 3 and 0. Thus in a ten-period 
span, there are only five setups, maximum inventory is 3, and inventory is positive 
at the end of only five of the periods. By easing the constraint on production, not 
only has cost per period improved, but the new optimal policy itself appears 
qualitatively more appealing. 
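
The comparison of the two repeating schedules can be reproduced with a small Python sketch. The revised cycle (6 units every other period) is stated in the text. The earlier cycle is taken here to be 5, 5, 0, 5, 0, which is an assumption: it is a pattern consistent with the setup and inventory counts quoted above, but Fig. 8.16 itself is not reproduced in this section. The function names are hypothetical.

```python
from fractions import Fraction


def prod_cost(x):
    """Production cost: 0 for no run, 28.5 for six units, else 13 + 2x."""
    if x == 0:
        return Fraction(0)
    if x == 6:
        return Fraction(57, 2)
    return Fraction(13 + 2 * x)


def cycle_summary(cycle, demand=3, hold=1):
    """Summarize a production cycle that starts and ends with zero inventory."""
    inv, total, setups, peak, positive = 0, Fraction(0), 0, 0, 0
    for x in cycle:
        inv += x - demand                   # ending inventory this period
        if inv < 0:
            raise ValueError("demand is not met")
        total += prod_cost(x) + hold * inv
        setups += 1 if x > 0 else 0
        peak = max(peak, inv)
        positive += 1 if inv > 0 else 0
    if inv != 0:
        raise ValueError("cycle does not return to zero inventory")
    return {
        "setups": setups,
        "max_inventory": peak,
        "periods_with_inventory": positive,
        "cost_per_period": total / len(cycle),
    }


previous = cycle_summary([5, 5, 0, 5, 0] * 2)   # assumed earlier cycle, ten periods
revised = cycle_summary([6, 0] * 5)             # batch of 6 every other period
```

Over ten periods the assumed earlier cycle has six setups and a cost per period of 79/5, whereas the revised cycle has five setups and a cost per period of 63/4.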


REVIEW EXERCISES 

Exercises 1 through 6 refer to the Stagecoach Problem in Sec. 8.2. 

1 (a) Enumerate the 14 distinct routes from the East to the West in Fig. 8.1. 

(b) Explain why it is not necessary to evaluate all of these routes when you employ 
the principle of optimality. 
2 (a) Suppose the stagecoach from State 7 to State 9 does not operate. What is the
best routing from the East to the West?

(b) Suppose a stagecoach service is inaugurated from State 3 to State 8. What is 
the smallest policy cost for this link such that Mark Off would still prefer the 
current routing? 

(c) Determine a range for the policy cost of the stagecoach from State 1 to State 3 
such that Mark Off prefers the current routing. Do the same for the policy cost 
from State 3 to State 7. Do the same for the policy cost from State 2 to State 6.

3 Mark Off suspects that it is the relative costs at each stage that determine an optimal 
routing. For example, he suspects that he should select the same routing if the policy 
costs are c_12 = 2 + δ, c_13 = 5 + δ, and c_14 = 4 + δ, where δ can be any constant
value. Is Mark Off correct? If not, state why. If so, discuss how his observation may 
be helpful. 

4 Mark Off’s father. Pop Off, lives in State 8. Find an optimal routing from the East 
to the West that goes through State 8. 

5 Mark Off’s brother, Buzz, lives in San Francisco and wants to travel East. Assume 
that the policy cost for each stagecoach remains the same in the easterly direction. 

(a) Explain why an optimal routing for Buzz is the reverse of the one Mark used. 

(b) Carry out the recursive calculations implied by (2) in Sec. 8.2, starting the computations
at State 1. (Note here that if n = 1 more stage to go, then Buzz is either
in State 2, 3, or 4; similarly, if n = 4 stages to go, then Buzz is in State 10.)

6 Stagecoach Problem. In each part, find an optimal route from the East to the West,
where the policy cost from State i to State j equals the c_ij shown in Fig. 8.1 plus the
modifications specified.

(a) c_ij + i (thus, the cost from State 1 to State 2 is 3, to State 3 is 6, and to State 4
is 2, etc.).

(b) c_ij + j (thus, the cost from State 1 to State 2 is 4, to State 3 is 8, and to State 4
is 5, etc.).

(c) c_ij + j − i (thus, the cost from State 1 to State 2 is 3, to State 3 is 7, and to
State 4 is 4, etc.).

(d) c_ij + i + j (thus, the cost from State 1 to State 2 is 5, to State 3 is 9, and to
State 4 is 6, etc.).

*7 Compare the algorithm (2) in Sec. 8.2 with the shortest-route algorithm (2) in Sec. 7.7.

Exercises 8 through 20 refer to the production and inventory model in Sec. 8.3. 

8 Each part below shows either the production quantity x_t in Period t or the inventory
level i_t at the end of Period t; you are to determine the implied levels of the unspecified
policy variable, either i_t or x_t. Assume that there are N = 6 periods, and that the
demand requirements are

D_1 = 10    D_2 = 15    D_3 = 8    D_4 = 25    D_5 = 12    D_6 = 30.

State whether the implied policy is feasible, that is, whether x_t ≥ 0 and i_t ≥ 0 for
every t. The symbol i_0 denotes initial inventory available at the start of Period 1.

(a) i_0 = 10, and x_t = 15 every period.

(b) i_0 = 5, x_1 = 20, and x_t = 15 for t = 2, 3, ..., 6.

(c) i_0 = 5, x_t = 15 for t = 1, 2, ..., 5, and x_6 = 20.

(d) 1*0 10 for ^ = 1, 2, 3, and = 23 for / = 4, 5, 6 . 

(e) i_0 = 0, i_1 = 15, i_2 = 20, i_3 = 25, i_4 = 15, i_5 = 5, i_6 = 0.

(f) i_0 = 10, i_1 = 15, i_2 = 20, i_3 = 25, i_4 = 15, i_5 = 5, i_6 = 0.

(g) i_0 = 30, i_1 = 15, i_2 = 20, i_3 = 25, i_4 = 15, i_5 = 5, i_6 = 0.

(h) i_0 = 0, i_1 = 10, i_2 = 10, i_3 = 10, i_4 = 10, i_5 = 10, i_6 = 0.

(i) i_0 = 35, i_1 = 35, i_2 = 35, i_3 = 35, i_4 = 35, i_5 = 35, i_6 = 0.

(j) i_0 = 35, i_1 = 35, i_2 = 35, i_3 = 35, i_4 = 35, i_5 = 10, i_6 = 0.

(k) How would you revise the plan if i_6 = 10 in parts (e), (g), (h), (i), and (j)?


9 Suppose the cost function C_t(x_t, i_t) is described as

$$C_t(x_t, i_t) = C(x_t) + h i_t,$$

where

$$C(x_t) = \begin{cases} 0 & \text{for } x_t = 0 \\ 6 + 10x_t & \text{for } x_t > 0, \end{cases}$$

and h = 2. Calculate the total production and inventory cost associated with the plans
in exercise 8,

(a) Part (a).
(b) Part (b).
(c) Part (c).
(d) Part (e).
(e) Part (f).
(f) Part (h).
(g) Part (j).


10 Assume you have values for production x_t and inventory i_t that satisfy all the constraints
(2) through (5) in Sec. 8.3. Recall that C_t(x_t, i_t) is the production and holding
cost function, and that ending inventory i_N = 0.

(a) Consider increasing the level of x_2 by 1, decreasing the level of x_3 by 1, and
revising inventory levels accordingly. Explain why the new plan is still feasible. 
Indicate the resultant incremental change in total cost. 

(b) Consider increasing the level of x_3 by 1, decreasing the level of x_2 by 1, and revising
inventory levels accordingly. Explain why the new plan may not be feasible. 
Give a condition involving the demand requirements that must be satisfied in 
order for the new plan to be feasible. Give an equivalent condition involving 
inventory. Assuming the new plan is feasible, indicate the resultant incremental 
change in total cost. 

(c) Consider increasing the level of x_2 by 1, decreasing the level of x_4 by 1, and revising
inventory levels accordingly. Explain why the new plan is still feasible.
Indicate the resultant incremental change in total cost. 

(d) Consider increasing the level of inventory i_2 by 1, and revising production levels
accordingly. Explain why the new plan may not be feasible. Give a condition
involving the demand requirements that must be satisfied in order for the new plan
to be feasible. Assuming the new plan is feasible, indicate the resultant incremental
change in total cost.

(e) Consider decreasing the level of inventory i, by 1, and revising production levels 
accordingly. Explain why the new plan may not be feasible. Give a condition 
that must be satisfied in order for the new plan to be feasible. Assuming the new
plan is feasible, indicate the resultant incremental change in total cost. 

*11 Let N = 6, and write equations (4) in Sec. 8.3 for t = 1, 2, ..., 6.

(a) Display the constraints of the model in a technology table, like that in Fig. 8.6.

(b) Derive an associated network structure, like that in Fig. 8.7. [Hint: add the six
equations in part (a) to form a seventh equation; then rewrite each equation in
part (a) after multiplying through by −1.]

(c) Suppose the cost function is linear,

$$C_t(x_t, i_t) = c_t x_t + h i_t,$$

where

c_1 = 1    c_2 = 4    c_3 = 3    c_4 = 5    c_5 = 7    c_6 = 4.

Let initial inventory i_0 = 0, and let the demand requirements be those in exercise
8 . Find optimal policies for A = 0, A = ^, A = 1^, and A = 4. Indicate alternative 
optimal policies when they occur. 

12 Give a verbal explanation of why the state of the system is completely summarized 
by the level of entering inventory each period. What are the assumptions about costs 
and production lags that permit this simple characterization of the state? 

13 The recursive calculation process in dynamic programming has been termed a bootstrap
approach in Sec. 8.2. Give a verbal explanation of the way the bootstrap
approach is applied to obtain a solution to the inventory model in Sec. 8.3. 

14 Suppose N = 6 and January is Period 1. Let d_n refer to the demand requirement in
that period when there are n more periods to go. To what month does </j refer? 
Similarly, (/«? d^? d^? 

15 Consider the recursion (8) in Sec. 8.3. Suppose that i = 0 when n periods remain.
What is the smallest feasible value for production, x, in that period? Suppose, instead,
that i = d_1 + d_2 + ... + d_n. What is the value for production in each of the n
remaining periods?

16 Consider the recursion (8) in Sec. 8.3. Suppose c_3(x, j) = 5x + 2j. Assume that starting
inventory i = 4 with n = 3 more periods to go, and that ending inventory must
not exceed 4. Let d_3 = 10. Find an optimal level of production and the associated
ending inventory level given the following.

(a) f_2(0) = 100    f_2(1) = 90    f_2(2) = 82    f_2(3) = 76    f_2(4) = 75.

(b) f_2(0) = 110    f_2(1) = 100    f_2(2) = 92    f_2(3) = 86    f_2(4) = 85.

(c) f_2(j) = 100 − 6j.

(d) f_2(j) = 100 − 9j.

(e) f_2(0) = 100    f_2(1) = 99    f_2(2) = 93    f_2(3) = 85    f_2(4) = 75.
17 Recall that x_n(i) is an optimal production level when entering inventory is at level i
with n more periods to go. Suppose d_n = 2 for every n and that the values for x_n(i) are

x_3(0) = 5    x_2(0) = 4    x_1(0) = 2
x_3(1) = 4    x_2(1) = 3    x_1(1) = 1
x_3(2) = 0    x_2(2) = 0    x_1(2) = 0
x_3(3) = 0    x_2(3) = 0.

In parts (a) through (d), indicate an optimal production plan and the associated
inventory levels when there are n = 3 more periods to go, and entering inventory
level i equals

(a) 0.
(b) 1.
(c) 2.
(d) 3.

(e) At n = 3 what is the incremental value of i = 1 versus i = 2? Of i = 4 versus
i = 3? [Use the symbol c_k(x, j) to denote the cost function in the period when
there are k more periods to go.]

*18 (a) Modify Fig. 8.8 to treat the case in which demand d_n = 2 for n = 1, 2, 3, 4.

(b) Modify Fig. 8.8 to treat the case in which the permissible production values
are x = 0, 2, 4.

(c) Explain how Fig. 8.8 is modified when you impose the constraint that inventory
at the end of each period cannot exceed the level 1.

(d) Explain how Fig. 8.8 can be simplified if you know for certain that inventory
entering January equals 1.

*19 Explain the connection between finding a shortest route in Fig. 8.8 and the recursive
calculations indicated by (8) in Sec. 8.3.

*20 In the network displayed in Fig. 8.8, reorient each arc in the opposite direction.
Assume that inventory entering January equals 1. Explain how finding a shortest
route from the node labeled “Terminal” to the node for January with inventory level 1
is the same process as applying recursion (iv) in the special material at the end of Sec.
8.3, and yields the value g_4(0).

Exercises 21 through 34 refer to the case of the Dependable Manufacturing 
Company described in Sec. 8.4. 

21 In each part below, construct a complete table, laid out like Fig. 8.11, showing f_n(i)
and x_n(i) for the indicated value of n. Use the values of f_{n−1}(i) that are given in Fig.
8.12.

(a) n = 4.

(b) n = 5.

(c) n = 6.

22 (a) Verify that the cost function C(x) given as (3) in Sec. 8.4 represents a setup cost of
13 plus a variable unit cost of 2 per item produced.

(b) Explain why the production level x in a period must be at least 3 − i, where i is
the level of entering inventory.

(c) Explain why the production level x in a period cannot exceed 7 − i, given the
restriction that ending inventory must be less than or equal to 4.

(d) Suppose initial inventory is zero. Calculate the average cost per period if the 
production pattern is to produce three units every period. If the pattern is to 
produce five units and then one unit. If the pattern is to produce five units,
five units, two units, and zero units. If the pattern is to produce five units, five 
units, zero units, and two units. If the pattern is to produce five units, two units, 
five units, and zero units. If the pattern is to produce four units and then two 
units. If the pattern is to produce four units, four units, four units, and zero units. 
Explain the cost tradeoffs among these different patterns. 

23 Construct a table like that in Fig. 8.13, and assume that initial inventory entering 
January is 

(a) i_0 = 1.
(b) i_0 = 2.
(c) i_0 = 3.
(d) i_0 = 4.

(e) Given your answers above, verify the incremental values of inventory shown in
Fig. 8.14, and extend the table for i_0 = 3 and i_0 = 4.

24 Construct a table like that in Fig. 8.12 given the constraint on ending inventory as 
indicated below. (Hint: you do not have to repeat all the computations in Fig. 8.12 
if you make judicious use of the information contained in Figs. 8.9 through 8.12.) 

(a) i ≤ 5.

(b) i ≤ 3.

(c) Given your answers above, what is the smallest limiting value (3, 4, or 5) such that 
the restriction on the inventory level is not binding? 

25 Construct tables like those in Figs. 8.12 and 8.13, where the constraint on the production
capacity limitation is specified as

(a) Production x in each period cannot exceed four units. 

(b) Production x in each period cannot exceed six units and C(6) = 25. 

(c) Given your answers above, compare the impact of varying the production capacity 
limit. 

(d) What is the form of an optimal policy for this company if it imposes the production 
level restriction that x ≥ 1 every period?

26 Construct a table like that in Fig. 8.12 given the constraint that inventory at the end 
of the horizon equals 

(a) One unit. (c) Three units. 

(b) Two units. (d) Four units. 

(e) For each of your answers above, construct a table like that in Fig. 8.13. 

27 Construct tables like those in Figs. 8.12, 8.13, and 8.14, given the assumption that 
demand D_t each period equals

(a) Two units. 

(b) Four units. 

(c) Explain how the optimal policies differ when stationary demand equals two or 
four, instead of three. 
28 Construct tables like those in Figs. 8.12, 8.13, and 8.14, given the assumption that

(a) The setup cost component in C(x), shown in (3) of Sec. 8.4, is 10, instead of 13.

(b) The holding cost h = 5, instead of h = 1.

(c) Explain how the optimal policies are affected by a decrease in setup cost or an 
increase in holding cost. 

29 Construct tables like those in Figs. 8.12 and 8.13, given the assumption that demand in 
the final period of the horizon is as specified below, instead of three units. (For example, 
if N = 4, then demand is altered in April.) Compare your results with the policies in
Figs. 8.12 and 8.13 and comment on the impact of incorrectly assuming in January that 
demand at the end of the horizon is three units, instead of the amount indicated below. 

(a) Two units. 

(b) Four units. 

*30 Verify that using a rolling schedule over 10 periods, with the horizon length N as 
indicated below, yields the policies shown in Fig. 8.15. 

(a) N = 3.
(b) N = 4.
(c) N = 5.
(d) N = 6.

*31 Verify that using a rolling schedule over an unbounded horizon, with the horizon 
length N as indicated below, yields the policies and associated cost per period shown 
in Fig. 8.16.

(a) N = 3.
(b) N = 4.
(c) N = 5.
(d) N = 6.

*32 Suppose the holding cost function in Sec. 8.5 is modified such that the holding cost
h = 1 is applied to average inventory, given by (ii) at the end of Sec. 8.5. Construct
tables like those in Figs. 8.12 and 8.13, and compare the effect of this alteration in
costing.

*33 Suppose that the Dependable Company abandons its policy of meeting all demand 
requirements on time. Let the cost function, which now includes a penalty for not 
meeting demand, be (vi) at the end of Sec. 8.5, where the penalty cost p = 10. In each 
part below, construct tables like those in Figs. 8.12 and 8.13, and compare the effect 
of this alteration in operating policy. 

(a) Assume that unfilled demand is lost forever, so that ending inventory is given by 
(iv) at the end of Sec. 8.5. 

(b) Assume that any demand not filled in a period is backlogged and must be filled 
at the next period. No backlog, however, is permitted to exist at the end of the 
final period, so that all demand during the horizon is eventually filled. Assume 
the penalty cost is charged every period demand remains backlogged. 

*34 In each part below, construct a complete table, laid out like Fig. 8.11, showing f_n(i)
and x_n(i) for the indicated values of n. Use the values for f_{n−1}(i) as given in Fig. 8.17.

(a) n = 2.
(b) n = 3.
(c) n = 4.
(d) n = 5.
(e) n = 6.


35 Explain your understanding of the following terms: 


dynamic programming
principle of optimality
recursion
recursive algorithm
bootstrap process (or approach)
optimal policy
stage
state
*backward in time
*rolling schedule.


COMPUTATIONAL EXERCISES 

36 Mark Off’s fame spread world wide. He received an urgent request from the Russian 
Tsar Kazim to find a route to Rostov from Vladivostok. In this case, the Tsar’s wife 
made the trip, and the expense associated with each leg of the journey is for protection 
against attacks by Cossacks and local tribesmen. The numbers shown on each arc in 
Fig. 8.21 are the rubles that must be paid for protection on that route. Find an optimal
routing. Display your calculations in tables like those in Figs. 8.2 through 8.5. 



37 Consider Tsar Kazim’s problem in exercise 36. Assume that the same number of 
rubles must be paid for a journey in the opposite direction. Find an optimal routing 
starting in Rostov and ending in Vladivostok. Display your calculations in tables like 
those in Figs. 8.2 through 8.5. 

38 Consider Tsar Kazim’s problem in exercise 36. The Tsar’s wife insists on visiting 
Territory 5, because she has learned that they have perfected a recipe for caviar 
pancakes. Find an optimal routing from Vladivostok to Rostov through Territory 5. 

Exercises 39 through 44 refer to specific cases of the inventory model described
in Sec. 8.3, and characterized by the recursion (8). In each exercise, the production
and holding cost function for Period t has the form

$$C_t(x_t, i_t) = C_t(x_t) + h_t i_t.$$

(Note if January is the beginning of the horizon, then Period 1 refers to January.)
Construct tables like those in Figs. 8.12 and 8.13 for the indicated value of the
horizon length N. (Remark: more efficient algorithms for solving these problems
are discussed in Chap. 9.)

39 Let N = 4, demand D_t = 1, and holding cost h_t = .1 for all periods. Assume that

C_1(0) = 0    C_1(1) = 5    C_1(2) = 10    C_1(3) = 15    C_1(4) = 16
C_2(0) = 0    C_2(1) = 6    C_2(2) = 9     C_2(3) = 12
C_3(0) = 0    C_3(1) = 5    C_3(2) = 7
C_4(0) = 0    C_4(1) = 3.

40 Let N = 5. Assume

D_1 = 1    D_2 = 1    D_3 = 2    D_4 = 3    D_5 = 4,

and that holding cost h_t = 1 for all periods. Suppose the production cost function is

$$C_t(x_t) = \begin{cases} 0 & \text{for } x_t = 0 \\ s_t + c_t x_t & \text{for } x_t = 1, 2, 3, \ldots, \end{cases}$$

where c_t = 10 for all periods, and

s_1 = 1    s_2 = 2    s_3 = 4    s_4 = 8    s_5 = 7.

41 Let N = 4. Assume

D_1 = 1    D_2 = 4    D_3 = 2    D_4 = 2,

and that h_t = 0 for all periods. Suppose the production cost function is the same form
as in exercise 40, where

s_1 = 1    s_2 = 12    s_3 = 1    s_4 = 2

Cj = 3 ^2 = ^ ^3=2 = 1. 

42 Solve the Longview Manufacturing case considered in exercise 28 of Chap. 6 and 
exercise 48 of Chap. 7. 

43 Suppose a firm can produce a limited number of units using its work force at regular-time
wage rates and an additional, but limited, number of units using its work
force at overtime wage rates. The data are given in Fig. 8.22.

                            Jan.   Feb.   Mar.   Apr.   May    June
Regular Time   cost/unit      2      4      2      5      2      6
               capacity       3      1      4      3      1      3
Overtime       cost/unit      5      6      6      6      3      7
               capacity       6      3      3      2      0      1
Demand Requirements           1      2      7      6      0      2

Thus, for example,

$$C_1(x_1) = \begin{cases} 2x_1 & \text{for } x_1 = 0, 1, 2, 3 \\ 2 \cdot 3 + 5(x_1 - 3) = 5x_1 - 9 & \text{for } x_1 = 4, 5, \ldots, 9. \end{cases}$$

The fluctuation in costs from period to period is due to special conditions in the
company’s labor market as well as to varying prices for the product’s raw materials.
Assume that each unit of inventory stored at the end of a period has a holding cost
h_t = 1 for all periods.


44 Assume that the horizon length N = 3, that the demand requirements are

D_1 = 3    D_2 = 6    D_3 = 3,

and that there are no holding costs. The production costs are described by

January
  setup cost = 3
  cost/unit for 1 up to 5 units = 1
  cost/unit for each item beyond 5 units and up to 10 units = 5
  cost/unit for each item beyond 10 units = 30

February
  setup cost = 6
  cost/unit for 1 to 5 units = 1
  cost/unit for each item beyond 5 units = 25

March
  setup cost = 20
  cost/unit for first item = 1
  cost/unit for each item beyond the first = 30

Construct tables like those in Figs. 8.12 and 8.13. 


MIND-EXPANDING EXERCISES 

45 Consider the inventory model described in Sec. 8.3. Suppose that x̄ represents a
“target” production level each period, and that if actual production x deviates from x̄,
then you must pay the “smoothing cost” v|x − x̄|. Assume x̄ is a prespecified constant.

(a) Show how to reformulate the recursion (8) to take this smoothing cost into
account.

*(b) Let v = 1, x̄ = 3, and use the data in Sec. 8.4. Construct tables like those in
Figs. 8.12 and 8.13 and indicate the impact of adding a smoothing cost.

*(c) Answer part (b) for x̄ = 2.

*(d) Discuss the difficulties that arise in the dynamic programming formulation if
the smoothing cost is v|x_t − x_{t−1}|. (This case is treated in Chap. 9.)

46 Indicate how you would modify the inventory model in Sec. 8.3 and the recursion (8)
to accommodate

(a) Deterioration, that is, inventory at the start of Period t is smaller than inventory 
at the end of Period t — 1. 

(b) Spoilage, that is, the production level x yields less than x usable items.

(c) Demand D_t can also be negative, indicating the return of items.

(d) Production x can also be negative, indicating the disposal of items.

*(e) Production cost in Period t also depends on the production level in the previous 
period. 

47 Indicate how to modify the inventory model in Sec. 8.3 and the recursion (8) when
R_t(y_t) represents the total revenue from selling y_t units in Period t and y_t is a decision
variable. (Assume you cannot sell a unit in Period t unless it actually is available by
the end of the period.)

Exercises 48 and 49 ask you to formulate a model in terms of a dynamic programming
recursion. Be sure to define all the symbols you use, and answer the
five questions at the end of Sec. 8.2. Give the appropriate optimization function
when there is a single period remaining, as well as the recursion for Stage n.
Explain how to initiate and terminate the calculations.

48 Consider the Feedem-Speedem Airline Company case described in exercise 32 of
Chap. 2. Formulate (but do not solve) the model in terms of a dynamic programming
recursion.

49 Warehouse Problem. Consider the case of Byer, Long, Celler, Short described in exercise
36 of Chap. 5.

(a) Formulate the model in terms of a dynamic programming recursion.

(b) Show how you would modify your answer in part (a) if there also is a fixed transaction
cost s_t of selling and a fixed transaction cost b_t of buying.

(c) Show how you would modify your answer in part (a) if gallonage at the end of
Period t is charged an inventory holding cost of h_t per gallon.

(d) Show how you would modify your formulation in part (a) if the warehouse
capacity is not constant throughout the horizon but equals Q_t in Period t. (Hint:
do not forget that if capacity is smaller than entering inventory in Period t, the
firm must sell.)

*(e) Show how to modify your formulation in part (a) if there are liquidity constraints
that limit the cash amount of a purchase in each period to be no larger than the
accumulated cash available from the preceding periods (assume K is the initial
amount of cash available at the start of Period 1).
CHAPTER 9


Dynamic Optimization of
Inventory Scheduling†


9.1 EXPLOITING SPECIAL STRUCTURE 

The analysis of the deterministic inventory model described in Sec. 8.3 is
continued here. You will see that when the cost functions are assumed to have
certain shapes, considerably more can be said about the optimal schedules. In
particular, you can determine the form of an optimal policy, and with such knowledge
devise simplified computational procedures to actually find the best policy.
Throughout this chapter we assume the cost functions are

(1)   C_t(x_t, i_t) = C_t(x_t) + h_t(i_t)   for each period,

where

(2)   C_t(x_t) ≥ 0   C_t(0) = 0   and   h_t(i_t) ≥ 0   h_t(0) = 0.

Thus the total cost in each period is the sum of C_t(x_t), which is the cost due to
producing x_t, and h_t(i_t), the cost due to having i_t at the end of a period. The
inventory balance equations are

(3)   i_t = i_{t-1} + x_t - D_t   for each period,

where we postulate i_0 = 0 and each D_t is a nonnegative integer. An equivalent
way of writing (3) is

(4)   i_t = \sum_{j=1}^{t} (x_j - D_j)   for each period.

†The focus of this chapter is primarily inventory analysis and secondarily dynamic analysis.
Without loss of continuity, you can skip this chapter now if you wish, and return to it when studying
Chap. 19 and Appendix II.



Finally, we require 

(5)   x_t and i_t nonnegative integers,

so that all demand is satisfied on time. 

9.2 CONVEX AND CONCAVE COST FUNCTIONS 

The following sections will make some additional assumptions about the shape
of C_t(x_t) and h_t(i_t). Specifically, we distinguish two important shapes.

A function g(x) defined for integer values of x is said to be convex if

(1)   g(x + 1) - g(x) ≥ g(x) - g(x - 1)   for all x   convex

and concave if

(2)   g(x + 1) - g(x) ≤ g(x) - g(x - 1)   for all x   concave.

Several graphical illustrations of convex and concave functions are shown in 
Fig. 9.1. 


[Figure 9.1. Panels: Convex Cost Functions; Concave Cost Functions.]



If you view g(x) as a total cost function, then convex costs occur when each
additional unit costs at least as much as the previous unit. Analogously, concave
costs occur when each additional unit costs no more than the previous unit. Sometimes
the permissible values of x are constrained to be within a range, such as
x = a, a + 1, ..., b - 1, b (where a + 1 ≤ b - 1). Then the definitions are
to be applied to a + 1 ≤ x ≤ b - 1.

To examine the forms of particular functions, it is convenient to have the terms
in the definitions (1) and (2) rearranged as

(3)   [g(x + 1) + g(x - 1)] / 2 ≥ g(x)   convex

(4)   [g(x + 1) + g(x - 1)] / 2 ≤ g(x)   concave.

Consider the following six examples of g(x). You will find it helpful to draw
a sketch of each function on scratch paper, and are urged to do so. For this
purpose, put a = 2, b = 1, and c = 3 in the first four examples.

Case i. Let g(x) = ax + b. Then

(5)   {[a(x + 1) + b] + [a(x - 1) + b]} / 2 = ax + b,

so that a linear function g(x) is both convex and concave for any values of a and
b and all x.


Case ii. Let g(x) = ax^2 + b. Then

(6)   {[a(x + 1)^2 + b] + [a(x - 1)^2 + b]} / 2 = ax^2 + a + b
          ≥ ax^2 + b   if a ≥ 0
          ≤ ax^2 + b   if a ≤ 0.

Thus for all x, g(x) is convex when a is nonnegative, and concave when a is
nonpositive.

Case iii. Let

      g(x) = { ax + b    for x ≥ 0
             { -cx + b   for x ≤ 0

with a > 0 and c > 0. The test for Case i shows that g(x) is convex (3) for
x < 0 and x > 0. At x = 0,

(7)   {[a + b] + [c + b]} / 2 = (a + c)/2 + b ≥ b,

so that g(x) is convex for all x.

Case iv. Let

      g(x) = { 0        for x = 0
             { ax + b   for x ≥ 1

with b > 0. The test for Case i shows that g(x) is concave (4) for x > 1. At
x = 1,

(8)   {[2a + b] + [0]} / 2 ≤ a + b,

so that g(x) is concave (for nonnegative x).
Case v. Let

      g(x) = { 0          for x = 0
             { 13 + 2x    for x = 1, 2, 3, 4, 5
             { 28.5       for x = 6.

The analysis for Case iv shows that (4) is satisfied for x ≤ 4. At x = 5,

(9)   {[28.5] + [21]} / 2 = 24.75 > 23,

so that (3) is satisfied. Consequently, g(x) is neither convex nor concave.
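The tests (3) and (4) are easy to mechanize. The Python sketch below is an illustration only: the function name `classify` and the representation of g as a list of values g(0), g(1), ... are assumptions made here, not notation from the text. It applies both tests at every interior point and reproduces the conclusion reached for Case v.

```python
def classify(g):
    """Classify tabulated values g[0], g[1], ... using tests (3) and (4).

    Returns (is_convex, is_concave). Only interior points
    x = 1, ..., len(g) - 2 are tested, as for a restricted range of x.
    """
    # Second difference g(x+1) - 2 g(x) + g(x-1):
    # nonnegative means test (3) holds at x, nonpositive means test (4) holds.
    second = [g[x + 1] - 2 * g[x] + g[x - 1] for x in range(1, len(g) - 1)]
    return all(d >= 0 for d in second), all(d <= 0 for d in second)


# Case v: g(0) = 0, g(x) = 13 + 2x for x = 1..5, g(6) = 28.5
case_v = [0] + [13 + 2 * x for x in range(1, 6)] + [28.5]
print(classify(case_v))                              # (False, False)
print(classify([2 * x + 1 for x in range(7)]))       # linear: (True, True)
print(classify([2 * x * x + 1 for x in range(7)]))   # quadratic, a = 2: (True, False)
```

The linear and quadratic calls use a = 2 and b = 1, the values suggested for sketching the first examples.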


Case vi. Check your understanding of the above process by showing that the
piecewise linear function

(10)  g(x) = { a_1 x + b_1                                      for 0 ≤ x ≤ w_1
             { a_1 w_1 + a_2(x - w_1) + b_1                     for w_1 ≤ x ≤ w_2
             { a_1 w_1 + a_2(w_2 - w_1) + a_3(x - w_2) + b_1    for w_2 ≤ x

is convex if a_1 ≤ a_2 ≤ a_3 and concave if a_1 ≥ a_2 ≥ a_3. (Make the tests for x = w_1
and w_2. Draw the function by letting a_1 = 2, b_1 = 1, w_1 = 4, a_2 = 3, w_2 = 6,
a_3 = 4.)


Several of the above examples are of special interest. For instance, Case iii
includes

(11)  h_t(i_t) = { h_t i_t     if i_t ≥ 0
                 { -p_t i_t    if i_t ≤ 0,

where h_t > 0 represents a per-unit inventory holding cost and p_t > 0 a per-unit
backlog penalty charge. Case v encompasses the example in Sec. 8.6. Finally,
Case vi contains

(12)  C_t(x_t) = { r_t x_t                                             for 0 ≤ x_t ≤ u_t
                 { r_t u_t + 1.5r_t(x_t - u_t)                         for u_t ≤ x_t ≤ v_t
                 { r_t u_t + 1.5r_t(v_t - u_t) + 2r_t(x_t - v_t)       for v_t ≤ x_t,

where x_t is the total man-hours scheduled in Period t, r_t is the regular-time wage
per man-hour, u_t is a given total amount of regular-time man-hours available in
Period t, 1.5r_t is the “time and a half” wage rate per overtime hour, v_t is a given
total amount of overtime hours available in Period t, and 2r_t is the “double-time”
wage rate per man-hour.


9.3 INVENTORY MODEL WITH CONVEX COSTS 

In addition to the cost, inventory, and production assumptions already made
in Sec. 9.1, assume that

(1)   C_t(x_t) is convex (production cost)

(2)   h_t(i_t) is convex (holding cost).

A convexity assumption is sometimes called a situation of decreasing
incremental returns to scale. It is also possible to impose, in each
period, an upper-bound constraint u_t on x_t as well as a maximum amount b_t on
inventory. To keep the explanation simple, suppose you handle these constraints
by letting the corresponding values of C_t(x_t) for x_t > u_t, and h_t(i_t) for
i_t > b_t, be infinitely large. This convention preserves the convexity in (1) and (2). Finally,
we mention that the analysis can be extended to permit backlogging of demand
requirements from a period to later periods.

In brief, the algorithm starting in Period 1 proceeds period by period to fill 
each unit of demand requirement as cheaply as possible, given the production 
already scheduled and the resultant pattern of inventory. The fact that such a 
simple algorithm leads to an optimal solution, of course, rests heavily on the con¬ 
vexity assumptions (1) and (2). 

The details of the algorithm for this model are as follows. 

Step 1. Let p be the earliest period in which the current demand requirement
value is D_p > 0. For each of the Periods 1, 2, ..., p consider increasing production
by one unit in the current trial schedule in order that one unit of D_p is
filled.

Step 2. For each of the possible p revisions, calculate the entire incremental
cost from the increased production and inventory holding. Select an alternative
with minimum incremental cost and revise the trial schedule accordingly. If
there is more than one such alternative, schedule production in as late a period
as possible.

Step 3. Reduce the current value of D_p by one unit. Examine whether the current
values of all D_t have now been reduced to 0. If so, stop; otherwise return
to Step 1.

The example to follow illustrates the method. 

The Highway Rubber Company Problem. This firm is planning its
production schedule of tires for six months, January through June. The fluctuating
monthly demands D_t are shown on the bottom of Fig. 9.2. Notice they accumulate
to 18, so that the total demand for the span N = 6 is the same as that in the
Dependable Manufacturing Company Example of Sec. 8.4.

In each period, there is a capability of producing a given number of tires at 
regular cost and an additional number at a premium cost. To demonstrate the 
flexibility of the algorithm, the regular and premium costs, and maximum 
amounts achievable at these costs, vary from period to period in the illustration. 
In Fig. 9.2, these costs appear in the pair of boxes on the diagonal, the upper figure 
being the regular cost. The corresponding maximum capacities are in the next to 
last column on the right. The far right column accumulates the monthly production 
capabilities. 
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Figure 9.2. Highway Rubber Company Convex Cost Model. 


For example, in January as many as three units can be produced at a cost of 
2 each, and as many as an additional six at a cost of 5 each. In February, one unit 
can be produced at a cost of 4, and as many as an additional three at a cost of 6. 
In March, as many as four units can be produced at a cost of 2 each, and as many 
as an additional three at 6 each, etc. Thus in each period the production cost
function is of the form

(3)   C_t(x_t) = r_t x_t   for 0 ≤ x_t ≤ U_t (regular time)

(4)   C_t(x_t) = r_t U_t + s_t(x_t - U_t)
               = s_t x_t + (r_t - s_t)U_t   for U_t ≤ x_t ≤ V_t (overtime)

with r_t < s_t, so that C_t(x_t) is convex. In (3) and (4), r_t represents the regular cost
and s_t the premium cost. The U_t and V_t are the corresponding production capacities.
(So for t = 1, the values are r_1 = 2, s_1 = 5, U_1 = 3, and V_1 = 3 + 6 = 9.)

The rows of Fig. 9.2 refer to production scheduled in each month. The columns 
refer to the monthly demands. Since no backlogging is allowed, the boxes below 
the diagonal can be eliminated. Thus the first two rows of the figure indicate that 
January’s production can be allocated to the demand in any of the six months, the 
second two rows indicate that February’s production can be allocated to demand 
in February or a subsequent month, etc. Likewise, June’s demand can be filled 
by production from any month. In order for a feasible schedule to exist, it is neces¬ 
sary that the cumulative monthly production capability be at least as large as the 
accumulated demand. Check the right-hand column and the bottom row of 
Fig. 9.2 to verify that this condition is satisfied in the example. 

The computational method about to be described is a slight simplification of
the algorithm above and is made possible because of (3), (4), and

(5)   h_t(i_t) = h_t i_t   (linear holding cost).

To keep the numerical example simple, assume that h_t = 1 for all periods.
Therefore the cost figures across each row of Fig. 9.2 increase by 1 to indicate the
holding cost per item per period. For example, if an item is produced in January
at the regular cost of 2 and held until February, the total unit cost becomes
3 (= 2 + 1), which appears in the first row of the February column. If it is held
another month, an additional unit is added, for a total of 4 (= 3 + 1), which
appears in the first row of the March column. Incidentally, we do not need to
assume that ending inventory i_N = 0, because the cost assumptions (3), (4), and (5)
ensure this condition holds in an optimal policy.

To follow the details of the algorithm, copy Fig. 9.2 on a sheet of paper. Notice 
that the format resembles that of a transportation problem. This is no coincidence, 
because the linearities in (3), (4), and (5) do yield such a model. 

Computation of an optimal policy. Using transportation problem 
terminology, think of each permissible box in Fig. 9.2 as a route, with the asso¬ 
ciated unit cost displayed as usual. The idea simply is to fill successive units of 
demand, starting with January, then February, and so on, by an available route 
having the least cost. Whenever there are ties, production is scheduled as late as 
possible. 

To help you perform the process on your copy of Fig. 9.2, here is a summary 
of what happens. 

(i) Fill January’s demand of 1 at a unit cost of 2. 

(ii) Fill February’s demand of 2 at a unit cost of 3. 

(iii) Fill the first four items of March’s demand at a unit cost of 2, the next item
at a unit cost of 5, and the last two items at a unit cost of 6.

(iv) Fill the first three items of April’s demand at a unit cost of 5, the next
two items at a unit cost of 6, and the last item at a unit cost of 7.

(v) Since there is no demand in May, make no entries in the May column.

(vi) Fill the first item of June’s demand at a unit cost of 3, and the second item
at a unit cost of 6.

As you progress from (i) to (vi), reduce the Production Capacity figures in the
next to last column of Fig. 9.2. As soon as any such figure becomes 0, do not
permit the other routes in that row to be available for the rest of the schedule.
Compare your schedule with that shown in Fig. 9.3.

Figure 9.3. Highway Rubber Company Convex Cost Schedule.
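The route-filling procedure can be sketched in Python as follows. This is an illustration under stated assumptions, not the book's own program: the function name `fill_demands` and the data layout are hypothetical, and the data cover only January through March, the months whose costs and capacities are quoted in the text (the remaining months appear only in Fig. 9.2). The demands 1, 2, and 7 are read off steps (i) to (iii).

```python
def fill_demands(demand, segments, hold=1):
    """Greedy schedule for piecewise-linear convex production costs.

    demand[p]   -- units required in period p (0-based)
    segments[t] -- list of (unit_cost, capacity) pairs for period t,
                   in order of nondecreasing unit cost (regular, then premium)
    hold        -- holding cost per unit per period (linear holding cost)
    Returns (production per period, total cost).
    """
    left = [[cap for _, cap in segs] for segs in segments]   # unused capacity
    produced = [0] * len(demand)
    total = 0
    for p, units in enumerate(demand):
        for _ in range(units):
            best = None
            for t in range(p + 1):                 # no backlogging: t <= p only
                for s, (cost, _) in enumerate(segments[t]):
                    if left[t][s] > 0:             # cheapest open segment in t
                        route = cost + hold * (p - t)
                        if best is None or route <= best[0]:   # ties go to later t
                            best = (route, t, s)
                        break
            if best is None:
                raise ValueError("cumulative capacity is short of cumulative demand")
            route, t, s = best
            left[t][s] -= 1
            produced[t] += 1
            total += route
    return produced, total


# (unit cost, capacity) for regular and premium production, January to March.
segments = [[(2, 3), (5, 6)], [(4, 1), (6, 3)], [(2, 4), (6, 3)]]
plan, cost = fill_demands([1, 2, 7], segments)
print(plan, cost)   # [3, 1, 6] 33
```

For this three-month truncation March production is 6 rather than the 7 of the six-month schedule, which appears consistent with one further March premium unit being routed to April in step (iv).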

An optimal schedule is then

(6)   x_1 = 3   x_2 = 1   x_3 = 7   x_4 = 5   x_5 = 1   x_6 = 1
      i_1 = 2   i_2 = 1   i_3 = 1   i_4 = 0   i_5 = 1   i_6 = 0,

which can be obtained by looking at the capacities used (shown in the next to
last column of Fig. 9.3) to find x_t, and by summing the amounts in the boxes
above and to the right of the diagonal pair in Period t + 1 to find i_t.
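As a cross-check, the inventory levels can also be recovered from the balance equation i_t = i_{t-1} + x_t - D_t of Sec. 9.1. The short Python sketch below does this; the helper name `ending_inventory` is hypothetical, and the monthly demands 1, 2, 7, 6, 0, 2 are inferred from the step summary (i) to (vi), since Fig. 9.2 itself is not reproduced here.

```python
def ending_inventory(production, demand, start=0):
    """Apply i_t = i_{t-1} + x_t - D_t period by period; return [i_1, ..., i_N]."""
    levels = []
    level = start
    for x, d in zip(production, demand):
        level = level + x - d
        if level < 0:                      # demand must be satisfied on time
            raise ValueError("schedule would require backlogging")
        levels.append(level)
    return levels


x = [3, 1, 7, 5, 1, 1]      # optimal production, January through June
D = [1, 2, 7, 6, 0, 2]      # demands inferred from steps (i)-(vi)
print(ending_inventory(x, D))   # [2, 1, 1, 0, 1, 0]
```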

Obviously, this procedure is much simpler than computing the dynamic programming
recursion, as was done in Chap. 8. It also contains a couple of simplifications in the three-step
algorithm above. Specifically, the tabular form makes apparent in Step 2 the
minimum incremental cost alternative with available capacity. And in Step 3
the current values of demand are reduced as much as possible, given the currently
available production capacity for the best alternative.

* Additional constraints. The earlier discussion of the convex cost
assumptions (1) and (2) mentioned that the three-step algorithm for this model
can encompass upper bounds u_t on x_t and upper limits b_t on the amounts inventoried.
In the example, the upper bounds on x_t were honored simply by never
even considering values of x_t that were too large. (No restrictions on inventory
level were imposed.) You saw in the Dependable Manufacturing Company
Example of Sec. 8.4 that the imposition of such constraints can have unanticipated
effects. But when convex costs are assumed, the following proposition is true.

Effect of Constraint Variations. (a) The optimal value of x_p will not
decrease if the capacity limit u_p increases, or if any of the inventory limits b_t
increase for t ≥ p. (b) The optimal value of x_p will not increase if any capacity
limit u_t increases for t ≠ p, or if any inventory limit b_t increases for t < p.

To see that the above proposition is not trivial, merely recall what happened
in the Dependable Manufacturing Company illustration. When the
bound x_t ≤ 5 was in force, i_0 = 2, and N = 6, an optimal schedule was

(7)   x_1 = 4   x_2 = 0   x_3 = 3   x_4 = 4   x_5 = 5   x_6 = 0,

which can be seen by tracing the solution in Fig. 8.12. Suppose that the limit
on only x_3 is increased to 6. Then the following solution, found in Fig. 8.17,
becomes optimal:

(8)   x_1 = 5   x_2 = 0   x_3 = 6   x_4 = 0   x_5 = 5   x_6 = 0.

Thus x_1 increases when the capacity limit u_3 increases, contrary to part (b). (Note
that the assumption i_0 = 2 is of no importance, since the same result would occur
if i_0 = 0 and D_1 = 1.)
► You can encompass the possibility of backlogging by making only a few minor changes
in the wording of the three-step algorithm, but the increased computational burden is
considerable. Let a backlog be denoted by a negative value for i_t. Now assume that h_t(i)
is convex for all values of i, that is, for ..., -2, -1, 0, 1, 2, .... Alter Step 1 so that
you consider increasing production by one unit for each of the Periods 1, 2, ..., N.
Alter Step 2 to include the incremental cost from backlogging. The bulk of added
computation arises in Step 2 in calculating the entire incremental cost for each possible
revision. Given the production amounts in a revised schedule, you must allocate them
to the demands already considered so that the corresponding inventory and backlogging
costs are minimal. Leave Step 3 unaltered. ◄

* 9.4 PLANNING HORIZON ANALYSIS FOR
CONVEX COST MODEL

It is clear in the application of the algorithm for the special example above that 
production in each period may increase, but will never decrease as the planning 
horizon N lengthens. The conclusion also holds for the general convex cost model 
of the previous section. 

Preparatory to the statement of such a proposition, let cumulative demand and
cumulative production for Periods 1, 2, ..., p be denoted as

(1)   R_p = \sum_{t=1}^{p} D_t   and   X_p = \sum_{t=1}^{p} x_t.

Suppose you are not entirely certain about the actual demand amounts, but
you can state that R_p lies within a specified range

(2)   S_p ≤ R_p ≤ T_p   for p = 1, 2, ..., N.

Let X_p(S) be an optimal production schedule assuming that R_p = S_p, the minimal
forecast, and similarly let X_p(T) be an optimal production schedule assuming that
R_p = T_p, the maximal forecast, for p = 1, 2, ..., N. Then the following can be
said about an optimal schedule.

(a) An optimal value of x_p will not decrease when any demand requirement is revised upward.
(b) An optimal schedule satisfies X_p(S) ≤ X_p ≤ X_p(T) for
p = 1, 2, ..., N.

Part (b) of the theorem is illustrated graphically in Fig. 9.4. 

The preceding result is properly called a planning horizon theorem, because
you can regard lengthening N to N + 1 as being the same as revising the value
of D_{N+1} upward from its zero level in the analysis of the N-period horizon problem.
Part (a) of the proposition states that you are never required to reduce a previously
scheduled amount of production when more demand is encompassed in the plan.

In the Dependable Manufacturing Company Example of Sec. 8.4, both parts
(a) and (b) failed to hold. As you can see in Fig. 8.13, when June demand was added
into consideration, so that N increased from 5 to 6, January production decreased
from 5 to 4, and the combined January and February production decreased from
10 to 9.

Note to Fig. 9.4: X_p(S) optimal schedule for minimal forecast
X_p(T) optimal schedule for maximal forecast
X_p optimal schedule

An important implication of part (b) is that in particular circumstances it may
be possible to select an optimal production amount x_1 with no more than rough
information about demands in later periods. The idea is to find X_1(S) and X_1(T),
and see if these differ only a little, if at all. A narrow range implies that x_1 can be
set within this interval without much possible loss of optimality. Then in the following
period, when more information about demand emerges, the process can
be repeated. If the range is large, an examination of the two schedules X_p(S) and
X_p(T) will help pinpoint the location where a better demand estimate is required.

Figure 9.4. Bounds on Optimal Schedule for Convex Costs.

► One case of a convex cost model is of special interest, since it occurs frequently and
provides extremely powerful and useful planning horizon results. Suppose there are no
production and inventory constraints, the production cost relation is the same in every
period, and the inventory holding costs are so negligible that they can be ignored, giving

(i)   C_t(x_t, i_t) = C(x_t)   for every period.

For this situation, drop the assumption that x_t must be integer-valued, and allow x_t
to be any nonnegative number. Therefore you require a definition for convexity that
extends the stipulation (1) in Sec. 9.2. The direct generalization is to say that g(x) is convex
if for every w and z, and for all p such that 0 ≤ p ≤ 1,

(ii)   p[g(z) - g(x)] ≥ (1 - p)[g(x) - g(w)],

where

(iii)   x = pz + (1 - p)w.

Rearranging the terms in (ii), and using (iii), yields

(iv)   pg(z) + (1 - p)g(w) ≥ g[pz + (1 - p)w]   convex.

[By letting p = 1/2, w = x - 1, and z = x + 1, (iv) above implies (3) in Sec. 9.2.]

Let A_p be the average demand per period for the first p periods

(v)   A_p = R_p / p   for p = 1, 2, ..., N,

and let A_n be the maximum of these. If the maximum value occurs for more than one
value of p, let n be the latest period. Then it can be proved that it is optimal to schedule

(vi)   x_t = A_n   for t = 1, 2, ..., n,

so that i_n = 0. It also can be shown that the remaining optimal values for x_t satisfy

(vii)   x_t ≤ A_n   for t > n.

A further implication is that Periods 1, 2, ..., n comprise a planning horizon, and the
remaining schedule can be found by looking at Periods n + 1, ..., N by themselves,
assuming only that inventory entering Period n + 1 is zero. This means that the calculation
in (v) can be repeated, now using

(viii)   B_p = (R_p - R_n) / (p - n)   for p = n + 1, ..., N.

Then the next planning interval starts in Period n + 1 and ends in Period q, which
occurs at the latest p yielding the maximum B_p. Once again B_q represents the actual
production amount for each of the periods in the second horizon. Of course, this line of
reasoning can be repeated until x_N is scheduled. One consequence of this result is that if
D_{t+1} ≥ D_t for every t, then (v) yields n = N. In the opposite case, if D_{t+1} ≤ D_t for
every t, then x_t = D_t is optimal.

The above construction can also be characterized in terms of a “graphical” procedure:
insert pegs at the points (p, R_p) for p = 1, 2, ..., N, indicated on ordinary rectangular
coordinate paper. Place a piece of string such that it lies above each peg, and tie one
end of the string to the point (0, 0). Holding the string against the peg at (N, R_N), pull
the string until it is taut everywhere. The resultant graph indicated by the taut string
represents an optimal cumulative schedule X_p.

If the assumption (i) is applied to a case with the demands in Fig. 9.2, calculating
(v) would yield

(ix)   A_1 = 1/1   A_2 = 3/2   A_3 = 10/3   A_4 = 16/4   A_5 = 16/5   A_6 = 18/6,

so that n = 4, and x_t = 4 for t = 1, 2, 3, 4. Then (viii) would, in turn, yield

(x)   B_5 = (16 - 16)/(5 - 4) = 0/1   B_6 = (18 - 16)/(6 - 4) = 2/2,

so that q = 6 and x_t = 1 for t = 5, 6.
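The repeated use of (v) and (viii) can be sketched in a few lines of Python. The function name `level_schedule` is hypothetical, and the demand list 1, 2, 7, 6, 0, 2 is inferred from the Highway Rubber step summary in Sec. 9.3 rather than read from Fig. 9.2. Exact fractions are used because the production amounts need not be integers in this model.

```python
from fractions import Fraction


def level_schedule(demand):
    """Repeat (v) and (viii): level production over successive planning horizons."""
    N = len(demand)
    schedule = []
    start = 0                        # periods already scheduled (0, then n, then q, ...)
    while start < N:
        best_rate, end = None, None
        running = 0
        for p in range(start + 1, N + 1):
            running += demand[p - 1]                 # R_p minus R_start
            rate = Fraction(running, p - start)      # average demand since last horizon
            if best_rate is None or rate >= best_rate:   # keep the latest p on ties
                best_rate, end = rate, p
        schedule += [best_rate] * (end - start)
        start = end
    return schedule


print([str(r) for r in level_schedule([1, 2, 7, 6, 0, 2])])
# ['4', '4', '4', '4', '1', '1']
```

With nonincreasing demands such as 3, 2, 1 the same routine returns x_t = D_t, since each horizon then has length one.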

Notice that if several values of D_t change but A_n remains the largest value of A_p, the
first horizon does not change. For example, a downward alteration of any D_t, t =
n + 1, ..., N, does not alter the length of the first horizon.

Finally, observe the significant and surprising fact that to establish an optimal schedule
and determine the planning horizons, the numerical values of C(x_t) are not required. All
you need to know is that the same production cost function applies in every period, that
its shape is convex, and that inventory holding costs are negligible.


9.5 LOT-SIZE INVENTORY MODEL WITH 
CONCAVE COSTS 


The model to be presented here is another special case of the elementary inven¬ 
tory model in Sec. 8.3. It is sometimes referred to as a dynamic lot-size 
model and is noteworthy because it represents a situation of frequent practical 
importance. Furthermore, it provides a case of complexity intermediate to that 
of the general model in Sec. 8.3 and the convex cost model in Sec. 9.3. 

In addition to the cost, inventory, and production assumptions already made in
Sec. 9.1, assume that

(1)   C_t(x_t) is concave (production cost),

(2)   h_t(i_t) is concave (holding cost).

A concave cost assumption is sometimes referred to as a situation of increasing
incremental returns to scale. Although backlogging is not permitted,
the analysis can be easily modified to handle such a possibility, as we discuss in
the advanced material of Sec. 9.6. In sharp contrast to the convex cost model,
upper limits on x_t and i_t are not permitted. Recall that an upper bound on production
can be viewed as letting C_t(x_t) be infinitely large for x_t beyond the critical
value. In such terms, C_t(x_t) would no longer remain concave.

One frequent example of concave production costs occurs when production
involves an initial setup and then each item incurs a standard unit cost:

(3)   C_t(x_t) = { 0               for x_t = 0
                 { s_t + c_t x_t   for x_t ≥ 1.

Recall that this was the cost function of the Dependable Manufacturing Company
Example in Sec. 8.4, with setup cost s_t = 13, unit cost c_t = 2, for x_t ≤ 5. Although
C_t(x_t) was concave over the range x_t = 0, 1, ..., 5, that example does not fit the
assumptions of this section, since upper bounds on x_t are not allowed here.

Another example of concave costs often arises when the model is applied to a
situation in which inventory is replenished by purchasing the item from an outside
vendor. In these instances, the seller may offer so-called quantity discounts
for large orders. To illustrate, the schedule of price breaks may be given as

(4)   $10 per item for any amount ordered up to a dozen
      $ 8 per additional item above a dozen and up to a gross
      $ 5 per additional item beyond a gross.

The mathematical expression corresponding to this schedule is

(5)   C_t(x_t) = { 10x_t                        for 0 ≤ x_t ≤ 12
                 { 120 + 8(x_t - 12)            for 13 ≤ x_t ≤ 144
                 { 120 + 1056 + 5(x_t - 144)    for 145 ≤ x_t.

The example is of the same form as (10) in Sec. 9.2. Thus C_t(x_t) is concave, since
the coefficients of x_t, namely 10, 8, and 5, are progressively smaller. If a setup cost
is added to (5), the function remains concave. Hence, in finding an optimal inventory
policy for this situation, a firm can add a lump-sum cost attributable to its
own paper work and handling associated with its placing and receiving the order.

Structural result. The fundamental insight for the analysis of this model
is given in the following proposition.

Form of an Optimal Policy. There always exists a minimal cost policy with the
property that x_t has one of the following values: 0, D_t, D_t + D_{t+1}, ..., D_t + D_{t+1} +
... + D_N.

The statement implies that in seeking an optimal policy, you need only consider
[1 + N - (t - 1) = N - t + 2] possible values for x_t. Compare this with the
general model in Sec. 8.3, where x_t could take on any value between 0 and D_t +
D_{t+1} + ... + D_N. An important algorithmic implication of the proposition is
that to compute an optimal policy you need consider only approximately .5N^2
different possibilities.

A further result of the proposition is that there always exists an optimal policy
with the property that, in each period, if entering inventory is positive, then
no production is scheduled. (State why this follows from the form of an optimal
policy.) Symbolically, the implication is that there exists an optimal policy for
which

(6)   i_{t-1} x_t = 0   for all t.

Therefore, if D_t is satisfied from inventory i_{t-1} > 0, all these items were manufactured
in the latest period in which production occurred. This was certainly
not true for optimal policies in the convex cost case.

Whenever an optimal x_t > 0, the situation can be referred to as a regeneration
event. The term reflects the notion that, at such a period, entering inventory
is zero, so a production cycle starts afresh.

What now remains to be discussed is how a dynamic programming recursion 
can be used to efficiently find an optimum among the schedules satisfying the 
stated form. In preparation, we illustrate the main ideas with a verbal description. 

Suppose January is the first month in the horizon. Then you begin by calculat¬ 
ing the cost of producing in January for January’s demand requirement. You then 
lengthen the horizon month by month, using the calculations for the previous 
months. For example, suppose you have found optimal policies for the planning 
horizons January; January and February; January, February, and March; and 
so on, up to January, February, March, April, May, and June. Assume each of 
these policies has the form stated above. You next want to add July to the horizon. 

The form of an optimal policy implies that all of July’s demand requirement
will be produced in either January, February, ..., or July, altogether seven
possibilities to consider. Further, if it is produced in April, say, then April’s production
quantity consists of the demand requirements for April, May, June, and
July (state why), and an optimal policy for January, February, and March is the
three-month horizon policy that you already know. Consequently, an optimal
seven-month horizon policy is one that has the smallest total cost of the seven
possibilities.
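The month-by-month extension just described can be sketched in Python. This is a minimal illustration under stated assumptions, not the book's own procedure: the function name `lot_size_plan`, the helper `cost_block`, and the three-period data (setup cost 4, unit cost 1, holding cost 1 per unit per period, demands 2, 3, 2) are all hypothetical.

```python
def lot_size_plan(demand, prod_cost, hold_cost):
    """Extend the horizon one period at a time, as in the verbal description.

    demand[t-1]      -- requirement D_t for t = 1..N
    prod_cost(t, x)  -- C_t(x), assumed concave with C_t(0) = 0
    hold_cost(t, i)  -- h_t(i), assumed concave with h_t(0) = 0
    Returns (production per period, minimum total cost).
    """
    N = len(demand)

    def cost_block(k, j):
        # Produce in Period k+1 everything needed for Periods k+1, ..., j.
        total = prod_cost(k + 1, sum(demand[k:j]))
        for m in range(k + 1, j):                 # inventory at end of Period m
            total += hold_cost(m, sum(demand[m:j]))
        return total

    f = [0] + [None] * N      # f[n] = best cost for Periods 1..n, ending with no stock
    last = [0] * (N + 1)      # last[n] = k, meaning the final production is in Period k+1
    for n in range(1, N + 1):
        for k in range(n):
            candidate = f[k] + cost_block(k, n)
            if f[n] is None or candidate < f[n]:
                f[n], last[n] = candidate, k

    plan = [0] * N
    n = N
    while n > 0:              # trace back through the regeneration periods
        k = last[n]
        plan[k] = sum(demand[k:n])
        n = k
    return plan, f[N]


# Hypothetical data: setup cost 4, unit cost 1, holding cost 1 per unit per period.
plan, cost = lot_size_plan([2, 3, 2],
                           lambda t, x: 0 if x == 0 else 4 + x,
                           lambda t, i: i)
print(plan, cost)   # [2, 5, 0] 17
```

In this small case the three candidates for the three-period horizon cost 18, 17, and 18, so producing the second and third requirements together in Period 2 is best.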

► We can easily establish the validity of the form of an optimal policy for an inventory
model with concave costs. Whereas (6) was stated as a consequence of the form
of an optimal policy, the approach taken here is first to demonstrate (6), and then show
why it follows that the stated form is optimal.

Consider any feasible schedule in which i_{t-1} x_t > 0 for at least one Period t. Assume
Period p is the first such instance. Examine the earlier period in which the items comprising
i_{p-1} were produced, and calculate the additional cost of producing one more unit
and storing it until Period p. Compare this figure with the additional cost of producing
one more unit in Period p.

If the incremental cost of an additional unit of production in Period p is the smaller
amount, alter the schedule by adding i_{p-1} to x_p and correspondingly reducing production
in the earlier period. Because of the concave cost assumptions (1) and (2), the
full cost increase in Period p will not be larger than the total decrease in cost achieved
by eliminating the earlier production and inventory. By the same token, if the incremental
cost of an additional unit in Period p is the greater amount, then total costs do
not increase if you add x_p to the earlier period’s production and eliminate production
in Period p. In either event, you have constructed a feasible schedule such that (6)
now holds for t = 1, 2, ..., p. If there are later periods violating the condition, apply
the same line of reasoning to them. The resultant schedule, where (6) holds for all
periods, costs no more than the starting schedule.

The proposition itself then follows since to assert otherwise means there would be
a period t in which 0 < i_{t-1} < D_t, requiring for feasibility that x_t ≥ D_t - i_{t-1} > 0,
contrary to (6). ◄


9.6 ALGORITHM FOR CONCAVE COST MODEL


Let

$c_{kj}$ = total cost of producing in Period $k+1$ to meet the demand requirements
of Periods $k+1, \ldots, j$, where $k = 0, 1, \ldots, N-1$ and $k+1 \le j \le N$.

Thus

(1)
$$c_{kj} = \begin{cases}
C_{k+1}(D_{k+1}) & \text{for } j = k+1 \\
C_{k+1}(D_{k+1} + \cdots + D_j) + h_{k+1}(D_{k+2} + \cdots + D_j) + \cdots + h_{j-1}(D_j) & \text{for } j > k+1.
\end{cases}$$


To illustrate, if $k+1$ is January and $j$ is March, then $c_{kj}$ is the sum of the
production cost for all of January, February, and March demand, the holding cost levied
at the end of January on the inventory to meet February and March demand, and
the holding cost levied at the end of February on the inventory to meet March
demand.

The Selmore Company Problem. The numerical data for this company appear in
Fig. 9.5, where the horizon $N = 4$. In addition to the $C_t(x_t)$ displayed, assume that

(2) $D_t = 1$ and $h_t(i_t) = .1\,i_t$ for all periods.

(Try to guess the optimal schedules for $N = 1, 2, 3, 4$.)

The following are a few illustrations of (1) using the data in Fig. 9.5:

(3)
$$\begin{aligned}
c_{01} &= C_1(1) = 5 \\
c_{02} &= C_1(1+1) + h_1(1) = 10 + .1 = 10.1 \\
c_{03} &= C_1(1+1+1) + h_1(1+1) + h_2(1) = 15 + .2 + .1 = 15.3 \\
c_{04} &= C_1(1+1+1+1) + h_1(1+1+1) + h_2(1+1) + h_3(1) = 16 + .3 + .2 + .1 = 16.6 \\
c_{12} &= C_2(1) = 6 \\
c_{13} &= C_2(1+1) + h_2(1) = 9 + .1 = 9.1 \\
c_{14} &= C_2(1+1+1) + h_2(1+1) + h_3(1) = 12 + .2 + .1 = 12.3.
\end{aligned}$$

Calculate the remaining values $c_{23}$, $c_{24}$, and $c_{34}$, and compare your answers with the
amounts shown in Fig. 9.6.
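
The hand calculations in (3) can also be checked mechanically. The following Python sketch evaluates definition (1) for every pair (k, j). The production cost tables are assumptions read off the worked figures in this section (the amounts quoted for $C_1$ and $C_2$ in (3), and the values $c_{23} = 5$, $c_{24} = 7.1$, $c_{34} = 3$ that appear in the recursion later in the section); they are not a transcription of Fig. 9.5. The names `C`, `D`, `H`, and `c` are illustrative only.

```python
# Production cost tables C_t(x). These are assumptions inferred from the
# worked values in the text, not a copy of the original figure.
C = {
    1: {1: 5, 2: 10, 3: 15, 4: 16},
    2: {1: 6, 2: 9, 3: 12},
    3: {1: 5, 2: 7},
    4: {1: 3},
}
D = {1: 1, 2: 1, 3: 1, 4: 1}   # demand D_t = 1 in every period
H = 0.1                         # linear holding cost h_t(i) = .1 * i
N = 4


def c(k, j):
    """Cost of producing in Period k+1 all demand of Periods k+1..j, as in (1)."""
    total_demand = sum(D[t] for t in range(k + 1, j + 1))
    cost = C[k + 1][total_demand]
    # Inventory held at the end of Period t covers the demand of Periods t+1..j.
    for t in range(k + 1, j):
        cost += H * sum(D[s] for s in range(t + 1, j + 1))
    return round(cost, 10)      # rounding only hides binary floating-point noise


c_table = {(k, j): c(k, j) for k in range(N) for j in range(k + 1, N + 1)}
for (k, j), value in sorted(c_table.items()):
    print(f"c[{k},{j}] = {value}")
```

Only the ten pairs with $k < j \le N$ are tabulated, which is all the recursion of this section needs.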
Figure 9.5. Selmore Company. Concave Production Cost.
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Cost Model. 


Define

$f_n$ = minimum policy cost for Periods $1, 2, \ldots, n$, given that the
inventory level is zero at the end of Period $n$.

Then the appropriate dynamic programming recursion is

(4) $$f_n = \min_{k = 0, 1, \ldots, n-1} \left[ f_k + c_{kn} \right] \quad \text{for } n = 1, 2, \ldots, N,$$

where $f_0 = 0$. Let $k_n$ designate a value of $k$ that yields $f_n$ in (4). What is being sought
is the value of $f_N$.

The bracketed expression on the right of (4) is the sum of two costs. The first
cost, $f_k$, represents all the costs during Periods $1, \ldots, k$, incurred by an optimal
policy for this horizon. Such a policy leaves no inventory at the end of Period $k$.
The second cost, $c_{kn}$, represents the additional costs in Periods $k+1, \ldots, n$
of starting in Period $k+1$ with no inventory, and producing for all the remaining
demand. A value $k_n$ that minimizes these sums provides an optimal policy for a
horizon of $n$ periods by specifying Period $k_n + 1$ as the period (in the interval
$1, 2, \ldots, n$) at which the last regeneration event occurs. The algorithm can
therefore be viewed as a way to determine an optimal sequence of regeneration
points. [Because entering inventory is always zero at a regeneration point, we are
able to suppress the state variable $i$ ($= 0$) in expression (4).] Observe that the
algorithm has been formulated so as to proceed forward in time.

The method is illustrated below for the example appearing in Fig. 9.5:

(5)
$$\begin{aligned}
f_0 &= 0 \\
f_1 &= \min\,[f_0 + c_{01}] = [0 + 5] = 5 \quad \text{and } k_1 = 0 \\
f_2 &= \min\,[f_0 + c_{02},\ f_1 + c_{12}] \\
    &= \min\,[0 + 10.1,\ 5 + 6] = 10.1 \quad \text{and } k_2 = 0 \\
f_3 &= \min\,[f_0 + c_{03},\ f_1 + c_{13},\ f_2 + c_{23}] \\
    &= \min\,[0 + 15.3,\ 5 + 9.1,\ 10.1 + 5] = 14.1 \quad \text{and } k_3 = 1 \\
f_4 &= \min\,[f_0 + c_{04},\ f_1 + c_{14},\ f_2 + c_{24},\ f_3 + c_{34}] \\
    &= \min\,[0 + 16.6,\ 5 + 12.3,\ 10.1 + 7.1,\ 14.1 + 3] \\
    &= 16.6 \quad \text{and } k_4 = 0.
\end{aligned}$$

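These calculations show that the optimal schedule for a planning horizon of
length N is

(6)
$$\begin{array}{lllll}
x_1 = 1 & & & & \text{for } N = 1 \\
x_1 = 2 & x_2 = 0 & & & \text{for } N = 2 \\
x_1 = 1 & x_2 = 2 & x_3 = 0 & & \text{for } N = 3 \\
x_1 = 4 & x_2 = 0 & x_3 = 0 & x_4 = 0 & \text{for } N = 4.
\end{array}$$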


To illustrate how these were found, suppose $N = 3$. Then $k_3 = 1$ in (5) signifies
that an optimal policy for Periods 1, 2, 3 is determined by considering Period 1
by itself, and ordering in Period 2 for both Periods 2 and 3. The value $x_2 = 2$
represents the total demand in these periods. The solution to Period 1 considered
by itself is given by the schedule for $N = 1$, which indicates that production
$x_1 = D_1 = 1$.
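
The recursion (4) and the tracing of regeneration points just described can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than a definitive implementation: the dictionary `c` simply holds the values of $c_{kj}$ quoted in this section for the Selmore example, and the function names `solve` and `schedule` are hypothetical.

```python
# Values of c_kj quoted in the worked example, keyed by (k, j).
c = {
    (0, 1): 5, (0, 2): 10.1, (0, 3): 15.3, (0, 4): 16.6,
    (1, 2): 6, (1, 3): 9.1, (1, 4): 12.3,
    (2, 3): 5, (2, 4): 7.1,
    (3, 4): 3,
}
demand = [1, 1, 1, 1]  # D_1..D_4


def solve(c, n_periods):
    """Forward recursion f_n = min over k < n of [f_k + c_kn], with f_0 = 0."""
    f = [0.0] * (n_periods + 1)
    k_best = [0] * (n_periods + 1)
    for n in range(1, n_periods + 1):
        # Tuples compare by cost first, so min() returns (f_n, k_n).
        f[n], k_best[n] = min((f[k] + c[(k, n)], k) for k in range(n))
    return f, k_best


def schedule(k_best, demand, horizon):
    """Trace regeneration points back from the horizon to obtain x_1..x_N."""
    x = [0] * horizon
    n = horizon
    while n > 0:
        k = k_best[n]
        x[k] = sum(demand[k:n])  # produce in Period k+1 for Periods k+1..n
        n = k
    return x


f, k_best = solve(c, 4)
for horizon in range(1, 5):
    print(horizon, round(f[horizon], 1), schedule(k_best, demand, horizon))
```

Because each $f_n$ is stored together with its minimizing $k_n$, the schedule for every shorter horizon is available from the same pass, which is how the four schedules in (6) are obtained from a single run.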

Observe that the optimal value of $x_1$ (January production) fluctuates up and
down as the horizon lengthens. As you saw in the horizon theorem of the previous
section, this dynamic phenomenon does not occur if the costs are convex functions.
But in the concave cost model, such fluctuations do arise, even in very simple cases,
as you can see in Sec. 9.7. Thus, although the algorithm is not much more complicated
than that for the convex cost case, the dynamic properties of the model do
exhibit more complex behavior.










► It is instructive to draw a network corresponding to the dynamic programming
recursion (4). Let the nodes be 1, 2, ..., 5 and $c_{kj}$ be the unit cost of traversing
arc $(5 - j,\ 5 - k)$. The diagram is given by Fig. 9.7. (Notice it is identical in
structure to Fig. 6.10.) The computations in (5) can be viewed as finding a shortest route to
Node 5 from each of the other nodes, where $f_n$ is the length of the shortest route to
Node 5 from Node $(5 - n)$. Following the calculations in (5), trace this process on the
diagram.

You will find it instructive to compare (4) with the forward dynamic programming
relation (v) given in the advanced material in Sec. 8.3. Observe the connection

$f_n = g_n(0)$.

Thus by knowing the form of an optimal policy, you are able to eliminate completely
the effort of calculating $g_n(i)$ for all $i > 0$.

Without much trouble, you can find an optimal policy under which backlogging is
permitted; that is, $i_t$ is allowed to be negative. Now assume that $h_t(i)$ is concave for both
$i = 0, 1, 2, \ldots$, and for $i = 0, -1, -2, \ldots$; however, $h_t(i)$ need not be concave for
$\ldots, -2, -1, 0, 1, 2, \ldots$. What continues to hold true is that $x_t$ is either zero or the
sum of demands in consecutive periods containing Period t; but here the sum can include
demands prior to, as well as beyond, Period t. The key observation is that you still
seek an optimal sequence of regeneration points at which entering inventory is zero.
Production occurs only once between these periods. The definition of $c_{kj}$ is generalized
to mean

$c_{kj}$ = minimum total cost of producing once during the
Periods $k+1, \ldots, j$ to meet all demand in this interval.

Thus an intermediate optimization is required to find these costs:

$$c_{kj} = \begin{cases}
C_{k+1}(D_{k+1}) & \text{for } j = k+1 \\
\min\limits_{k+1 \le t \le j} \left[ C_t(D_{k+1} + \cdots + D_j) + h_{k+1}(i_{k+1}) + \cdots + h_{j-1}(i_{j-1}) \right] & \text{for } j > k+1.
\end{cases}$$

The resultant values are used in the recursion (4). ◄
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* 9.7 PLANNING HORIZON ANALYSIS FOR 
CONCAVE COST MODEL 


As you already noted, an essential difference between the convex and concave
cost models is that for the concave model the optimal value for $x_1$ may decrease as
the horizon lengthens. This can happen for the simplest examples.

To illustrate, suppose

(1) $C_t(x_t) = s_t + c_t x_t$ for $x_t > 0$ (setup cost plus linear production cost),

where $s_t \ge 0$ and $c_t \ge 0$. Specifically, let

(2) $s_1 = 1 \quad s_2 = 2 \quad s_3 = 4 \quad s_4 = 8 \quad s_5 = 7$
    $c_t = 0$ and $h_t(i_t) = h_t i_t$ with $h_t = h = 1$

and assume

(3) $D_1 = 1 \quad D_2 = 1 \quad D_3 = 2 \quad D_4 = 3 \quad D_5 = 4$.

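Employ the algorithm to verify that

(4)
$$\begin{array}{lllll}
f_1 = 1 & f_2 = 2 & f_3 = 5 & f_4 = 9 & f_5 = 16 \\
k_1 = 0 & k_2 = 0 & k_3 = 1 & k_4 = 2 & k_5 = 4,
\end{array}$$

from which it follows that

(5)
$$\begin{array}{llllll}
x_1 = 1 & & & & & \text{for } N = 1 \\
x_1 = 2 & x_2 = 0 & & & & \text{for } N = 2 \\
x_1 = 1 & x_2 = 3 & x_3 = 0 & & & \text{for } N = 3 \\
x_1 = 2 & x_2 = 0 & x_3 = 5 & x_4 = 0 & & \text{for } N = 4 \\
x_1 = 2 & x_2 = 0 & x_3 = 5 & x_4 = 0 & x_5 = 4 & \text{for } N = 5.
\end{array}$$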

Once again, to show how (5) follows from (4), consider $N = 4$. Remember
$k_n + 1$ represents the latest period in which production occurs when Periods
$1, 2, \ldots, n$ constitute the entire span of the schedule. Since $k_4 = 2$, Periods 1
and 2 form a horizon, and production is scheduled in Period 3 for Periods 3 and
4. Thus $x_3 = D_3 + D_4 = 2 + 3 = 5$. To determine production in the first two
periods, examine the case of $N = 2$. You find $k_2 = 0$, which implies that production
occurs in Period 1, and equals the first two periods’ demands, namely

$x_1 = D_1 + D_2 = 1 + 1 = 2$.
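
As a check on (4) and (5), the example can be run through the algorithm of Sec. 9.6. The Python sketch below is illustrative only: it assumes setup costs 1, 2, 4, 8, 7, zero unit production cost, a linear holding cost of 1 per unit per period, and demands 1, 1, 2, 3, 4, which are the data that reproduce the stated results. The helper names `c`, `solve`, and `schedule` are hypothetical.

```python
setup = [1, 2, 4, 8, 7]      # s_1..s_5
unit = [0, 0, 0, 0, 0]       # c_t = 0
hold = [1, 1, 1, 1, 1]       # h_t = 1, holding cost h_t * i_t
demand = [1, 1, 2, 3, 4]     # D_1..D_5


def c(k, j):
    """c_kj: produce in Period k+1 everything demanded in Periods k+1..j."""
    qty = sum(demand[k:j])
    cost = setup[k] + unit[k] * qty if qty > 0 else 0
    for t in range(k + 1, j):                  # end of Period t, t = k+1..j-1
        cost += hold[t - 1] * sum(demand[t:j])  # stock covering Periods t+1..j
    return cost


def solve(n_periods):
    """f_n = min over k < n of [f_k + c_kn], recording the minimizing k_n."""
    f, k_best = [0] * (n_periods + 1), [0] * (n_periods + 1)
    for n in range(1, n_periods + 1):
        f[n], k_best[n] = min((f[k] + c(k, n), k) for k in range(n))
    return f, k_best


def schedule(k_best, horizon):
    """Trace the regeneration points back from the horizon."""
    x, n = [0] * horizon, horizon
    while n > 0:
        k = k_best[n]
        x[k] = sum(demand[k:n])
        n = k
    return x


f, k_best = solve(5)
print(f[1:], k_best[1:])
for horizon in range(1, 6):
    print(horizon, schedule(k_best, horizon))
```

Running the schedules for successive horizons shows the behavior discussed in the text: first-period production moves from 1 to 2, back to 1, and then to 2 again as the horizon grows from one to four periods.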

Notice that as the planning horizon increases, both $x_1$ and $x_2$ fluctuate up and
down; however, the following planning proposition is true.

Setup Cost Horizon Theorem: Assume (1) holds and $c_t \ge c_{t+1}$ for $t = 1, 2,
\ldots, N - 1$. (a) If $k_n = n - 1$, for $n \ge 2$, then it always is optimal to schedule
Periods $1, 2, \ldots, n - 1$ as a span by itself. (b) If $j \ge n$, then $k_j \ge k_n$.

To illustrate part (a), suppose when you apply the algorithm to the periods
January, February, March, and April, you find $k_4 = 3$. This signifies that for
$N \ge 4$, April demand should be met from April production. Then part (a) states
that it is correct to adopt the optimal production plan for the three-month span
of January, February, and March, irrespective of what demand is beyond April.
Of course, if you lengthen the horizon further, you will obtain more information
about the correct value $x_4$ of April production. But in any case, you know $x_4 > 0$,
that is, the April production level is positive in an optimal schedule for $N \ge 4$.
Notice that part (a) is satisfied for $n = 5$ in the example (4). Thus the values in
(5) of $x_t$, $t = 1, 2, 3, 4$, which are optimal for $N = 4$, are also optimal for $N \ge 5$,
regardless of the values of $D_t$, $t > 5$.

To illustrate the rest of the above proposition, suppose when you apply the
algorithm to periods January through April, you find that $k_4 = 2$. This signifies
that when $N = 4$, April demand should be satisfied from March production, and
the schedule for January and February should be the schedule derived for $N = 2$.
Part (b) states that the demand in any month after April will be filled by production
after February, that is, in March or a subsequent month. Observe that the
example (2) satisfies the hypothesis of the theorem, and accordingly the results
in (4) agree with part (b).

The theorem reduces the amount of computation required by the algorithm.
When the hypotheses of the proposition apply, the recursion can be written as

(6) $$f_n = \min_{k = k_{n-1}, \ldots, n-1} \left[ f_k + c_{kn} \right] \quad \text{for } n = 1, 2, \ldots, N,$$

which cuts down on the search for the minimum. To illustrate, in the example (2),
$f_5$ can be found by considering in (6) only $k = 2, 3, 4$, since $k_4 = 2$ in (4). With
this approach, not all $c_{kj}$ for $k = 0, 1, \ldots, N - 1$, and $k + 1 \le j \le N$, will be
needed in executing (6). Therefore it is advantageous to calculate each $c_{kn}$ only
if it is required. A helpful recursion for this purpose is

(7) $$c_{kn} = c_{k,n-1} + (c_{k+1} + h_{k+1} + \cdots + h_{n-1})(D_n) \quad \text{for } n > k + 1,$$

where we assume the holding cost function is linear, $h_t(i_t) = h_t i_t$.
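
The saving promised by (6) and (7) can be seen in a short Python sketch. It is a minimal illustration, assuming the hypotheses of the theorem hold, that every demand is positive, and that the search for $n = 1$ starts at $k = 0$; the function name `solve_restricted` and the example data (setup costs 1, 2, 4, 8, 7, zero unit cost, unit holding cost 1, demands 1, 1, 2, 3, 4) are assumptions chosen to match the example of this section.

```python
def solve_restricted(setup, unit, hold, demand):
    """Recursion (6), building each c_kn from c_k,n-1 by (7) only when needed."""
    n_periods = len(demand)
    f = [0] * (n_periods + 1)
    k_best = [0] * (n_periods + 1)      # k_best[0] = 0 starts the search
    c = {}                              # c[(k, n)], filled only on demand
    for n in range(1, n_periods + 1):
        d_n = demand[n - 1]
        best = None
        for k in range(k_best[n - 1], n):          # k = k_{n-1}, ..., n-1
            if k == n - 1:
                # Produce in Period n for Period n alone: setup plus unit cost.
                c[(k, n)] = setup[k] + unit[k] * d_n
            else:
                # (7): D_n is made in Period k+1 and carried through Period n-1.
                per_unit = unit[k] + sum(hold[k:n - 1])
                c[(k, n)] = c[(k, n - 1)] + per_unit * d_n
            candidate = (f[k] + c[(k, n)], k)
            if best is None or candidate < best:
                best = candidate
        f[n], k_best[n] = best
    return f, k_best, c


f, k_best, c = solve_restricted(
    setup=[1, 2, 4, 8, 7],
    unit=[0, 0, 0, 0, 0],
    hold=[1, 1, 1, 1, 1],
    demand=[1, 1, 2, 3, 4],
)
print(f[1:], k_best[1:], len(c))
```

The lookup of `c[(k, n - 1)]` is always satisfied because part (b) of the theorem guarantees that the search interval for horizon $n$ lies inside the interval already examined for horizon $n - 1$. In this small example 12 of the 15 possible $c_{kj}$ are evaluated; the saving grows with the horizon length.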

In contrast with the Convex Cost Horizon Theorem, part (a) here gives a
sharper result (it establishes a definite production schedule within a planning
horizon), but part (b) is weaker [since as $N$ increases, $x_1$ may fluctuate, as you saw
in (5)]. Further analytic results about this model can be obtained if you assume
that the production cost (1), the holding cost, and $D_t$ remain the same each period,
and the horizon becomes unbounded. This case will be taken up later in Sec. 12.5.

► Observe that the theorem assumes both the production cost function (1) and $c_t \ge c_{t+1}$.
The example in (5) of Sec. 9.6 shows that part (b) is false for the general concave model.
The following two examples show further that the theorem does not hold if the inequality
condition on $c_t$ is dropped, or if all $C_t(x)$ are the same but do not satisfy (1).

Consider a case where the restriction $c_t \ge c_{t+1}$ in (1) is not satisfied:

(i)
$$\begin{array}{llll}
s_1 = 1 & s_2 = 12 & s_3 = 1 & s_4 = 2 \\
c_1 = 3 & c_2 = 1 & c_3 = 2 & c_4 = 1 \\
D_1 = 1 & D_2 = 4 & D_3 = 2 & D_4 = 2,
\end{array}$$

and assume $h_t(i_t) = 0$. (See if you can guess the optimal schedules for $N = 1, 2, 3, 4$.)
You should apply the algorithm to check that

(ii)
$$\begin{array}{llll}
f_1 = 4 & f_2 = 16 & f_3 = 21 & f_4 = 24 \\
k_1 = 0 & k_2 = 0 & k_3 = 2 & k_4 = 1,
\end{array}$$

which violates part (b). What are the optimal schedules for $N = 1, 2, 3, 4$? (How
accurate were your guesses?) Do any of the production levels increase and later decrease
as the horizon lengthens?

Next consider a case in which $C_t(x)$ and $h_t(i_t)$ are the same in every period:

(iii) $C_t(1) = 5 \quad C_t(2) = 10 \quad C_t(3) = 11$ and $h_t(i_t) = i_t$,

and $D_t = 1$, $t = 1, 2, 3$. Note that $C_t(x)$ does not satisfy the form given in (1). Verify that

(iv)
$$\begin{array}{lll}
f_1 = 5 & f_2 = 10 & f_3 = 14 \\
k_1 = 0 & k_2 = 1 & k_3 = 0,
\end{array}$$

in disagreement with part (b).
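
Both counterexamples can be verified with the general concave cost algorithm of Sec. 9.6. The Python sketch below is a hedged illustration: the function `solve` and its argument names are hypothetical, and for the second example it assumes a holding cost of 1 per unit per period, which is the value that reproduces the stated $f_n$.

```python
def solve(prod_cost, hold_rate, demand):
    """General recursion f_n = min_k [f_k + c_kn] with a linear holding cost.

    prod_cost(t, x) is the cost of producing x units in Period t (1-based).
    """
    n_periods = len(demand)
    f, k_best = [0] * (n_periods + 1), [0] * (n_periods + 1)
    for n in range(1, n_periods + 1):
        candidates = []
        for k in range(n):
            cost = prod_cost(k + 1, sum(demand[k:n]))
            # Stock at the end of Period t covers demand of Periods t+1..n.
            cost += sum(hold_rate * sum(demand[t:n]) for t in range(k + 1, n))
            candidates.append((f[k] + cost, k))
        f[n], k_best[n] = min(candidates)
    return f[1:], k_best[1:]


# First example: setup plus linear cost, unit costs not nonincreasing, h = 0.
s = [1, 12, 1, 2]
cu = [3, 1, 2, 1]
f_a, k_a = solve(lambda t, x: s[t - 1] + cu[t - 1] * x if x > 0 else 0,
                 0, [1, 4, 2, 2])

# Second example: identical concave costs that are not setup plus linear.
table = {0: 0, 1: 5, 2: 10, 3: 11}
f_b, k_b = solve(lambda t, x: table[x], 1, [1, 1, 1])

print(f_a, k_a)
print(f_b, k_b)
```

In each case the sequence of $k_n$ decreases at some point, which is exactly what part (b) of the theorem rules out when its hypotheses hold.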


* 9.8 INVENTORY MODEL WITH 
PRODUCTION SMOOTHING 

The economic optimization embodied in the inventory models you have studied
up to now strikes a balance between the costs of production on the one hand and
the costs of inventory on the other. As a result, within a specified planning horizon,
production is allowed to fluctuate considerably. The very form of an optimal
policy for the concave cost model of Sec. 9.5 is batch ordering. This sort of
policy is often most appropriate when a firm produces many different items.
By the same token, when a company purchases items for inventory from outside
vendors, it is usually sensible to order in lot sizes large enough to cover the
requirements of several periods’ demand. If the situation under analysis pertains to
scheduling one of a firm’s main product lines, however, one should take account
of an important economic consideration that has been ignored so far. This factor
is the cost of varying production levels from one period to the next.

Conceptually, it is simple enough to add this consideration to the general
model in Chap. 8. Keeping the same symbols $x_t$ for production and $i_t$ for ending
inventory at Period t, the objective function is revised to

(1) minimize $\sum_{t=1}^{N} c_t(x_t, i_t, x_{t-1})$.

For example,

(2) $c_t(x_t, i_t, x_{t-1}) = c_t(x_t, i_t) + v_t(x_t, x_{t-1})$,

where the function $v_t$ on the right represents the cost of a variation in production
between Periods t and t − 1.

Two simple illustrations of a smoothing cost function are

(3) $v_t(x_t, x_{t-1}) = v_t (x_t - x_{t-1})^2$, where $v_t > 0$,

and

(4)
$$v_t(x_t, x_{t-1}) = \begin{cases}
a_t (x_t - x_{t-1}) & \text{for } x_t - x_{t-1} \ge 0 \quad (a_t \ge 0) \\
b_t (x_{t-1} - x_t) & \text{for } x_t - x_{t-1} \le 0 \quad (b_t \ge 0).
\end{cases}$$

When production smoothing is of consequence in planning, the analysis is often
undertaken by measuring demand, production, and inventory in labor units, such
as man-weeks. Then, if the representation (4) is appropriate, the value of $a_t$
relates to the cost of increasing the labor force, whereas $b_t$ relates to the cost of
reducing it. (Assume $a_t + b_t > 0$.)

What changes are required in the dynamic programming recursion to accommodate
this new factor? Recall that the relation previously used in Chap. 8 was
written as

(5) $$f_n(i) = \min_x \left[ c_n(x, i + x - d_n) + f_{n-1}(i + x - d_n) \right] \quad \text{for } n = 1, 2, \ldots, N,$$

where $i = 0, 1, \ldots, d_1 + \cdots + d_n$ and the minimization is over only nonnegative
integer values of $x$ in the range $d_n - i \le x \le d_1 + \cdots + d_n - i$, assuming
inventory is 0 at the end of the horizon. Now, knowledge of entering inventory is
not sufficient to characterize the state of the system at the beginning of a period.
You also need to know the production level in the previous period. Therefore, the
state variable for this model must contain both these dimensions. The rest of the
alterations are straightforward.

Let

$f_n(i, y)$ = minimum policy cost when entering inventory is at level $i$ and
previous production at level $y$, with $n$ more periods to go

$x_n(i, y)$ = a production level yielding $f_n(i, y)$.

The new recursion in place of (5) is

(6) $$f_n(i, y) = \min_x \left[ c_n(x, i + x - d_n, y) + f_{n-1}(i + x - d_n, x) \right] \quad \text{for } n = 1, 2, \ldots, N.$$

What is sought is the value of $f_N(i_0, y_0)$, where $y_0$ is the level of production prior
to Period 1. The calculations are initiated with the values of

(7) $f_1(i, y) = c_1(d_1 - i, 0, y)$ for $i = 0, 1, \ldots, d_1$ and $y = 0, 1, \ldots, d_1 + d_2$.

Do you see any important differences between (5) and (6)? There is one.
Whereas previously you needed to obtain $f_n(i)$ for all possible values of $i$, here you
must calculate $f_n(i, y)$ for all possible pairs of $i$ and $y$. Although the dynamic
programming approach is still vastly superior to complete enumeration, the added
dimension to the state variable implies a significant increase in the burden of
calculations, as is demonstrated in an example below. Thus it is conceptually
possible to generalize the recursive technique to take into account even more
considerations, such as the production level two periods earlier, or the change in
the level of inventory; but the resultant increase in computations quickly makes
the approach impractical for real situations. If the application demands that you
consider a multitude of interrelationships, you will need to use other solution
techniques, such as linear and nonlinear programming, or you will have to exploit
the form of an optimal policy.

The Surety Manufacturing Company Example. Assume the
demand, production and inventory constraints, and cost data for this firm are

(8) $D_t = 3$ (stationary demand),

(9) $x_t = 0, 1, \ldots, 5 \quad i_t = 0, 1, \ldots, 4$ and no ending inventory,

(10) $c_t(x_t, i_t, x_{t-1}) = C(x_t) + h i_t + v (x_t - x_{t-1})^2$ (stationary cost),

where

(11) $C(0) = 0 \quad C(1) = 15 \quad C(2) = 17 \quad C(3) = 19 \quad C(4) = 21 \quad C(5) = 23$,

(12) $h = 1$ and $v = 1$,

for all periods. [These data are identical with the example analyzed in Sec. 8.4
except for the third term, the smoothing cost function, on the right of (10).] The recursion
(6) can be written as

(13) $$f_n(i, y) = \min_x \left[ C(x) + 1(i + x - 3) + (x - y)^2 + f_{n-1}(i + x - 3, x) \right] \quad \text{for } n = 1, 2, \ldots, N,$$

where $i = 0, 1, \ldots, 4$, $y = 0, 1, \ldots, 5$, and the minimization is over only
nonnegative integer values in the range $3 - i \le x \le \text{minimum}\,(5,\ 7 - i)$.

The calculations for $n = 1$ in (7) are displayed in Fig. 9.8. There must be six
pairs of columns, one for each possible $y = 0, 1, \ldots, 5$. The shaded parts of the
table indicate impossible combinations of $i$ and $y$ due to the production and
inventory constraints in (9). For example, if $y = 0$, then entering inventory cannot be
greater than 1, for at most 4 units of inventory could have entered the previous
period and the previous demand would have reduced the amount by 3. Analogously,
if $y = 5$, then entering inventory must be at least 2, again because demand
would have reduced the amount by 3. Since ending inventory is 0 by assumption,
the optimal production $x_1(i, y)$ depends only on $i$ and is independent of $y$. As you
will observe next, $x_n(i, y)$ does depend on $y$ for $n > 1$.

Figure 9.8. Surety Manufacturing Company Production Smoothing
Model (n = 1). Entries are $f_1(i, y) = C(3 - i) + [(3 - i) - y]^2$, tabulated by
entering inventory $i$ and previous production $y$.

The computations for $n = 2$ and $y = 2$ are exhibited in Fig. 9.9. Study the
entries in the table to see how they are derived from the dynamic programming
recursion (13). In particular, notice that the third entry in each box of the main
part of the table comes from Fig. 9.8. For example, if $i = 0$ and $x = 3$, then the
amount 19 is the value of $f_1(i + x - 3, x) = f_1(0, 3)$ contained in the $i = 0$ row
and $y = 3$ right column of Fig. 9.8. Observe also that the row for $i = 4$ is shaded
out in Fig. 9.9, because entering inventory in the previous period could at most
be 4 units. Given that production $y = 2$ and demand is 3, then at most 3 units
could enter the present period.

Figure 9.9. Surety Manufacturing Company Production Smoothing
Model (n = 2, y = 2).

It should now be evident that the computational burden here is roughly six times
as great as that for the general model in Chap. 8, since the calculations such as
are shown in Fig. 9.9 are performed for each possible value of $y = 0, 1, \ldots, 5$.

The complete set of $f_2(i, y)$ and corresponding $x_2(i, y)$ are given in Fig. 9.10.
Test your understanding of the recursion (13) by verifying the values of $f_2(i, 5)$.
For convenience, the final column of Fig. 9.10 shows the optimum production
levels from the earlier analysis in Sec. 8.4 that did not contain a production
smoothing cost. Study the difference between the two sets of solutions. Notice that when
$n = 2$, it is no longer optimal to let production be 5 when entering inventory is
1. Now it is preferable to schedule 3 if the previous production level is 3 or more,
and otherwise to schedule 2. In contrast, it is no longer optimal to let production
be 0 when entering inventory is 4. Now it is preferable to produce 2.

                               Previous Production
Entering        y = 0       y = 1       y = 2       y = 3       y = 4       y = 5     Fig. 8.10
Inventory i    x2   f2     x2   f2     x2   f2     x2   f2     x2   f2     x2   f2      x2(i)
    0           3   47      3   42      3   39      3   38      3   39      3   42        3
    1           2   41      2   38      2   37    2,3   38      3   39      3   42        5
    2           -    -      2   36      2   35      2   36      2   39      4   41        4
    3           -    -      -    -    0,3   32      3   31      3   32      3   35        0
    4           -    -      -    -      -    -      2   25      2   28      2   33        0

(Columns x2 and f2 give $x_2(i, y)$ and $f_2(i, y)$; a hyphen marks a shaded, impossible
combination of $i$ and $y$.)

Figure 9.10. Surety Manufacturing Company Production Smoothing
Model (n = 2).
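
The entries of Figs. 9.8 and 9.10 can be regenerated from recursion (13). The Python sketch below is one way to do so under stated assumptions: the requirement of no ending inventory is encoded by setting the cost with zero periods to go to 0 when inventory is 0 and to infinity otherwise, every pair (i, y) is evaluated (including pairs that the figures shade as impossible), and the names `C`, `solve`, `f`, and `x_opt` are illustrative.

```python
import math

C = [0, 15, 17, 19, 21, 23]   # production cost C(x) for x = 0..5, from (11)
D, H, V = 3, 1, 1             # demand, holding cost h, smoothing weight v
MAX_X, MAX_I = 5, 4           # production and inventory limits from (9)


def solve(periods):
    """Return f[n][(i, y)] and the list of minimizing x in x_opt[n][(i, y)]."""
    # With no periods to go, any leftover inventory is disallowed.
    f = {0: {(i, y): (0 if i == 0 else math.inf)
             for i in range(MAX_I + 1) for y in range(MAX_X + 1)}}
    x_opt = {}
    for n in range(1, periods + 1):
        f[n], x_opt[n] = {}, {}
        for i in range(MAX_I + 1):
            for y in range(MAX_X + 1):
                best, arg = math.inf, []
                lo, hi = max(0, D - i), min(MAX_X, MAX_I + D - i)
                for x in range(lo, hi + 1):
                    nxt = i + x - D                      # ending inventory
                    cost = (C[x] + H * nxt + V * (x - y) ** 2
                            + f[n - 1][(nxt, x)])
                    if cost < best:
                        best, arg = cost, [x]
                    elif cost == best and cost < math.inf:
                        arg.append(x)                    # keep ties
                f[n][(i, y)], x_opt[n][(i, y)] = best, arg
    return f, x_opt


f, x_opt = solve(2)
for i in range(MAX_I + 1):
    print(i, [(x_opt[2][(i, y)], f[2][(i, y)]) for y in range(MAX_X + 1)])
```

The two nested loops over $i$ and $y$ make the roughly sixfold increase in work over the single-state recursion of Chap. 8 directly visible.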

The dynamic impact of imposing a smoothing cost can be seen in Fig. 9.11.

Figure 9.11. Optimal Production Schedules for Planning Horizon N = 2
and Previous Production $x_0 = 4$.

Observe that when initial inventory, $i_0$, is 1 or 2, the smoothing schedule calls for
production in both January and February, whereas the previous schedule produced
in only January. When $i_0$ is 3 or 4, the smoothing schedule calls for production in
January, whereas the previous schedule produced the same amount in February.
The preference toward January production arises because, in the present model,
it is expensive to let production fall to zero and then increase it again later.

► Review Sec. 6.7, which contains a description of an employment scheduling
smoothing problem closely related to the preceding model. The decision variable $x_{ij}$ of that
situation was the number of crews beginning work at the start of Period i and terminating
at the start of Period j. The constraint to be met was to have at least $R_k$ crews available
in Period k, where $k = 1, 2, \ldots, n - 1$. The cost associated with $x_{ij}$ included wages,
as well as recruiting, transporting, and briefing. The nonwage expenses occasionally
made it desirable to have a work force above the minimal requirements. Thus the
objective was to establish a feasible work force schedule that minimized total labor
expenses.

The following connection can be drawn between that situation and the smoothing
model above. Let

(i) $$x_t = \sum_{j=t+1}^{n-1} x_{tj} \quad \text{and} \quad D_t = R_t,$$

so that the total number of crews hired at the beginning of Period t is aggregated. In
this application, the services of a crew cannot be inventoried from one period to the
next. Therefore, if a schedule has crews in excess of the requirements $R_k$ in Period k,
the surplus of labor cannot be stored until Period $k + 1$. Hence, the dimension of the
state variable in the recursion (6) can be reduced by eliminating $i$. The resulting dynamic
programming relation is

(ii) $$g_n(y) = \min_{x \ge D_n} \left[ g_{n-1}(x) + c_n(x, 0, y) \right] \quad \text{for } n > 1$$

(iii) $$g_1(y) = \min_{x \ge D_1} c_1(x, 0, y) \quad \text{for } n = 1.$$

The value of $g_N(y_0)$ is sought.

Observe the differences between (ii) and the problem in Sec. 6.7. The earlier model
has the flexibility of assessing a smoothing cost that depended on both the beginning and
the length of the interval (from the start of Period i to the start of Period j). The cost
$c_{ij}$ was charged for each crew, however, so that the cost was a linear function of the
number of crews hired for the interval.

The present model with cost functions $c_t(x_t, 0, x_{t-1})$ has the opposite flexibility. It
assesses a smoothing cost only against the change in employment level between two
successive periods, independent of how long new crews remain engaged or how long the
released crews were employed. But this model does permit the employment costs within
a period to change nonlinearly with the total number of crews utilized. In addition,
it allows the variation in total smoothing cost to depend nonlinearly on the actual
employment levels in two successive periods.

If all the costs are linear,

(iv) $c_{ij} = c_i + c_{i+1} + \cdots + c_{j-1} + a_i + b_j$,

where in Period t, for each crew, $c_t$ is the wage cost, $a_t$ the hiring cost, and $b_t$ the firing
cost, then an optimal schedule can be found by either approach. However, in such a
situation further analysis of (ii) leads to a very simple algorithm that is related to the
approach used in the convex cost inventory model of Sec. 9.3. ◄
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REVIEW EXERCISES 

1 Verify the claims in Sec. 9.1 that

(a) Formulas (3) and (4) are equivalent. Give a verbal interpretation of (4).

(b) Requiring $i_t \ge 0$, where $i_t$ is given by (4), implies that demand is satisfied on time.

2 (a) Give a verbal interpretation of the definitions (3) and (4) in Sec. 9.2. 

(b) Draw a sketch of the functions described in Cases i through vi in Sec. 9.2 (let 
a = 2, b = 1 , and r = 3 in the first four cases, and use the parameters given in 
the text for the last two cases). 

(c) Draw a sketch of the function described in Case w, letting the parameters be 

= 1, = 4, ^2 = 3, W 2 = 6, and Cj = 2. 

3 Write the cost functions $C_t(x_t)$ as given by (3) and (4) in Sec. 9.3 for each of the six
periods using the data in Fig. 9.2.

Exercises 4 through 7 refer to the case of the Highway Rubber Company
shown in Fig. 9.2. Reapply the algorithm to the new data and calculate the
incremental effect on total cost. Be sure to indicate total production $x_t$ and ending
inventory $i_t$ in each Period t.

4 (a) Assume January demand is 2 instead of 1. 

(b) Assume May demand is 1 instead of 0. 

(c) Suppose demand in every period remains the same, except in Period t. How large
can $D_t$ be such that there still exists a feasible schedule? (Answer this question
for $t = 1, 2, \ldots, 6$.)

5 (a) Assume regular-time production capacity in January is 4, instead of 3.

(b) Assume regular-time production capacity in January is 5, instead of 3. 

(c) Assume overtime production is unavailable in March and April. 

6 In each part, explain the impact on the solution in Fig. 9.3 if the cost parameters are 
changed as indicated below. 

(a) Production costs in each period are identical. (For example, two per item at
regular time, and five per item at overtime.)

(b) Overtime costs equal regular-time costs. (For example, two per item in Period 1,
with a total capacity of 9 items; 4 per item in Period 2, with a total capacity of
four items, etc.)

(c) Holding cost is $h = 6$ per item.

(d) Holding cost is $h = .1$ per item.

*7 Suppose the cumulative demands $R_p$, defined in (1) of Sec. 9.4, satisfy

$1 \le R_1 \le 1 \quad 1 \le R_2 \le 7 \quad 6 \le R_3 \le 14$

$10 \le R_4 \le 20 \quad 10 \le R_5 \le 25 \quad 12 \le R_6 \le 27$.

Find the associated values for $x_p(S)$ and $x_p(T)$, specified in the Convex Cost Horizon
Theorem. Draw a diagram like that in Fig. 9.4. Can you give an optimal value for
production in Period 1? Explain why.

*8 Consider the special convex cost model given in the advanced material at the end of
Sec. 9.4. Find an optimal schedule when demand requirements are

(a) $D_1 = 1$, $D_2 = 2$, $D_3 = 7$, $D_5 = 0$, and $D_6 = 2$. Let $D_4 = 5$. Let $D_4 = 4$. Let
$D_4 = 3$. Let $D_4 = 2$.

(b) $D_1 = 1$, $D_2 = 2$, $D_4 = 6$, $D_5 = 0$, and $D_6 = 2$. Let $D_3 = 11$. Let $D_3 = 12$.

(c) $D_1 = 1$, $D_2 = 2$, $D_3 = 7$, $D_4 = 6$, and $D_6 = 2$. Let $D_5 = 1$. Let $D_5 = 2$. Let
$D_3 = 4$.

(d) $D_3 = 7$, $D_4 = 6$, $D_5 = 0$, and $D_6 = 2$. Let $D_1 = 2$ and $D_2 = 1$. Let $D_1 = 3$ and
$D_2 = 0$.

9 Suppose a vendor offers the following price-break (quantity-discount) structure: $10
per item for any amount ordered up to a dozen units; $8 per item (that is, for every
item) for a total amount ordered above a dozen and up to a gross; $5 per item for a
total amount ordered beyond a gross. Plot the total purchase cost as a function of the
number of items ordered. Is this a concave cost function? Why? Is the cost function
concave if a setup cost is added? Why?

10 (a) Explain why the form of an optimal policy given for the lot-size model with
concave costs in Sec. 9.5 implies that there always exists an optimal policy with
the property that, in each period, if entering inventory is positive, then no
production is scheduled.

(b) Given this form of an optimal policy, calculate exactly how many distinct
production schedules need to be considered. For the sake of definiteness, assume that
every $D_t > 0$ (recall initial inventory $i_0 = 0$).

(c) Calculate how many different $c_{kj}$, given by (1) in Sec. 9.6, have to be calculated
in order to apply the algorithm for the general concave cost model.

N 

*11 The validity of the form of an optimal policy for the lot-size model with concave
costs is established in the special material at the end of Sec. 9.5. The argument
involves either adding the amount $i_{p-1}$ to $x_p$, or adding the amount $x_p$ to an earlier
period’s production level. Given a schedule in which $i_{t-1} x_t > 0$ for one or more
periods, is the alternative policy suggested by the argument optimal? Justify your
answer.


12 Selmore Company (Sec. 9.6). Consider the example in Fig. 9.5. Find an optimal policy
if the holding cost function is $h_t(i_t) = h_t i_t$ where, in all periods,

(a) $h_t = 1$.

(b) $h_t = 2$.


13 In the concave cost model of Secs. 9.5 and 9.6 suppose that demand $D_t = 1$,

$$C_t(x_t) = \begin{cases}
5 x_t & \text{for } x_t = 0, 1, 2, 3 \\
12 + x_t & \text{for } x_t \ge 4,
\end{cases}$$

and $h_t(i_t) = h_t i_t$ in all periods. (Note the example resembles the Selmore Company
Problem, where the production cost function is the same as that for $t = 1$ in Fig. 9.5.)
Find an optimal policy for $h_t = .1$ in all periods, when the horizon length is

(a) $N = 4$. (e) $N = 8$.

(b) $N = 5$. (f) $N = 9$.

(c) $N = 6$. (g) Do the same for all $h_t = 1$ and $N = 9$.

(d) $N = 7$. (h) Do the same for all $h_t = 2$ and $N = 9$.


14 Answer the questions in exercise 13 where


Ci{xi) 


JO for Xi = 0 

(3 + for x^^l. 


15 Answer the questions in exercise 13 where demand $D_t = 2$ in every period.


16 Answer the questions in exercise 13, except assume demand alternates over the entire
horizon as follows:

(a) $D_1 = 1, D_2 = 2, D_3 = 1, D_4 = 2, \ldots$.

(b) $D_1 = 2, D_2 = 1, D_3 = 2, D_4 = 1, \ldots$.

17 Answer the questions in exercise 13, except assume that the holding costs alternate as
follows:

(a) $h_1 = 1, h_2 = 2, h_3 = 1, h_4 = 2, \ldots$.

(b) $h_1 = 2, h_2 = 1, h_3 = 2, h_4 = 1, \ldots$.

*18 The advanced material at the end of Sec. 9.6 asserts that the dynamic programming
recursion (4) can be viewed as finding a shortest route in an acyclic network. Follow
the calculations in (5) and trace the process in Fig. 9.7; show the connection with a
shortest-route algorithm for an acyclic network.

*19 Suppose you can backlog demand and fill it at a later period; thus $i_t$ is allowed to be
negative (for $t < N$). (All demand must be met by the end of the horizon.) Answer
the questions in exercise 13, parts (a) through (f), where the inventory holding and
backlog penalty cost function is $h_t(i_t) = h_t |i_t|$ for

(a) $h_t = .1$.

(b) $h_t = 1$.

(c) $h_t = 2$.

*20 Consider the example (1) through (3) in Sec. 9.7. Verify the results in (4) and (5).

*21 Compare the planning information given by the Convex Cost Horizon Theorem in
Sec. 9.4 and by the Setup Cost Horizon Theorem in Sec. 9.7.


*22 Consider the Setup Cost Horizon Theorem in Sec. 9.7, and assume the first month in
the planning horizon is January.

(a) What does = 5 imply about an optimal schedule, according to part (a) of the 
theorem? 
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(b) What does = 3 imply about an optimal schedule, according to part (b) of the 
theorem? 

♦23 (a) Verify the validity of the recursion (7) in Sec. 9.7 for the case where the holding 
cost function is linear. 

(b) Modify (7) for the case where the holding cost is a general concave function. 

♦24 Verify the results given in the advanced material at the end of Sec. 9.7, and explain 
why part (b) of the Setup Cost Planning Horizon Theorem is not satisfied. 

(a) (ii). 

(b) (iv). 

*25 Surety Manufacturing Company (Sec. 9.8). The dynamic programming recursion (13) 
shows how to incorporate a quadratic smoothing cost function. 

(a) Rewrite (13) to incorporate a piecewise linear smoothing cost function (4), with 

flj = »= 1 for every period. 

(b) Using your answer in part (a), calculate tables like those in Figs. 9.8, 9.10, and 
9.11. 

*26 Suggest at least two smoothing cost functions that are reasonable alternatives to (3) 
and (4) in Sec. 9.8. 

*27 Explain how to modify the recursion (6) in Sec. 9.8 for the case where the cost function 
in Period t depends on $x_{t-2}$, as well as on $i_t$, $x_t$, and $x_{t-1}$. Assuming this dependency 
is added to the case of Surety Manufacturing Company, indicate how the number of 
computations will increase as compared to those required in Figs. 9.8, 9.9, and 9.10. 

28 Explain your understanding of the following terms: 

convex function                          setup cost (or lump-sum cost) 
concave function                         quantity discounts (or price breaks) 
decreasing (increasing) incremental      regeneration event (point) 
  returns to scale                       *batch ordering 
*planning horizon analysis               *smoothing cost 
dynamic lot-size model 


COMPUTATIONAL EXERCISES 

Exercises 29 through 37 refer to the inventory model with convex costs, as described 
in Sec. 9.3. (Recall that initial inventory $i_0 = 0$.) 

29 Assume the production cost functions are given by Fig. 9.12. For example, the total 
cost in Period 1 associated with $x_1 = 12$ is 32 (= 1·5 + 3·5 + 2·6). Assume the 
holding cost functions are $h_t(i_t) = h_t i_t$, where $h_1 = 1$, $h_2 = 2$, and $h_3 = 1$. Suppose 
the demand requirements are 

$D_1 = 10 \qquad D_2 = 3 \qquad D_3 = 17 \qquad D_4 = 23.$ 

                     Production Cost of Unit k 

Production of 
Unit k            Period 1   Period 2   Period 3   Period 4 

 1 ≤ k ≤ 5           1          2          1          4 
 6 ≤ k ≤ 10          3          3          6          4 
11 ≤ k ≤ 15          6          5          7          8 
16 ≤ k ≤ 20         10          8         12          9 

Figure 9.12 


(a) Find an optimal production schedule. Be sure to indicate total production and 
ending inventory ii in each Period t. 

*(b) Find an optimal production schedule if you impose the constraint ii ^ 5 in every 
period. 

(c) Indicate the impact of requiring an additional unit in Period 1 (that is, let 
$D_1 = 11$). In Period 2. In Period 3. In Period 4. 

(d) Find an optimal production schedule where all $h_t = 0$. Where all $h_t = 5$. 

(e) Find an optimal production schedule where the demand requirements are revised 
such that $D_1 = 3$ and $D_2 = 10$. Such that $D_3 = 23$ and $D_4 = 17$. Such that 
$D_2 = 17$ and $D_3 = 3$. 

*30 Consider the data in exercise 29, part (a). Suppose the cumulative demands $R_p$, 
defined in (1) of Sec. 9.4, satisfy 

$\sum_{t=1}^{p} D_t - k \le R_p \le \sum_{t=1}^{p} D_t + k.$ 

In each part below, find the associated values for $X_p(S)$ and $X_p(T)$, specified in the 
Convex Cost Horizon Theorem, indicate the associated production levels and ending 
inventories, and draw a diagram like that in Fig. 9.4. Explain whether or not you can 
give an optimal value for production in Period 1. Let 

(a) $k = 1$. 

(b) $k = 2$. 

(c) $k = 3$. 

*31 Consider the data in exercise 29, part (a). Suppose the cumulative demands $R_p$, 
defined in (1) of Sec. 9.4, satisfy 

$\sum_{t=1}^{p} D_t - pk \le R_p \le \sum_{t=1}^{p} D_t + pk.$ 

In each part below, find the associated values for $X_p(S)$ and $X_p(T)$, specified in the 
Convex Cost Horizon Theorem, indicate the associated production levels and ending 
inventories, and draw a diagram like that in Fig. 9.4. Explain whether you can give 
an optimal value for production in Period 1. Let 

(a) $k = 1$. 

(b) $k = 2$. 

(c) $k = 3$. 

32 Assume the planning horizon $N = 3$ periods and that the production and inventory 
cost functions are $C_t(x_t) = x_t^2$ and $h_t(i_t) = h_t i_t$ for every Period t. In each part below, 
find an optimal schedule for production and indicate ending inventory for each period. 

(a) Assume $h_t = 0$ and the demand requirement is $D_t = 6$ for every Period t. 

(b) Assume $h_t = 0$ for every Period t, and $D_1 = 11$, $D_2 = 5$, and $D_3 = 2$. 

(c) Assume $h_t = 0$ for every Period t, and $D_1 = 2$, $D_2 = 5$, and $D_3 = 11$. 

*(d) Assume the data in part (c), and impose the constraint that production xi ^ 5 
in every Period t. 

*(e) Assume the data in part (c), and impose the constraint that ending inventory 
I 4 ^ 3 in every Period t. 

(f) Assume hi = 3 in every Period t and the same demand data as in part (c). Next, 
assume hi = 11 in every Period t. Compare the results for hi = 0, 3, 11. 

(g) Assume = 0 in every Period t, the same demand data as in part (c), and 

^(^ 3 ) = I.SaTj. Next assume hi = 3. Finally assume hi = 11. Compare your 

results with those in part (f). 

*33 Assume the production and inventory holding cost functions in exercise 32. Suppose 
the cumulative demands Rp, defined in ( 1 ) of Sec. 9.4, satisfy 

1 ^ ^ 3 6 < /?2 < 8 23 < /?3 < 29. 

Let hi = 0. Find the associated values for Xp{S) and Xp{T), specified in the Convex 
Cost Horizon Theorem, indicate the associated production levels and ending inven¬ 
tories, and draw a diagram like that in Fig. 9.4. Explain whether or not you can give 
an optimal value for production in Period 1. Do the same for hi = 3 and for = 11. 

*34 Assume that backlogging is permitted, and that the production cost and the inventory 
holding and backlog penalty cost functions are 

. n {^tU if H ^ 0 

— 1 for every Period t. 

l-M ifh <0 

In each part below, find an optimal schedule for production, and indicate ending 
inventory or backlog for each period. (All demand must be met by the end of the 
horizon.) 

(a) Assume hi— pi = Q for every Period t, and demand requirements are Dx = 2, 
D 2 = 5, and D^— 11. 

(b) Assume = 0 for every Period t, and demand requirements are Di = 11, 

D 2 = 5, and D 3 = 2. 

(c) Assume the same demand data as in part (b), and = 0 and pi = 3 for every 
Period t. Next, assume = 0 and = 11 for every Period t. 

(d) Assume hi = pi = 0 for every Period t, and Di = 2, D 2 = 11, and = 5. 

(c) Assume the same demand data as in part (d), and hi = pi = 3 for every Period t. 
Next assume hi — pi— 11 for every Period t. Finally assume hi= 2 and pt = ^ 
for every Period t. 

35 Assume the planning horizon $N = 8$, and that the production and inventory cost 
functions are $C_t(x_t) = x_t^2$ and $h_t(i_t) = h_t i_t$ for every Period t. Assume the demand 
requirements are 

$D_1 = 4 \quad D_2 = 20 \quad D_3 = 2 \quad D_4 = 0 \quad D_5 = 6 \quad D_6 = 4 \quad D_7 = 1 \quad D_8 = 3.$ 

In each part below, find an optimal schedule for production and indicate ending 
inventory for each period. 

(a) Assume $h_t = 0$ for every Period t. 

(b) Assume $h_t = 1$ for every Period t. Next assume $h_t = 3$ for every Period t. 

(c) Assume = ij for every Period t. 

(d) Assume = 0 for every Period t. Also assume Ct{xi) = for / = 2; then for 
t = 4, instead; then for / = 7, instead. 

♦ (e) Assume hi — 0 for every Period t, and the planning horizon N = 9, What is the 
largest value for the demand requirement Dg such that the optimal production 
level for found in part (a) remains unchanged? Answer the same question for 
a: 3 ? Answer the same question for X|? 

*(f) Assume hi = 1 for every Period t. Answer the same questions as those in part (e). 
*(g) Assume /if = 3 for every Period t. Answer the same questions as those in part (e). 

*36 Assume the production and inventory holding cost functions in exercise 35. Assume 
$h_t = 0$ for every Period t. Suppose the cumulative demands $R_p$, defined in (1) of 
Sec. 9.4, are as specified in each part below. Find the associated values for $X_p(S)$ and 
$X_p(T)$, specified in the Convex Cost Horizon Theorem, indicate the associated production 
levels and ending inventories, and draw a diagram like that in Fig. 9.4. Explain 
whether you can give an optimal value for production in Period 1. 

(a) $\sum_{t=1}^{p} D_t - k \le R_p \le \sum_{t=1}^{p} D_t + k$. Do for $k = 1$ and $k = 2$. 

(b) $\sum_{t=1}^{p} D_t - pk \le R_p \le \sum_{t=1}^{p} D_t + pk$. Do for $k = 1$ and $k = 2$. 

(c) Perform the same analysis with $h_t = 1$ for every Period t. 

*37 Assume that backlogging is permitted, and that the demand requirements and 
production cost function are the same as in exercise 35, and that the inventory holding 
and backlog penalty cost function is 

$$h_t(i_t) = \begin{cases} h_t i_t & \text{if } i_t \ge 0 \\ -p_t i_t & \text{if } i_t \le 0. \end{cases}$$

(All demand must be met by the end of the horizon.) In each part below, find an 
optimal schedule for production, and indicate ending inventory or backlog for each 
period. Let all 

(a) $h_t = p_t = 0$. (d) $h_t = 1$ and $p_t = 3$. 

(b) $h_t = p_t = 1$. (e) $h_t = 3$ and $p_t = 1$. 

(c) $h_t = p_t = 3$. 

Exercises 38 through 43 refer to the inventory model with concave costs, as 
described in Sec. 9.5. (Recall that initial inventory $i_0 = 0$.) 

38 Assume the planning horizon $N = 3$, and the production and inventory cost functions 


are 


Ci(xi) = 12 + Sxi for X, ^ 1 

= 2 -f 9x2 for J'fs ^ 1 

^s(^s) = 5 + lOxj for Xj ^ 1 


and $h_t(i_t) = h_t i_t$. 
Assume the demand requirements are $D_1 = 0$, $D_2 = 3$, and $D_3 = 20$. In each part, 
find an optimal production schedule, and indicate ending inventory in each period. 
Assume all 

(a) $h_t = 0$. 

(b) $h_t = 1$. 

(c) $h_t = 2$. 

39 Assume the planning horizon $N = 4$, and the production cost functions are given by 
Fig. 9.13. For example, the total cost associated with $x_1 = 12$ is 86 (= 10·5 + 6·5 
+ 2·3). Assume the holding cost functions are $h_t(i_t) = h_t i_t$, where $h_1 = 1$, $h_2 = 2$, 
and $h_3 = 1$. Suppose the demand requirements are 

$D_1 = 10 \qquad D_2 = 3 \qquad D_3 = 17 \qquad D_4 = 23.$ 

                     Production Cost of Unit k 

Production of 
Unit k            Period 1   Period 2   Period 3   Period 4 

 1 ≤ k ≤ 5          10          8         12          9 
 6 ≤ k ≤ 10          6          5          7          8 
11 ≤ k ≤ 15          3          3          6          4 
16 ≤ k ≤ 20          1          2          1          4 

Figure 9.13 

(a) Find an optimal production schedule. Be sure to indicate total production Xi and 
ending inventory ii in each Period t. 

(b) Indicate the impact of requiring an additional unit in Period 1 (that is, let 
Di = 11). In Period 2. In Period 3. In Period 4. 

(c) Find an optimal schedule where all hi = 0. Where all hi = 5. 

(d) Find an optimal production schedule where the demand requirements are revised 
such that $D_1 = 3$ and $D_2 = 10$. Such that $D_3 = 23$ and $D_4 = 17$. Such that 
$D_2 = 17$ and $D_3 = 3$. 

(e) Find an optimal production schedule where the holding cost function is hiiii) = 

for every Period t. 

*40 Consider the data in exercise 39, and assume that backlogging is permitted; let the 
inventory holding and backlog penalty cost function be 

$$h_t(i_t) = \begin{cases} h_t i_t & \text{if } i_t \ge 0 \\ -p_t i_t & \text{if } i_t < 0 \end{cases} \qquad \text{for every Period } t.$$

(All demand must be met by the end of the horizon.) In each part, find an optimal 
schedule for production, and indicate ending inventory or backlog in each period. 
Assume all 

(a) $h_t = p_t = 0$. 

(b) $h_t = p_t = 1$. 

(c) Suppose instead that the inventory holding and backlog penalty cost function is 
$h_t(i_t) = \sqrt{|i_t|}$ for every Period t. 

41 Assume the planning horizon $N = 8$, and the production cost function is $C_t(x_t) =$ 
for every Period t, and the demand requirements are 

$D_1 = 4 \quad D_2 = 20 \quad D_3 = 2 \quad D_4 = 0 \quad D_5 = 6 \quad D_6 = 4 \quad D_7 = 1 \quad D_8 = 3.$ 

In each part below, find an optimal schedule for production and indicate ending 
inventory in each period. Assume for every Period t that the holding cost function is 

(a) $h_t(i_t) = 0$ for every Period t. 

(b) $h_t(i_t) = i_t$ for every Period t. 

(c) $h_t(i_t) = \sqrt{i_t}$ for every Period t. 

*42 Consider the same data as in exercise 41, and assume that backlogging is permitted; 
let the inventory holding and backlog penalty cost function be 

$$h_t(i_t) = \begin{cases} h_t i_t & \text{if } i_t \ge 0 \\ -p_t i_t & \text{if } i_t < 0 \end{cases} \qquad \text{for every Period } t.$$

(All demand must be met by the end of the horizon.) In each part, find an optimal 
schedule for production, and indicate ending inventory or backlog in each period. 
Assume all 

(a) $h_t = p_t = 0$. 

(b) $h_t = p_t = 1$. 

(c) Suppose instead that the inventory holding and backlog penalty cost function is 
$h_t(i_t) = \sqrt{|i_t|}$ for every Period t. 

43 In each part, indicate an optimal production schedule and the associated ending 
inventory level in every Period t for horizon lengths 1, 2, ..., N, where N is specified 
in each part below. Assume that the production and holding cost functions are of the 
form 

$$C_t(x_t) = \begin{cases} 0 & \text{for } x_t = 0 \\ s_t + c_t x_t & \text{for } x_t \ge 1 \end{cases} \qquad \text{and} \qquad h_t(i_t) = h_t i_t \quad \text{for every Period } t.$$

(a) Let $N = 10$, $s_t = 36$, $c_t = 5$, $h_t = 1$, and the demand requirements $D_t = 3$ for 
every Period t. 

(b) Assume the data in part (a), except that the demand requirements are $D_t = 20$ 
for Periods 3, 6, and 9 and $D_t = 2$ for all other periods. 

(c) Assume the data in part (a), except that the demand requirements are $D_t = 20$ 
for Periods 2, 5, and 8 and $D_t = 2$ for all other periods. 

(d) Assume the data in part (a), except that the demand requirements are $D_t = 20$ 
for Periods 1, 4, 7, and 10 and $D_t = 2$ for all other periods. 

(e) Let $N = 12$, $c_t = 5$, $h_t = 1$, the demand requirements $D_t = 3$ for every Period t, 
and $s_t = 100$ for Periods 4, 5, 6, 10, 11, 12 and $s_t = 36$ for all other periods. 

*(f) Assume the data in part (a), except that the holding cost function is $h_t(i_t) = \sqrt{i_t}$ 
for every Period t. 

*(g) Assume the data in part (a), and that backlogging is permitted; let the inventory 
holding and backlog penalty cost function be $h_t(i_t) = |i_t|$ for every Period t. 

*(h) In parts (a) through (e), indicate how the computations in the algorithm simplify 
by employing the Setup Cost Horizon Theorem in Sec. 9.7. Also show how you 
can use recursion (7). 

44 Consider the problem in exercise 44 in Chap. 8. Determine whether you get an optimal 
schedule if you apply 

(a) The algorithm in Sec. 9.3 for the inventory model with convex costs. 

(b) The algorithm in Sec. 9.6 for the inventory model with concave costs. 

(c) Interpret your results in parts (a) and (b). 


MIND-EXPANDING EXERCISES 

45 Consider the inventory model described in Sec. 9.1. Suppose that production cost in 
Period t consists of two components: labor cost Lt{xt) and raw materials cost. The 
latter arises as follows. In each Period r, the company can purchase an unlimited 
quantity of raw material at the price pr per unit (where the units have been selected 
so that each item of production requires one unit of raw material). Every price pr is 
known at the beginning of the planning horizon. If raw materials are purchased in 
early periods to be stored for use in later periods, then the company incurs a storage 
cost. Specifically, the charge is Hr per unit of raw material held at the end of Period r. 
Suggest an efficient way to incorporate the raw materials purchasing decision into the 
models of Sec. 9.3 and 9.5. 

46 In each part below, state how you would modify the specified algorithm to account 
for a positive level of initial inventory $i_0 > 0$. First state the modifications assuming 
that all holding cost functions are of the form $h_t(i_t) = h_t i_t$. Then indicate how to 
revise your approach when the holding cost functions are not all linear. 

(a) Inventory model with convex costs in Sec. 9.3. 

(b) Inventory model with concave costs in Sec. 9.6. 

47 (a) Suggest a rewording of the algorithm in Sec. 9.3 to take account of an upper 
bound $u_t$ on production and an upper limit $b_t$ on the amount inventoried at 
the end of Period t. 

(b) Consider the concave cost model in Sec. 9.5 and assume that items become 
obsolete (unusable) after being held in storage for K periods. Give a valid restate¬ 
ment of the form of an optimal policy and the resultant algorithm. 

48 Consider the inventory model described in Sec. 9.1, and assume initial inventory 
$i_0 = 0$. Ignoring integer-value restrictions on the variables, the constraints (4) and (5) 
can be written as 

$$\sum_{k=1}^{t} x_k \ge \sum_{k=1}^{t} D_k \quad \text{for } t = 1, 2, \ldots, N, \text{ with equality holding for } t = N,$$

and all $x_k \ge 0$. Suppose the cost function to be minimized is $C(x_1, x_2, \ldots, x_N, D_1, 
\ldots, D_N)$. Assume that this function is concave in all the decision variables $x_k$, which 
means that given any two feasible solutions, the cost associated with another 
solution that is a positive-weighted average of these two solutions is no greater than 
the same positive-weighted average of the costs of the two solutions. It can be shown 
(Sec. 14.9) that an optimal solution occurs at an extreme point of the constraint set. 
(Recall a solution is an extreme point if it cannot be expressed as a positive-weighted 
average of other feasible solutions.) 

(a) Verify that an extreme-point solution has the following property. If there are 
Periods p and q, where $p < q$, such that 

$x_p > 0, \quad x_{p+1} = x_{p+2} = \cdots = x_{q-1} = 0, \quad x_q > 0,$ 

then 

$x_p = \sum_{t=p}^{q-1} D_t.$ 

(b) Show that part (a) is equivalent to asserting the form of an optimal policy given 
in Sec. 9.5. Explain why there exists an optimal solution with integer values for 
the decision variables provided that each $D_t$ is integer-valued. 

(c) Show that there are $2^{N-1}$ feasible extreme-point solutions. Indicate how you can 
use this result to calculate an optimal policy. (Assume $D_1 > 0$.) 

In exercises 49 through 58, you are to formulate a model in terms of a dynamic 
programming recursion. Be sure to define all the symbols you use. Give the 
appropriate optimization function where there is a single period remaining, as 
well as the recursion for Stage n. Explain how to initiate and when to terminate the 
calculations. 

49 Consider the inventory model described in Sec. 9.1. 

(a) Add the restriction that (i^i + *«)/2 ^ Bt for t = 1, 2,..., iV”, where each Bt 
is a nonnegative integer. 

(b) Add the restriction that + Xt ^ Vt for t ^ 1, 2,..., iV, where Xg = 0 and 
each Vi is a nonnegative integer. 

(c) Add the restriction that + xt > 1 for / = 1, 2,..., iV, where Xg = 0. 

50 Batch Service Problem. Several food companies send shipments for the Mom-and-Pop 
Grocery Store to a local public warehouse. Mom-and-Pop pays c dollars per day per 
hundred cubic feet of storage it utilizes in this warehouse. Suppose Mom-and-Pop 
forecasts, for each of the next N days, that shipments amounting to $D_t$ hundred cubic 
feet will arrive on Day t. Suppose each day Mom-and-Pop can arrange to have a 
truck pick up all the goods stored at the warehouse. Since the cost of this trucking 
service is s, Mom-and-Pop does not usually order a pickup every day. However, it 
never lets a shipment remain in the warehouse longer than n days. (Assume that shipments 
arrive at the start of a day, and that the trucking service picks up the goods at 
the end of a day; assume that warehousing costs are assessed on the maximum space 
utilized during a day.) Show how Mom-and-Pop can obtain an optimal schedule for 
picking up goods at the warehouse. 

51 Site Location Problem. The Crack Oil Company has a substantial share of the gasoline 
market in a major population center that is located along a peninsula. The area 
consists of N cities that are connected by a six-lane highway. Assume the cities are 
indexed 1, 2, ..., N, and to travel from City j to City k, where $j < k$, you must 
proceed along the highway passing by Cities $j + 1, j + 2, \ldots, k - 1$. The Crack 
Oil Company wants to locate distributors in one or more of these cities. The annual 
fixed cost of operating a distributorship in City j is $s_j$. The forecasted annual gallonage 
that must be supplied to City j is $D_j$. The distance between adjacent Cities j and j + 1 
is $L_{j,j+1}$. If a distributor in City j serves the requirements in City k, then the annual 
cost associated with transportation is $h \cdot D_k \cdot$ (distance between City j and City k). 

The company wants an optimal pattern of locations for its distributors. Formulate a 
dynamic programming model for this decision problem. 

(a) Assume that City k can only be served by a distributor in City j, where $j \le k$. 
*(b) Assume that City k is served by the closest distributor. 

52 Assortment Problem. A Japanese industrial firm, Itsa Steel Company, manufactures 
structural beams of a standard length. The strength of a beam depends on its weight, 
and Itsa Steel indexes the various strengths it can make as $j = 1, 2, \ldots, N$, where 
$j = 1$ is the heaviest beam and $j = N$ the lightest. Assume that if a customer requests 
Strength k, then Itsa Steel may, if it chooses, supply the demand by a beam of possibly 
greater Strength j, where $j \le k$. Itsa Steel must solve the following assortment problem. 
The demand requirement for Strength j is $D_j$ beams; all demand must be satisfied. 
If Itsa Steel decides to manufacture Strength j, then it incurs an expensive setup cost $s_j$. 
If the company meets the demand requirement $D_j$ by shipping beams of Strength k, 
where $k \le j$, then the company incurs a loss of $h \cdot (w_k - w_j)$, where $w_k$ and $w_j$ 
are the respective weights of the beams and h is the cost per unit of weight. Formulate 
a dynamic programming model that will enable Itsa Steel to decide an optimal 
assortment of strengths to manufacture and the corresponding amounts of each. 


53 Consider the inventory model described in Sec. 9.1. Suppose there is a cost of starting 
up production whenever production in the previous period is 0. Specifically, assume 
the cost function for Period t is 

$$C_t(x_t, i_t, x_{t-1}) = C_t(x_t) + h_t(i_t) + \pi_t(x_{t-1}, x_t),$$

where 

$$\pi_t(x_{t-1}, x_t) = \begin{cases} \pi_t & \text{if } x_t \ge 1 \text{ and } x_{t-1} = 0 \\ 0 & \text{otherwise.} \end{cases}$$

(a) Formulate the problem as a dynamic programming model. Compare the computational 
difficulty of solving the recursion with that of the model in Sec. 9.1 
(making no further assumptions about the shape of the cost functions). 

(b) Suppose 

$$C_t(x_t) = \begin{cases} 0 & \text{for } x_t = 0 \\ s_t + c_t x_t & \text{for } x_t \ge 1, \end{cases}$$

and $h_t(i_t)$ is concave. Assume all $s_t \ge 0$, $c_t \ge c_{t+1}$, demand requirements $D_t > 0$, 
and initial inventory $i_0 = 0$. Suppose there exists an optimal policy in which 
$i_{t-1} x_t = 0$ for every Period t. Suggest an appropriate modification to the algorithm 
in Sec. 9.6. Draw a shortest-route diagram illustrating your idea. 

*(c) Prove that assumptions in part (b) imply the existence of an optimal policy with 
$i_{t-1} x_t = 0$. 


54 Consider the inventory model described in Sec. 9.1. Suppose that production is 
permitted in at most K periods during the horizon of N periods (that is, no more than 
K of the $x_t$, for $t = 1, 2, \ldots, N$, can be positive). Formulate the problem as a dynamic 
programming model. Compare the computational difficulty of solving the recursion 
with that of the model in Sec. 9.1 (making no further assumptions about the shape of 
the cost functions). State how you would modify your answer if production must occur 
in exactly K periods during the horizon. 

In exercises 55 through 58 the single-item inventory model as described in Sec. 
9.1 is generalized to a problem having two items (the further generalization to more 
items involves straightforward extensions of the results below). Accordingly, let 
$D_{1t}$ and $D_{2t}$ denote the demand requirements in Period t for Products 1 and 2, 
respectively, and similarly let $x_{1t}$ and $x_{2t}$ denote the production levels, and $i_{1t}$ 
and $i_{2t}$ the ending inventory levels. (Any assumptions left implicit are the appropriate 
generalization of those made for the model in Sec. 9.1.) 


55 Assume the objective function is 

$$\text{minimize } \sum_{t=1}^{N} C_t(x_{1t}, x_{2t}, i_{1t}, i_{2t}),$$

where 

$$C_t(x_{1t}, x_{2t}, i_{1t}, i_{2t}) = C_t(x_{1t}, x_{2t}) + h_{1t}(i_{1t}) + h_{2t}(i_{2t}).$$

Formulate a dynamic programming model, and compare the computational difficulty 
of solving the recursion with that of the model in Sec. 9.1 (making no further assumptions 
about the shape of the cost functions). 


56 Suppose the cost functions in exercise 55 are 

$$C_t(x_{1t}, x_{2t}) = \begin{cases} 0 & \text{for } x_{1t} = x_{2t} = 0 \\ s_t + c_{1t} x_{1t} + c_{2t} x_{2t} & \text{otherwise,} \end{cases}$$

where $s_t \ge 0$, $c_{1t} \ge c_{1,t+1}$ and $c_{2t} \ge c_{2,t+1}$, and that the holding cost functions 
$h_{1t}(i_{1t})$ and $h_{2t}(i_{2t})$ are concave. 

(a) Assume initial inventories $i_{10} = i_{20} = 0$. Suppose that there exists an optimal 
policy for which $(i_{1,t-1} + i_{2,t-1}) \cdot (x_{1t} + x_{2t}) = 0$ for every Period t. Explain 
how you would modify the algorithm in Sec. 9.6 to find an optimal production 
policy for both items. 

*(b) Prove the existence of an optimal policy having the form stated in part (a). 


57 Suppose the cost functions in exercise 55 are 

$$C_t(x_{1t}, x_{2t}) = \begin{cases} 0 & \text{for } x_{1t} = x_{2t} = 0 \\ s_t + C_{1t}(x_{1t}) + C_{2t}(x_{2t}) & \text{otherwise,} \end{cases}$$

where $s_t \ge 0$, $C_{1t}(x_{1t})$ and $C_{2t}(x_{2t})$ are concave, and that the holding cost functions 
$h_{1t}(i_{1t})$ and $h_{2t}(i_{2t})$ are concave. 


(a) Assume that the form of an optimal policy in Sec. 9.5 remains valid for each item. 
Explain how you would simplify the calculations in the recursion for exercise 55 
by exploiting this information. 

*(b) Prove that the assumption in part (a) is valid. 


58 Suppose the production cost functions in exercise 55 are $C_t(x_{1t}, x_{2t}) = C_{1t}(x_{1t}) + 
C_{2t}(x_{2t})$ and that the production levels are constrained in each period by 
$a_{1t} x_{1t} + a_{2t} x_{2t} \le b_t$, where $a_{1t}$, $a_{2t}$, $b_t$ are nonnegative integers. Show how you would 
modify your formulation in exercise 55 accordingly. 

CHAPTER 


Other Examples of 
Dynamic Programming 


10.1 REMINDER 

Formulating a model in terms of a dynamic programming recurrence relation 
is partially an art. Hence a variety of examples are illustrated in Secs. 10.2 through 
10.6 so that you can acquire a better feeling for such formulations. Since you have 
seen the numerical solution for two examples worked out in gory detail in Chap. 
8, we do not emphasize here the computational aspects of solving the models. 

At the end of this chapter, in Secs. 10.7 through 10.10, we give an overall assess¬ 
ment of dynamic programming. Specifically, we discuss the ingredients that are 
common to all dynamic programming model formulations, and summarize 
how this kind of analysis yields insightful information for managerial decision¬ 
making. Some perspective is given on what makes the difference between a theo¬ 
retical exercise and a practical application of dynamic analysis. Finally, we point 
out in brief the kinds of industrial applications that have proved profitable, and the 
reasons for their success. 

In presenting the models below, we start slowly and provide a number of the 
minor details; as the chapter progresses, the pace picks up, and you are asked 
to carefully study the models and provide some of this detail yourself. Our words 
of guidance from Chap. 8 still apply—be patient and give yourself plenty of time 
to study the sections. Plan to read each section at least twice. And to keep check 
on your understanding, for each of the models answer the questions: 

1. What are the policy or decision variables? 

2. What is the criterion or objective function for determining an optimal 
policy? 

3. How is the problem characterized and then analyzed in terms of stages? 

4. What characterizes the state of the problem at each stage? 

5. How do the constraints influence the states of the problem and the feasible 
values of the policy variables? 

10.2 DISTRIBUTION OF EFFORT—ONE 
CONSTRAINT 

The following is a hypothetical, but suggestive, example of the so-called 
distribution of effort model. The owner of Shopping Basket Markets has a week’s 
supply N of eggs to distribute among his s stores. From past experience he knows 
that if he allocates $y_j$ eggs to Store j, his profit will be $R_j(y_j)$. He suspects that in 
order to maximize his overall profit, he should not put all his eggs in one Shopping 
Basket! He wants to find an optimal distribution of eggs. 

This decision problem can be formulated as 

(1) maximize $\sum_{j=1}^{s} R_j(y_j)$ 

subject to 

(2) $\sum_{j=1}^{s} y_j = N$ (available number of eggs) 

(3) $y_j = 0, 1, 2, \ldots$ for each j (distribute only whole eggs). 

To convert the problem statement (1) through (3) into a dynamic programming 
version, define 

(4) $g_j(n)$ = profit when n eggs are distributed optimally to Store 1, Store 2, 
    ..., Store j, 

    $y_j(n)$ = a distribution amount for Store j that yields $g_j(n)$. 

Watch the notation here. The letter g indicates a value for the company’s 
goal, namely profit. The letter n refers to the number of eggs to be distributed. And 
the index j denotes just a store. 

Observe that this example is not dynamic in the sense of a time-staged decision. 
Rather, the multistage property of the problem relates to the optimizing technique 
of considering one additional store at a time. 

The dynamic programming recursion can be written as 

(5) $g_j(n) = \max_{y_j}\ [R_j(y_j) + g_{j-1}(n - y_j)]$ for $j = 1, 2, \ldots, s$ 

(6) $g_0(n) = 0$ for $j = 0$, 

where $n = 0, 1, \ldots, N$ and the maximization is over only nonnegative integer 
values of $y_j$ that satisfy $y_j \le n$. 

Given numerical data for the profit functions $R_j(y_j)$ and a value of N, you solve 
the problem by beginning the computations for $j = 1$ and finding $g_1(0), g_1(1), 
\ldots, g_1(N)$. Then continue by finding $g_2(0), g_2(1), \ldots, g_2(N)$. You proceed in this 
fashion for successively larger values of j until you finally find $g_s(N)$. You obtain 
the actual optimal allocation by tracing back to see the values of $y_j$ that together 
yielded $g_s(N)$. 
General model. The Shopping Basket Example is a special case of what 
is called a distribution of effort problem; the example is special 
because the single constraint (2) is linear. The same dynamic approach is equally 
valid when the single constraint is nonlinear. Specifically, suppose the model is 

(7) maximize $\sum_{j=1}^{s} R_j(y_j)$ 

subject to 

(8) $\sum_{j=1}^{s} H_j(y_j) = N$ 

(9) $y_j = 0, 1, 2, \ldots$ for each j. 

Assume that each $H_j(y_j)$ is a nondecreasing function, integer-valued for each 
$y_j = 0, 1, 2, \ldots$, and, for simplicity, satisfies $H_j(0) = 0$. To simplify the discussion, 
assume that $H_1(y_1) = y_1$ so a feasible solution exists for any value of N. 
Each $y_j$ can also be restricted by an upper bound, as will be illustrated in a 
numerical example below. 

A dynamic programming recursion appropriate for (7) through (9) is 

(10) $g_j(n) = \max_{y_j}\ \{R_j(y_j) + g_{j-1}[n - H_j(y_j)]\}$ for $j = 1, 2, \ldots, s$ 

(11) $g_0(n) = 0$ for $j = 0$, 

where $n = 0, 1, \ldots, N$, and the maximization is over only nonnegative integer 
values of $y_j$ that satisfy $H_j(y_j) \le n$. The value of $g_s(N)$ is sought. To perform 
the computations, find each $g_j(n)$, for $n = 0, 1, \ldots, N$, in (10), beginning with 
$j = 1$, and terminating with $j = s$. 

Numerical example. We illustrate the computational process of solving 
a distribution of effort problem by assuming data for the model in (7), (8), and 
(9). Specifically, we let $s = 3$ and $N = 8$; the values for the functions $R_j(y_j)$ and 
$H_j(y_j)$ are displayed in Fig. 10.1. Observe that 

(12) $H_2(y_2) = 2y_2 \qquad H_3(y_3) = 3y_3,$ 

Figure 10.1. Distribution of Effort Example. 

which are linear functions, but $R_3(y_3)$ is not linear. In addition, we have imposed 
an upper bound 

(13) $y_j \le 2$ for each j. 

(In the context of this problem, we equally well could have imposed the bound 
$y_3 \le 1$, since letting $y_3 = 2$ does not increase the objective function.) 

The values of $g_j(n)$ and the associated optimal decisions $y_j(n)$ are tabled in Fig. 
10.2. Let us review how these numbers are computed. 

          j = 1             j = 2             j = 3 
 n    y1(n)  g1(n)      y2(n)  g2(n)      y3(n)  g3(n) 

 0      0      0          0      0          0      0 
 1      1      2          0      2          0      2 
 2      2      4          0      4          0      4 
 3                        1      5          0      5 
 4                        1      7          0      7 
 5                        2      8          1      8.5 
 6                        2     10          0     10 
 7                                          1     11.5 
 8                                          1     12.5 

Figure 10.2. Optimal Policies for Distribution of Effort Example (N = 8). 


The calculations of $y_1(n)$ and $g_1(n)$ are trivial. For example, if $n = 2$, then you
can see in Fig. 10.1 that $H_1(2) = 2$, so that $y_1(2) = 2$ and, correspondingly,
$g_1(2) = R_1(2) = 4$.

Similarly, the calculations of $y_2(n)$ and $g_2(n)$ for $n = 0, 1$ are almost trivial since,
for these values of $n$, the restriction $H_2(y_2) \le n$ can only be satisfied by $y_2 = 0$.
Therefore

(14)  $g_2(0) = 0$  and  $y_2(0) = 0$

(15)  $g_2(1) = R_2(0) + g_1(1) = 0 + 2 = 2$  and  $y_2(1) = 0$.

A more typical calculation occurs in finding $y_2(2)$ and $g_2(2)$, since, when $n = 2$,
the values $y_2 = 0$ and $y_2 = 1$ are both feasible, according to Fig. 10.1. Then (10)
indicates

(16)  $g_2(2) = \max [R_2(0) + g_1(2),\; R_2(1) + g_1(0)] = \max (0 + 4,\; 3 + 0) = 4$  and  $y_2(2) = 0$.

To test your understanding of this process, verify the remaining entries in Fig.
10.2. (Notice in Fig. 10.2 that as $n$ increases from 0 to 8, the optimal amount of
$y_3$ fluctuates.)

The optimal policy for $N = 8$ is found as follows: starting with $j = 3$, you
see that $y_3(8) = 1$. Consequently, to find an optimal $y_2$ you must calculate
$n = 8 - 3y_3(8) = 5$. Thus $y_2(5) = 2$. Then the new $n = 5 - 2y_2(5) = 1$, and
thus $y_1(1) = 1$. So an optimal policy is $y_1 = 1$, $y_2 = 2$, and $y_3 = 1$. The associated
optimal value of the objective function is $g_3(8) = 12.5$.
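
The recursion (10) and (11), together with the backward trace just described, can be sketched in Python. This is only an illustrative sketch: the function names are hypothetical, the return values are inferred from the entries of Fig. 10.2 and from (14) through (16) rather than read from Fig. 10.1, and the value R_3(2) = 4.5 is an assumption chosen merely to agree with the remark that letting y_3 = 2 does not increase the objective. The sketch also fills in a value for every n at every stage, including combinations left blank in Fig. 10.2.

```python
def distribution_of_effort(R, H, N):
    """Apply recursion (10)-(11).

    R[j][y] and H[j][y] hold R_j(y) and H_j(y) for activity j (0-based list
    index) and y = 0, 1, ..., up to the upper bound on y_j.
    Returns (g, best) with g[j][n] = g_j(n) and best[j][n] = an optimal y_j(n).
    """
    s = len(R)
    g = [[0.0] * (N + 1)]          # (11): g_0(n) = 0
    best = [[0] * (N + 1)]         # placeholder row so that row j matches stage j
    for j in range(1, s + 1):
        Rj, Hj = R[j - 1], H[j - 1]
        g_row, y_row = [], []
        for n in range(N + 1):
            # Feasible decisions: nonnegative integers y with H_j(y) <= n.
            options = [(Rj[y] + g[j - 1][n - Hj[y]], y)
                       for y in range(len(Rj)) if Hj[y] <= n]
            value, y_star = max(options, key=lambda t: t[0])
            g_row.append(value)
            y_row.append(y_star)
        g.append(g_row)
        best.append(y_row)
    return g, best


def trace_policy(best, H, N):
    """Recover y_1, ..., y_s by stepping back from stage s with n = N."""
    policy, n = [], N
    for j in range(len(H), 0, -1):
        y = best[j][n]
        policy.append(y)
        n -= H[j - 1][y]           # resource left for the earlier stages
    return policy[::-1]


# Data inferred from Fig. 10.2 (R_3(2) is an assumption); each y_j <= 2.
R = [[0, 2, 4], [0, 3, 6], [0, 4.5, 4.5]]
H = [[0, 1, 2], [0, 2, 4], [0, 3, 6]]
g, best = distribution_of_effort(R, H, 8)
policy = trace_policy(best, H, 8)
print(g[3][8], policy)
```

Under these assumed data the sketch reproduces the tabled values of g_3(n) and the policy traced in the text.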

► For the special case in which the objective function (7) and constraint (8) are linear,

(i)  $R_j(y_j) = R_j y_j$  and  $H_j(y_j) = H_j y_j$  for $j = 1, 2, \ldots, s$,

further simplifications in the analysis are possible. Continue to assume that $H_1 = 1$,
and for $j \ne 1$, $H_j$ is an integer larger than 1. The following recurrence relation can be
employed instead of (10) and (11):

(ii)  $F(n) = \max_j [R_j + F(n - H_j)]$  for $n = 1, 2, \ldots, N$

and

(iii)  $F(0) = 0$  for $n = 0$,

where the maximization is over values of $j = 1, 2, \ldots, s$ that also satisfy $H_j \le n$. The
computational process is to determine $F(1), F(2), \ldots, F(N - 1)$, and finally $F(N)$,
which is the optimal value of the objective function. Here the notions of stage and state
variables coalesce, so a subscript is not needed.

Comparison of (10) and (ii) shows immediately that whereas you must determine
$g_j(n)$ for $n = 0, 1, \ldots, N$ and $j = 1, 2, \ldots, s$, you need find $F(n)$ only for $n =
0, 1, \ldots, N$. Thus $g_s(n) = F(n)$.
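
As a rough illustration of (ii) and (iii), the following Python sketch fills in F(n) for n = 0, 1, ..., N with a single loop over n. The function name and the numbers in R and H are hypothetical and are not taken from the text.

```python
def linear_effort(R, H, N):
    """Recursion (ii)-(iii): F(n) = max_j [R_j + F(n - H_j)], with F(0) = 0."""
    F = [0] * (N + 1)
    choice = [None] * (N + 1)      # an index j attaining the maximum at each n
    for n in range(1, N + 1):
        # Because H_1 = 1, at least one j satisfies H_j <= n.
        F[n], choice[n] = max(((R[j] + F[n - H[j]], j)
                               for j in range(len(R)) if H[j] <= n),
                              key=lambda t: t[0])
    return F, choice


R = [1, 3, 5]      # hypothetical per-unit returns R_j
H = [1, 2, 3]      # H_1 = 1; the other H_j are integers larger than 1
F, choice = linear_effort(R, H, 8)
print(F)           # F[8] is the optimal objective value for N = 8
```

Only one list of length N + 1 is kept, in contrast with the s lists needed by (10).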

The difference between the two approaches (10) and (ii) is nicely demonstrated by
the inventory model with concave costs treated in Sec. 9.5. Suppose demands and cost
functions are unchanging over time, that is, suppose they are stationary:

(iv)  $D_t = D$,  $C_t(x_t) = C(x_t)$,  and  $h_t(i_t) = h(i_t)$  for all periods.

Then the total cost $c_{kj}$ of producing in Period $k + 1$ to meet the demand requirements
of Periods $k + 1, \ldots, j$ depends only on the length of the interval. Therefore for $j - k
= p$, let $c_{kj} = R_p = C(pD) + \sum_{m=1}^{p-1} h(mD)$. To illustrate, if $N = 4$,

      $c_{kj} = c_{01} = c_{12} = c_{23} = c_{34} = R_1$  for $j - k = 1$

      $c_{kj} = c_{02} = c_{13} = c_{24} = R_2$  for $j - k = 2$
(v)
      $c_{kj} = c_{03} = c_{14} = R_3$  for $j - k = 3$

      $c_{kj} = c_{04} = R_4$  for $j - k = 4$.

In this situation the proposition on the form of the optimal policy implies that each
production amount will be an integer multiple of $D$, such as $1D, 2D, \ldots, ND$. Let
$y_j$ be the number of times the production amount $j \cdot D$ is scheduled, where $j =
1, 2, \ldots, N$. Then the $y_j$ must be determined so that total production $\sum_{j=1}^{N} (j \cdot D) y_j$
equals total demand $N \cdot D$. The associated optimization model for an $N$-period horizon
can be stated as

(vi)  minimize $\sum_{j=1}^{N} R_j y_j$

subject to

(vii)  $\sum_{j=1}^{N} j \cdot y_j = N$  and  $y_j = 0, 1, \ldots$  for all $j$,

where $D$ has been divided out of both sides of (vii). [Thus the summation on the left
of (vii) is merely the total span of the schedule, and has to equal the length of the
planning horizon.]

Observe that the objective in (vi) is minimization. Rather than convert (vi) to a
maximization problem by changing the signs of $R_j$, you can simply change the sense
of optimization in (10). The formulation (vi) and (vii) views the problem in terms of
how many batches of demand requirements for $j$ periods are to be scheduled. Here
$s = N$, since one possibility is to schedule the amount $N \cdot D$ in Period 1 to satisfy all
demand requirements.

For the example (vi) and (vii), you can simplify (10) and (11) to

(viii)  $g_j(n) = \min_{y_j} [R_j y_j + g_{j-1}(n - j \cdot y_j)]$  for $j = 1, 2, \ldots, N$, and

(ix)  $g_0(n) = 0$  for $j = 0$,

where $n = 0, 1, \ldots, N$, and the minimization is over only nonnegative integer values
of $y_j$ that satisfy $j \cdot y_j \le n$.

The recursion (ii) becomes, for $n = 1, 2, \ldots, N$,

(x)  $F(n) = \min_{j = 1, \ldots, n} [R_j + F(n - j)]$  and  $F(0) = 0$.

Letting $f_n = F(n)$ and $k = n - j$, an equivalent expression for (x) is

(xi)  $f_n = \min_{k = 0, 1, \ldots, n-1} [f_k + R_{n-k}]$  and  $f_0 = 0$,

which is the same as the recursion (4) in Sec. 9.6, given the cost assumption in (v).
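
To make (v) and (x) concrete, the sketch below first builds the batch costs R_p and then applies recursion (x). The cost functions, the demand D = 2, and all function names are hypothetical choices made only for illustration; they do not come from the text.

```python
def batch_costs(C, h, D, N):
    """R_p = C(pD) + sum_{m=1}^{p-1} h(mD) for p = 1, ..., N (index 0 unused)."""
    return [0] + [C(p * D) + sum(h(m * D) for m in range(1, p))
                  for p in range(1, N + 1)]


def stationary_lot_size(Rp, N):
    """Recursion (x): F(n) = min over j = 1..n of [R_j + F(n - j)], F(0) = 0."""
    F = [0] * (N + 1)
    span = [0] * (N + 1)           # batch length j attaining the minimum
    for n in range(1, N + 1):
        F[n], span[n] = min(((Rp[j] + F[n - j], j) for j in range(1, n + 1)),
                            key=lambda t: t[0])
    return F, span


def C(x):
    """Hypothetical concave production cost: setup 10 plus 2 per unit."""
    return 10 + 2 * x if x > 0 else 0


def h(i):
    """Hypothetical holding cost on ending inventory i."""
    return 2 * i


Rp = batch_costs(C, h, D=2, N=4)
F, span = stationary_lot_size(Rp, 4)
print(Rp[1:], F[4], span[4])
```

With these assumed costs, a four-period horizon is covered most cheaply by two batches that each span two periods.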



Figure 10.3. Concave Cost Inventory Model - Node Description: (n, j).
























To compare the relative merits of (viii) and (x), let $N = 4$. Then the recursion (x)
corresponds to finding the shortest route in an acyclic network structure that has the
appearance of Fig. 9.7, p. 308. The recursion (viii) corresponds to a shortest-route
problem in the network exhibited in Fig. 10.3. Here each node label $(n, j)$ refers to a pair
of values for $n$ and $j$ in (viii). Each arc out of Node $(n, j)$ represents a value of $y_j$ such
that $j \cdot y_j \le n$. Clearly (viii) involves more effort than (x). [Both recursive approaches
can be further streamlined, but even after these additional efficiencies are exploited,
(x) remains better.] ◄
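
One rough way to see the difference in effort is to count arcs. The sketch below is an assumption-laden illustration: it counts one arc for every feasible decision at every state, including states that a drawn network such as Fig. 10.3 might omit because no route passes through them, so the figure for (viii) should be read as an upper bound. The function names are hypothetical.

```python
def arcs_single_state(N):
    """Recursion (x): node n has one arc for each j = 1, ..., n."""
    return sum(n for n in range(1, N + 1))


def arcs_stage_state(N):
    """Recursion (viii): node (n, j) has one arc per y_j with j * y_j <= n."""
    return sum(n // j + 1 for j in range(1, N + 1) for n in range(N + 1))


print(arcs_single_state(4), arcs_stage_state(4))
```

For N = 4 this count gives 10 arcs for (x) against 37 for (viii).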


10.3 DISTRIBUTION OF EFFORT - TWO CONSTRAINTS


The Genghis Motor Company is planning a radio advertising campaign for
its automobiles in a large metropolitan area. The spot commercials will introduce
the firm's new models, and will be aired over a two-week period. The area has
$s$ radio stations. For Radio Station $j$, the company's market research organization
has estimated a sales response relation indicating the net return (sales less
advertising costs) $R_j(y_j)$ due to allocating $y_j$ dollars to commercials played on that
station. The total promotional budget is $N$ dollars. The company's advertising
manager also wants only a limited number $M$ of announcements during the
daytime hours. Radio Station $j$ has indicated that $K_j(y_j)$ is the number of daytime
spots it would schedule, given that the company purchased a total of $y_j$ dollars of
advertising.

The manager's optimization model can be written as

(1)  maximize $\sum_{j=1}^{s} R_j(y_j)$

subject to

(2)  $\sum_{j=1}^{s} y_j \le N$  (promotional budget)

(3)  $\sum_{j=1}^{s} K_j(y_j) \le M$  (daytime announcements)

(4)  $y_j = 0, 1, 2, \ldots$  for each $j$.

Assume that $N$, $M$, and $K_j(y_j)$ are all nonnegative integers.

Since previously you needed a one-dimensional state variable in a dynamic
recursion to represent one constraint, such as (2), now you might reasonably, and
correctly, guess that a two-dimensional state variable is required. An appropriate
recursive relation is

(5)  $g_j(n, m) = \max_{y_j} \{ R_j(y_j) + g_{j-1}[n - y_j,\; m - K_j(y_j)] \}$  for $j = 1, 2, \ldots, s$,

where $n = 0, 1, \ldots, N$, and $m = 0, 1, \ldots, M$, and the maximization is over
only nonnegative integer values of $y_j$ such that $y_j \le n$ and $K_j(y_j) \le m$. Give a
definition for $g_j(n, m)$ in the context of the radio advertising campaign. What is
the essential difference between (5) and the earlier relation (10) in Sec. 10.2?
How would the recursion appear if you added a third summation constraint to
(2) and (3)?
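
The two-dimensional recursion (5) can be sketched as a pair of nested loops over n and m. The sketch below is illustrative only: the function name, the boundary condition g_0(n, m) = 0 (the text does not state one for this section), the requirement K_j(0) = 0, and the small tables for R_j and K_j are all assumptions.

```python
def two_constraint_effort(R, K, N, M):
    """Recursion (5) with state (n, m).

    R[j][y] and K[j][y] hold R_j(y) and K_j(y) for y = 0, ..., N.
    Assumes K_j(0) = 0, so y = 0 is always feasible, and g_0(n, m) = 0.
    Returns the table g_s(n, m) as a list of rows indexed by n, then m.
    """
    g = [[0] * (M + 1) for _ in range(N + 1)]       # assumed g_0(n, m) = 0
    for Rj, Kj in zip(R, K):
        g_next = [[0] * (M + 1) for _ in range(N + 1)]
        for n in range(N + 1):
            for m in range(M + 1):
                # Feasible y: 0 <= y <= n and K_j(y) <= m.
                g_next[n][m] = max(Rj[y] + g[n - y][m - Kj[y]]
                                   for y in range(n + 1) if Kj[y] <= m)
        g = g_next
    return g


# Hypothetical data for s = 2 stations, budget N = 3, daytime limit M = 2.
R = [[0, 5, 8, 9], [0, 4, 7, 9]]
K = [[0, 1, 2, 3], [0, 0, 1, 1]]
g = two_constraint_effort(R, K, N=3, M=2)
print(g[3][2], g[3][1])
```

Tightening the daytime limit from 2 to 1 lowers the best return in this made-up instance, which shows the second state variable doing real work.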

► Another illustration of the model arises in the process of capital budgeting. Each year
the Giant Electric Company, which manufactures heavy equipment, considers a large
number $s$ of independent proposals for major plant and machinery investments. Proposal $j$
requires a total outlay of $K_j$ dollars, and the funds $M$ for all such capital investments
are fixed. The expected return on the investment is $R_j$ dollars, and the company seeks
to maximize its overall return. The corporate directors have also realized that due to
the limited availability of supervisory management, the total number of projects
undertaken in any one year should be restricted to $N$. Each proposal is unique, and the firm
must decide to either accept or reject the proposal for the year. To represent this
decision, let the variable $y_j$ associated with Proposal $j$ be restricted to be $y_j = 0$ (reject) or 1
(accept). All these conditions can then be expressed by the problem

(i)  maximize $\sum_{j=1}^{s} R_j y_j$

subject to

(ii)  $\sum_{j=1}^{s} y_j \le N$  (project limit)

(iii)  $\sum_{j=1}^{s} K_j y_j \le M$  (budget limit)

(iv)  $y_j = 0$ or 1 for each $j$  (reject or accept).

In this context

(v)  $g_j(n, m)$ = return when $m$ dollars are available to invest optimally
     in $n$ projects selected from Proposal 1, Proposal 2, ...,
     Proposal $j$. ◄
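
Because each y_j is 0 or 1 here, the stage maximization reduces to comparing "reject" with "accept". A minimal sketch follows; the function name, the boundary value g_0(n, m) = 0, and the four proposals are hypothetical and serve only to illustrate the definition in (v).

```python
def capital_budget(K, R, N, M):
    """g_j(n, m) for the 0-1 model (i)-(iv); returns g_s(N, M).

    K[j] is the outlay and R[j] the return of proposal j.
    Assumes g_0(n, m) = 0 for every n and m.
    """
    g = [[0] * (M + 1) for _ in range(N + 1)]
    for Kj, Rj in zip(K, R):
        g_next = [row[:] for row in g]              # start from "reject"
        for n in range(1, N + 1):
            for m in range(Kj, M + 1):
                # "accept" uses one project slot and K_j dollars
                g_next[n][m] = max(g[n][m], Rj + g[n - 1][m - Kj])
        g = g_next
    return g[N][M]


K = [4, 3, 2, 5]        # hypothetical outlays K_j
R = [10, 7, 6, 12]      # hypothetical returns R_j
print(capital_budget(K, R, N=2, M=7), capital_budget(K, R, N=1, M=7))
```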

10.4 DISTRIBUTION OF EFFORT - NESTED 

PROBLEM 

The Amazing Products Chemical Company, which is divisionally organized,
annually allocates funds for research and development (R & D) projects. Each of
the $s$ divisions submits information in three categories. The first pertains to research
activities of an exploratory and highly speculative nature. If $v_j$ thousand dollars
are allocated to such projects in Division $j$, the long-term expected return is
$P_j(v_j)$ million dollars. The second category refers to products that have been
researched and are ready for development and field testing. In these cases, an
outlay of $w_j$ thousand dollars is expected to return $Q_j(w_j)$ million dollars over
the long run. The final category is development work to improve existing
manufactured products. An expenditure of $x_j$ thousand dollars is forecasted to yield a
total of $R_j(x_j)$ million dollars additional revenue.

The board of directors authorizes a total budget of $N$ thousand dollars for all
R & D projects, and the chief executive places an upper limit $L_j$ on the amount
that Division $j$ can receive. The vice-president responsible for R & D management
is to allocate the funds so as to maximize the company's total return subject to
the several budget restrictions.

The mathematical model of the vice-president's problem is

(1)  maximize $\sum_{j=1}^{s} [P_j(v_j) + Q_j(w_j) + R_j(x_j)]$

subject to

(2)  $\sum_{j=1}^{s} (v_j + w_j + x_j) \le N$  (company R & D budget)

(3)  $v_j + w_j + x_j \le L_j$  for $j = 1, 2, \ldots, s$  (division budget limit)

(4)  $v_j, w_j, x_j$ nonnegative integers for each $j$.

Since there is only one budget constraint (2) imposed on all the policy
variables, and the other budget and integer-value constraints (3) and (4) pertain
to Division $j$, you can expect the distribution of effort problem for one constraint
to apply here. Pursuing this line of reasoning, you obtain the recursion

(5)  $g_j(n) = \max_{v_j, w_j, x_j} [P_j(v_j) + Q_j(w_j) + R_j(x_j) + g_{j-1}(n - v_j - w_j - x_j)]$
     for $j = 1, 2, \ldots, s$,

where $n = 0, 1, \ldots, N$ and the maximization is over only nonnegative integer
values of $v_j$, $w_j$, and $x_j$ that satisfy $v_j + w_j + x_j \le \min(L_j, n)$.

Notice a new difficulty appears in (5). Whereas in the previous distribution of
effort examples you had only one variable $y_j$ to vary in the search for a maximum
at each stage, now you must solve an optimization problem involving three
variables. Of course, you could try to enumerate all possibilities, but that is not
necessary. In this particular illustration, at each stage you are able to use the distribution
of effort technique to solve the maximum problem, which can be stated as

(6)  maximize $P_j(v_j) + Q_j(w_j) + R_j(x_j)$

subject to

(7)  $v_j + w_j + x_j = y$,

where $v_j$, $w_j$, and $x_j$ must be nonnegative integers. You need the solution for each
value $y = 0, 1, \ldots, L_j$.

The recursive approach for (6) and (7) is to let

(8)  $p_j(y) = P_j(y)$  for $y = 0, 1, \ldots, L_j$,

(9)  $q_j(y) = \max_{w_j} [Q_j(w_j) + p_j(y - w_j)]$  for $y = 0, 1, \ldots, L_j$,

where the maximization is over only nonnegative integer values of $w_j \le y$, and

(10)  $r_j(y) = \max_{x_j} [R_j(x_j) + q_j(y - x_j)]$  for $y = 0, 1, \ldots, L_j$,

where the maximization is over only nonnegative integer values of $x_j \le y$.

In summary, for each $j$ you use $p_j(y)$ from (8) and (9) to find $q_j(y)$, and use
$q_j(y)$ and (10) to find $r_j(y)$, for $y = 0, 1, \ldots, L_j$. Then you restate and solve (5)
by the relation

(11)  $g_j(n) = \max_{y} [r_j(y) + g_{j-1}(n - y)]$  for $j = 1, 2, \ldots, s$,

where $n = 0, 1, \ldots, N$ and the maximization is over only nonnegative integer
values of $y$ that satisfy $y \le \min(L_j, n)$. Thus this problem requires
nesting $s$ distribution of effort calculations within an overall distribution of
effort model.
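
The nesting of (8) through (10) inside (11) can be sketched as two small functions. This is a hedged illustration: the function names, the boundary condition g_0(n) = 0, and the return tables for the two divisions are assumptions and do not appear in the text.

```python
def inner_return(P, Q, R, L):
    """Relations (8)-(10): r_j(y) for y = 0, ..., L_j.

    P, Q, R are lists indexed by the amount allocated (0, ..., L).
    """
    p = [P[y] for y in range(L + 1)]                                       # (8)
    q = [max(Q[w] + p[y - w] for w in range(y + 1)) for y in range(L + 1)]  # (9)
    r = [max(R[x] + q[y - x] for x in range(y + 1)) for y in range(L + 1)]  # (10)
    return r


def nested_effort(divisions, N):
    """Relation (11) applied across divisions; assumes g_0(n) = 0."""
    g = [0] * (N + 1)
    for P, Q, R, L in divisions:
        r = inner_return(P, Q, R, L)
        g = [max(r[y] + g[n - y] for y in range(min(L, n) + 1))
             for n in range(N + 1)]
    return g[N]


# Hypothetical data: two divisions, each limited to L_j = 2, budget N = 3.
division_1 = ([0, 3, 4], [0, 2, 5], [0, 1, 2], 2)
division_2 = ([0, 4, 6], [0, 1, 2], [0, 2, 3], 2)
r1 = inner_return(*division_1)
total = nested_effort([division_1, division_2], 3)
print(r1, total)
```

Each call to `inner_return` is itself a three-activity distribution of effort calculation, which is the nesting described above.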

*10.5 LINEAR PROGRAMMING WITH 
INTEGER VARIABLES 

The approaches illustrated for the distribution of effort models extend
immediately to the model

(1)  maximize $\sum_{j=1}^{s} c_j y_j$

subject to

(2)  $\sum_{j=1}^{s} a_{ij} y_j = b_i$  for $i = 1, 2, \ldots, k$

(3)  $y_j = 0, 1, 2, \ldots$  for each $j$,

where each $a_{ij}$ and $b_i$ is a nonnegative integer. A recursion applicable to this model
is

(4)  $F(n_1, n_2, \ldots, n_k) = \max_j [c_j + F(n_1 - a_{1j},\; n_2 - a_{2j},\; \ldots,\; n_k - a_{kj})]$,

where, for each $i = 1, 2, \ldots, k$, we consider $n_i = 0, 1, \ldots, b_i$, and the
maximization is over each value $j = 1, 2, \ldots, s$ that satisfies $a_{ij} \le n_i$ for every $i = 1,
2, \ldots, k$.

This approach is completely impractical for problems of ordinary size, however,
since the recursion requires finding (4) for every possible state $(n_1, n_2, \ldots, n_k)$.
To illustrate, if each $a_{ij}$ equals either 0 or 1, and every $b_i = 1$, then each $n_i$ can
equal 0 or 1. Consequently, $(n_1, n_2, \ldots, n_k)$ are enumerated by letting each $n_i$
be 0 and 1, altogether $2^k$ possibilities. If $k$ is 100, which is not a very large
linear programming application, more than $10^{30}$ possible $(n_1, n_2, \ldots, n_k)$ exist,
which implies a number of calculations far beyond the capability of electronic
computers. Therefore the multistage approach (4) is not feasible unless $k$ is very
small. We will return to this model in Chap. 13.
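
For a very small k the recursion (4) can be carried out directly, as in the following memoized sketch. The function name, the boundary value F(0, ..., 0) = 0, the use of minus infinity to mark states with no feasible completion, and the tiny data set are all assumptions introduced for illustration. The last line simply checks the state count quoted above for k = 100.

```python
from functools import lru_cache


def integer_program_dp(c, a, b):
    """Recursion (4) over states (n_1, ..., n_k), starting from (b_1, ..., b_k).

    a[i][j] is the coefficient of y_j in constraint i.
    """
    k, s = len(b), len(c)
    # Every column needs a positive entry, or the recursion would not terminate.
    assert all(any(a[i][j] > 0 for i in range(k)) for j in range(s))

    @lru_cache(maxsize=None)
    def F(state):
        if not any(state):
            return 0                        # assumed boundary: F(0, ..., 0) = 0
        best = float("-inf")                # no feasible completion found yet
        for j in range(s):
            if all(a[i][j] <= state[i] for i in range(k)):
                nxt = tuple(state[i] - a[i][j] for i in range(k))
                best = max(best, c[j] + F(nxt))
        return best

    return F(tuple(b))


# Hypothetical instance with k = 2 constraints and s = 3 variables.
value = integer_program_dp(c=[3, 2, 4], a=[[1, 0, 1], [0, 1, 1]], b=[1, 1])
print(value)
print(2 ** 100 > 10 ** 30)    # number of 0-1 states when k = 100
```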

10.6 EQUIPMENT REPLACEMENT MODEL 

In Sec. 6.5, pp. 180-181, you learned how an acyclic network can be employed
to represent a problem of determining a minimum-cost equipment replacement
policy over an $(N - 1)$-year horizon. Review that material before proceeding.
In that section, $c_{ij}$ represented the entire cost associated with using the machinery
from the beginning of Period $i$ until the beginning of Period $j$, when it is
replaced. Although the example in Sec. 6.5 referred to the leasing of equipment,
the model is equally applicable to the situation in which a company purchases
the unit. In this case, $c_{ij}$ is the sum of the purchase price and operating costs of the
new equipment, less the salvage value of the old machine at the beginning of
Period $j$. The problem is of the regeneration type, in that a replacement policy is
tantamount to a list of periods in which new equipment is purchased. Let

(1)  $f_n$ = the minimum replacement-policy cost for Periods $n, n + 1, \ldots, N - 1$,
     assuming that a new machine is purchased at the start of Period $n$.

Then we seek the value of $f_1$, and the dynamic recursion appropriate to such a
point of view is

(2)  $f_n = \min_{k = n+1, \ldots, N} [c_{nk} + f_k]$  for $n = N - 1, N - 2, \ldots, 1$

(3)  $f_N = 0$.

Give a verbal description of (2). The associated acyclic network is represented for
$N - 1 = 5$ in Fig. 10.4.
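
A short sketch of (2) and (3) follows. It assumes that the minimization in (2) runs over k = n + 1, ..., N; the function name and the cost table for a three-year horizon are hypothetical.

```python
def replacement_schedule(c, N):
    """Recursion (2)-(3): f_n = min over k > n of [c[n, k] + f_k], f_N = 0.

    c[n, k] is the cost of a machine bought at the start of Period n and
    replaced at the start of Period k. Returns (f_1, purchase periods).
    """
    f = {N: 0}
    next_purchase = {}
    for n in range(N - 1, 0, -1):
        f[n], next_purchase[n] = min(
            ((c[n, k] + f[k], k) for k in range(n + 1, N + 1)),
            key=lambda t: t[0])
    purchases, n = [], 1
    while n < N:                    # follow the regeneration points forward
        purchases.append(n)
        n = next_purchase[n]
    return f[1], purchases


# Hypothetical costs for N - 1 = 3 years (N = 4).
c = {(1, 2): 10, (1, 3): 17, (1, 4): 30,
     (2, 3): 10, (2, 4): 18,
     (3, 4): 10}
cost, purchases = replacement_schedule(c, 4)
print(cost, purchases)
```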



► The same problem can be approached from a somewhat different dynamic view.
Assume that the costs incurred by any replacement policy are comprised of the quantities

      $p_{in}$ = net cost of selling (salvaging) an $i$-year-old machine in
      Period $n$ and replacing it with a new machine
(i)
      $k_{ni}$ = cost in Period $n$ of operating a machine that at the end
      of the period is $i$ years old.

Let

(ii)  $f_n(i)$ = minimum replacement-policy cost for Periods $n, n + 1, \ldots, N - 1$
      when Period $n$ is entered with an $i$-year-old machine.

Observe that the subscript notation in (ii) differs in this example from previous examples.
Here $n$ refers actually to Period $n$ and beyond, until the end of the horizon. (In the
inventory examples, $n$ indicated the stage at which $n$ periods remained before the end of
the horizon.)

If it is optimal to keep the machine entering Period $n$, then

(iii)  $f_n(i) = k_{n,i+1} + f_{n+1}(i + 1)$  (keep),

but if it is optimal to replace it, then

(iv)  $f_n(i) = p_{in} + k_{n1} + f_{n+1}(1)$  (purchase).

Thus

(v)  $f_n(i) = \min [k_{n,i+1} + f_{n+1}(i + 1),\; p_{in} + k_{n1} + f_{n+1}(1)]$
     for $n = 1, 2, \ldots, N - 1$,

where

(vi)  $f_N(i) = 0$.

The value of $f_1(i_0)$ is sought, where $i_0$ is the age of the machine at the start of the
planning horizon. If no machine exists at that time, then $p_{i_0,1}$ is merely the cost of purchasing
a new machine, and the keep decision is not applicable for $n = 1$.

Figure 10.5 is a pictorial representation of (v). The exhibit assumes that no machine
exists prior to $n = 1$, and $N - 1 = 5$. The nodes are labeled $(i, n)$ corresponding to
the notation in (v). What arcs represent the purchase decision? Compare the relative
merits of (2) and (v), using Figs. 10.4 and 10.5 to justify your opinions. ◄
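
The keep-or-purchase recursion (v) with boundary (vi) lends itself to a memoized sketch. Everything numerical below is hypothetical: the purchase price of 20, the salvage schedule, the operating cost of 5 per year of age, the starting age i_0 = 2, and the function names are assumptions for illustration, and the costs are taken to be the same in every period.

```python
from functools import lru_cache


def replacement_cost(N, i0, p, k):
    """Recursion (v)-(vi): returns f_1(i0).

    p(i, n): net cost of salvaging an i-year-old machine in Period n and
             replacing it with a new one.
    k(n, i): cost in Period n of operating a machine that is i years old
             at the end of the period.
    """
    @lru_cache(maxsize=None)
    def f(n, i):
        if n == N:
            return 0                                  # (vi): f_N(i) = 0
        keep = k(n, i + 1) + f(n + 1, i + 1)          # (iii)
        purchase = p(i, n) + k(n, 1) + f(n + 1, 1)    # (iv)
        return min(keep, purchase)

    return f(1, i0)


SALVAGE = [0, 10, 6, 3, 1]        # hypothetical salvage value by age 0..4


def p(i, n):
    """Hypothetical net replacement cost: price 20 less salvage value."""
    return 20 - (SALVAGE[i] if i < len(SALVAGE) else 0)


def k(n, i):
    """Hypothetical operating cost that grows with age."""
    return 5 * i


print(replacement_cost(N=5, i0=2, p=p, k=k))
```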

10.7 STRUCTURE OF MULTISTAGE ANALYSIS 

Having seen a variety of dynamic programming applications and explored a 
few in depth, you may find it helpful to summarize the essential features in the 
approach. The remaining sections provide an overview of dynamic programming 
models. 

The constrained optimization problems encompassed by dynamic programming
are really no different conceptually from those you already saw prior to Chap. 8,
as well as those you will study later on. Only the point of view for analyzing the
model is decidedly novel. The dynamic programming approach attacks an
optimization problem with multifold constraints and many variables by splitting the
problem into a sequence of stages in which lower-dimension optimization takes
place. In contrast, most linear and other nonlinear programming approaches
attempt to solve such problems by considering all the constraints simultaneously.

The dynamic programming approach casts a problem into the following
structure:

(i) The decision variables with their associated constraints are grouped
according to stages, and the stages are considered sequentially.

(ii) The only information about previous stages relevant to selecting optimal
values for the current decision variables is summarized by a so-called state
variable, which may be n-dimensional.

(iii) The current decision, given the present state of the system, has a
forecastable influence on the state at the next stage.

(iv) The optimality of the current decision is judged in terms of its forecasted
economic impact on the present stage and on all subsequent stages.

► Unlike linear programming, which refers to a specific mathematical model that can
be solved by a variety of techniques (the most notable being the simplex method),
dynamic programming deals with a particular analytic approach, which can be applied
to a variety of mathematical models. There is a canonical form that conveniently
illustrates the structure of points (i) through (iv) above.

Let the symbol $s$ stand for a state of the system and $S_n$ the collection or set of all
possible states at Stage $n$. Define $d_n$ as the decision made at Stage $n$, and let $D_n(s)$
designate all the feasible values for $d_n$ given that the system is in State $s$. Finally, given
that the system is in State $s$, let $R_n(s, d_n)$ denote the immediate economic return of
decision $d_n$, and $T_n(s, d_n)$ the transformed state of the system at Stage $n - 1$. Then a common
form for a dynamic programming recursion is

(DPR)  $f_n(s) = \operatorname{optimum}_{d_n \text{ in } D_n(s)} \{ R_n(s, d_n) + f_{n-1}[T_n(s, d_n)] \}$  for every $s$ in $S_n$,

where optimum means maximum or minimum, depending on the particular context.
All the recurrence relations in Chaps. 8 through 10 were of this form. Take each point
from (i) through (iv) and show exactly how it affects the appearance of (DPR). Point (ii)
is sometimes referred to as the Markov property. State which point contains the principle
of optimality, given in Chap. 8: an optimal policy must have the property that regardless
of the route taken to enter a particular state, the remaining decisions must constitute an
optimal policy for leaving that state.
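
The canonical form (DPR) can be written once and reused by supplying the sets and functions as arguments. The sketch below is one possible rendering under stated assumptions: the name `solve_dpr`, the argument names, the boundary condition f_0(s) = 0, and the two return tables in the usage example are hypothetical.

```python
def solve_dpr(num_stages, states, decisions, ret, trans, optimum=max):
    """Generic (DPR): f_n(s) = optimum over d of {R_n(s, d) + f_{n-1}[T_n(s, d)]}.

    states(n)       -> iterable of the states in S_n
    decisions(n, s) -> iterable of the feasible decisions D_n(s)
    ret(n, s, d)    -> immediate return R_n(s, d)
    trans(n, s, d)  -> transformed state T_n(s, d) at Stage n - 1
    Assumes the boundary condition f_0(s) = 0 for every s in S_0.
    """
    f = [{s: 0 for s in states(0)}]
    policy = [{}]
    for n in range(1, num_stages + 1):
        f_n, d_n = {}, {}
        for s in states(n):
            # Only the values f_{n-1} are needed, not the earlier decisions.
            f_n[s], d_n[s] = optimum(
                ((ret(n, s, d) + f[n - 1][trans(n, s, d)], d)
                 for d in decisions(n, s)),
                key=lambda t: t[0])
        f.append(f_n)
        policy.append(d_n)
    return f, policy


# Usage: a two-activity distribution of effort with 3 units (made-up returns).
tables = {1: [0, 3, 5, 6], 2: [0, 4, 5, 5]}
f, policy = solve_dpr(
    num_stages=2,
    states=lambda n: range(4),
    decisions=lambda n, s: range(s + 1),
    ret=lambda n, s, d: tables[n][d],
    trans=lambda n, s, d: s - d,
)
print(f[2][3], policy[2][3])
```

Passing `optimum=min` gives the minimization variant without other changes.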

You should keep in mind an important feature of (DPR). The optimization at Stage $n$
does not require the values of the decision variables which bring about the return
$f_{n-1}[T_n(s, d_n)]$. The "bootstrapping" nature of the recursion means that at each stage you
only optimize with respect to a limited number of decision variables, and you retain for
further calculation only the objective function values. This essential simplicity is what
makes the method attractive. Employing linear programming terminology, you can
think of the dynamic programming process as sequentially obtaining values for the dual
variables. Such an analogy is exact in the examples of this chapter for the reasons to
follow.

In all of the illustrations, the stage, state, and decision variables were represented by
integer numbers. (The term discrete variable problem is applied to such
situations.) In addition, the decision processes extended over a finite time span or number of
stages. Finally, no probabilistic elements influenced the outcomes and transitions from
state to state.

All these assumptions made it possible to analyze (DPR) in terms of finding a best
route in an acyclic network, such as you studied in Chap. 7. To construct the network,
you establish a Node $[s, n]$ for every possible value of $s$ and $n$ in the dynamic
programming recursion (DPR). Each $d_n$ in $D_n(s)$ results in an arc from Node $[s, n]$ to Node
$[T_n(s, d_n), n - 1]$ with arc cost $R_n(s, d_n)$. The State $s$ at the final stage is specified (for
example, as "ending inventory must equal zero"). Therefore, this node is taken as the
terminal of the network, and $f_n(s)$ represents the length of a best route to the terminal
from Node $[s, n]$.

In the linear programming characterization of the network problem, $f_n(s)$ represents
the value of the dual variable for the conservation of flow equation at Node $[s, n]$. Of
course, to solve (DPR) you never need to draw the acyclic network, but you should note
two implications of the network equivalence. First, the equivalence helped you compare
the relative computational merits of alternative dynamic programming
formulations by analyzing the complexity of the implied acyclic networks. A rough but usually
adequate rule is that the network with the fewest arcs is to be preferred. (This guide is
qualified because some formulations require substantial "side calculations" to determine
the arc traversal costs prior to applying the best-route algorithm.)

Second, once you characterized a model by a particular recursion, nothing truly
innovative appeared in the technique of solution. New considerations do arise, however,
in dynamic programming problems that are not discrete, finite, and deterministic. You
will study several of these in Chaps. 11, 12, 17, and 18. ◄

10.8 INSIGHTS TO DYNAMIC PHENOMENA 

The dynamic programming approach by its very nature is conducive to studying 
questions of time horizon sensitivity. The various inventory models in Chaps. 8 
and 9 demonstrated that an optimal policy can depend essentially on the length 
of the planning horizon. Several important ideas emerged from the analysis. 

One is the notion of a strategy. By solving a dynamic programming
recursion, you are in effect determining an optimal policy for every possible value of
the state variable at each stage. Thus you obtain a prescription of what to do in
every eventuality. To observe the consequences of this, consider a production
scheduling model as an example. Suppose you act as if demand and the cost
functions do not change over time, and you repeatedly consider your horizon to be N
periods ahead. Then all you need is the optimal production schedule for each
State s when N periods are left to the end of the horizon. Each period you ascertain
State s and schedule accordingly. Another impact is that the only thing that
matters, when n periods remain, is the current State s and not how you arrived there.
So if your decision does not turn out the way you expect, you can let bygones be
bygones in your next decision. In Chaps. 11 and 12 the notion of a strategy will
play a central role, except there you will learn how to find an optimal policy
when an unbounded time horizon assumption is built right into the analysis.

Another concept of significance is determining the form of an optimal policy.
By an auspicious characterization of a dynamic process into a sequence of stages,
you sometimes can exploit the special properties of the constraints and
economic functions to discover an optimal type of policy. One illustration is
regeneration type processes, such as the equipment replacement example in Sec. 10.6.
Whenever a regeneration point occurred, you effectively could split the span of
optimization into "before" and "after," and handle each part separately.

The preceding two notions of finding a strategy and the form of an optimal 
policy are closely related to the next. A planning horizon theorem, when available, 
can provide key information on the way an optimal policy is influenced by dynamic 
considerations. 

► You observed two instances, in Secs. 9.4 and 9.7, of the way particular economic
structures can be helpful for discovering horizon properties. In the convex cost case,
you saw an illustration in which the only possible effect of lengthening the planning
span was to increase the value of the production decision variables. In the concave cost
case, you noted that it was possible to ascertain when a horizon occurred without
recourse to information beyond the regeneration event. ◄

10.9 COMPUTATIONAL FEASIBILITY 

For the problems illustrated in Chaps. 8 through 10, dynamic programming 
offers a vast improvement over complete enumeration of all possibilities. The 
feasibility of the technique depends critically on the dimension of the state variable, 
and this is one reason why skill in formulation is necessary. The difference between 
a clumsy and a clever characterization can be the deciding factor in whether the 
approach is workable. 

As the examples of this chapter show, the approach can be applied to
small-scale nonlinear and integer-value problems. Because no satisfactory canonical
form exists for all of these (not even for finite, discrete, deterministic models),
there is no single efficient computer program to handle all dynamic programming
problems. But the form of the dynamic programming recursion is usually simple
enough so that it is not much trouble to write a computer program for the
particular model to be solved. A number of technical difficulties must be overcome in
such programs, but they mainly concern the specialist and will not be discussed
here. It suffices to say that in most practical problems with a state variable of one
or two dimensions, designing a computer program offers no insurmountable
obstacles. Except in rare cases (such as the convex cost inventory model of Sec. 9.3,
where the algorithm is quite simple), a computer approach is required, if for no
other reason than to eliminate the tedium of the calculations.

Questions of convergence have not arisen in Chaps. 8 through 10, since every
model essentially involved only a shortest-route problem. But these considerations
will reappear in the next two chapters, when you study models with unbounded
horizons and solution techniques based on successive approximations.

Again due to the variation in types of models solved by dynamic programming,
it has not been possible to provide any universally applicable technique of
sensitivity analysis. Most sensitivity questions that are raised in dynamic programming
models tend to affect several costs or constraints simultaneously. Then the formulas
required for the analysis need to be worked out for the specific problem.

10.10 APPLICABILITY OF DYNAMIC PROGRAMMING 

The results of Chaps. 8 through 10 can be put into two categories. Much of the
analysis deepened your understanding of phenomena in dynamic optimization
models. The remainder concerned applications of dynamic programming to
important industrial problems. By far the most frequent applications of the
technique deal with inventory replenishment, production scheduling, and certain
regeneration type processes, including equipment replacement. Because a number
of dynamic programming problems involve probabilistic elements and unbounded
horizons, you should consider the examples you have seen as just an introduction
to the subject. There are many more illustrations in the chapters to come.

The statement made at the beginning of the chapter warrants repeating in these 
closing remarks. The practical limitation on the dimension of the state variable 
in dynamic programming tends to narrow the focus of problems solved by the 
technique. Only infrequently can all the salient factors in an economic problem 
of any magnitude be boiled down in such a way as to make feasible the numerical 
solution to a dynamic programming formulation. This realization does not 
detract from the value of dynamic programming; rather, it puts into focus certain 
features that distinguish typical applications of dynamic programming from those 
of linear programming. 

Specifically, many real applications of dynamic programming deal with
operating decisions: when to reorder, when to make a machine setup, when to replace
a piece of equipment, etc. Many of these decisions are ordinarily delegated by top
management to lower echelons. Therefore, implementation of a dynamic
programming model involves either using a computer to assist in routine decision-making
or providing lower echelons with rules or tables based on an optimal strategy
calculation. The limited impact of a single decision found by the usual dynamic
programming applications, and the ease with which this decision can be
determined, are the very reasons for the technique's success. The approach relieves top
management of the burden of reviewing a multitude of individual actions, each
having minor import. However, the senior executive is able to exercise his
policy-making responsibilities effectively by selecting an appropriate model containing
what he feels are the significant economic and technological considerations.

REVIEW EXERCISES

1 Suppose the restriction (2) in Sec. 10.2 is generalized to the linear constraint

(where $H_1 = 1$). Revise (5) accordingly, and indicate the largest
value of $y_j$ that needs to be considered in the maximization process at Stage $j$.

2 Suppose you have a solution to the problem (1) through (3) in Sec. 10.2 for a specified
value of $N$, and now you want to find the solution for the value $N + 1$. Indicate the
additional computation required, assuming you use recursion (5) to obtain the answer.

3 Shopping Basket Markets Problem (Sec. 10.2). Suppose when eggs are shipped to Store $j$
that a fraction $f_j$ ($0 \le f_j < 1$) breaks in transit. Revise recursion (5) accordingly.

4 Shopping Basket Markets Problem (Sec. 10.2). Suppose the owner has found from past
experience that the profit function $R_j(y_j)$ increases as $y_j$ increases up to a point, and
then decreases for larger values of $y_j$ (too many eggs in a store create costly handling
and storage problems). Indicate how (if at all) you would modify the formulation and
resultant dynamic programming recursion to permit shipping less than $N$ eggs to all
the stores.

5 In describing the distribution of effort problem in Sec. 10.2, we assumed, for simplicity,
that each $H_j(y_j)$ is a nondecreasing function, is integer-valued for $y_j = 0, 1, 2, \ldots$,
and satisfies $H_j(0) = 0$; in addition, we postulated that $H_1(y_1) = y_1$. Explain what
complications arise when each of these four assumptions is dropped, and how the
recursion (10) must be modified (if at all).

Exercises 6 through 12 refer to the numerical example (12) and (13) in Sec. 
10.2, and the solution in Fig. 10.2. 

6 (a) Verify the calculations in Fig. 10.2 for j = 2 and n = 3, 4, 5, 6, and for j = 3
and n = 0, 1, ..., 8.

(b) Indicate all optimal solutions for N = 4, 5, 6, 7. 

7 Assume the upper-bound restrictions are changed to y1 ≤ 3, y2 ≤ 3, and y3 ≤ 2.
Determine all optimal solutions for

(a) N = 4, 5, 6, 7, 8.

(b) N = 9, 10.

8 For expositional simplicity, the three stores were arbitrarily indexed 1, 2, 3, and the 
recursion (10) was applied to the specified indexing. If the stores are relabeled, the 
computations then will be carried out in a different order, but the end results will be 
identical. Let the first two columns in Fig. 10.1 denote Store A, the next two columns 
Store B, and the last two columns Store C. The recursion (10) and the results in Fig. 
10.2 correspond to letting j = 1 refer to Store A, j = 2 to Store B, and j = 3 to Store
C. Apply the algorithm (10) to the following alternative assignments of the stores to
the indexes:

(a) j = 1 to Store A, j = 2 to Store C, and j = 3 to Store B.

(b) j = 1 to Store C, j = 2 to Store B, and j = 3 to Store A.

(c) j = 1 to Store B, j = 2 to Store A, and j = 3 to Store C.

(d) j = 1 to Store C, j = 2 to Store A, and j = 3 to Store B.

(e) j = 1 to Store B, j = 2 to Store C, and j = 3 to Store A.

9 Suppose the return function for j = 3 is changed to R3(y3) = 5y3. Use recursion (10)
to determine all optimal solutions for

(a) N = 4, 5, 6, 7, 8.

(b) N = 9, 10.

(c) Answer parts (a) and (b) when all the bound restrictions on yj are dropped.

*10 Solve exercise 9, part (c), using recursion (ii) in the special material at the end of 
Sec. 10.2. Compare the computational effort of using recursion (10) versus that of 
using recursion (ii). 

*11 In each part below, formulate the lot-size inventory model as (vi) and (vii) in the
advanced material at the end of Sec. 10.2; indicate the appropriate value for each Rj.
Solve the problem using recursion (viii). Also solve the problem using recursion (x), 
and indicate the correspondence between recursions (x) and (xi). Use the data in the 
following exercises in Chap. 9. 

(a) Exercise 13. (d) Exercise 43, part (a). 

(b) Exercise 14. *(e) Exercise 43, part (f). 

(c) Exercise 15. 

12 Suppose the restriction y1 + 3y2 + y3 ≤ 4 is added to the example (12) and (13) in
Sec. 10.2, and the restriction (8) is written as an inequality (≤). Determine all
optimal solutions for N = 4, 5, 6, 7, 8.

13 In describing the distribution of effort problem with two constraints in Sec. 10.3, we
assumed, for simplicity, that the constraints (2) and (3) are inequalities, and that each
Kj(yj) is nonnegative and integer-valued. Explain what complications arise if each of
these assumptions is dropped, and what is the resultant impact on computing a
solution with recursion (5).

14 (a) Suppose the restriction (2) in Sec. 10.3 is replaced by Σ_{j=1}^{s} Hj(yj) ≤ N, where
the value of Hj(yj) is a nonnegative integer for yj = 0, 1, .... Indicate how the
recursion (5) and the computational process must be modified.

(b) Suppose the restriction ^)^x^Ayi) ^ ^ is added to (1) through (4) in Sec. 
10.3, where the value of is a nonnegative integer for yj = 0, 1, ..., and

P is a positive integer. Indicate how the recursion (5) must be modified. Discuss 
the resultant increase in computational effort. 

15 Amazing Products Chemical Company Example (Sec. 10.4). Imagine that the board of 
directors has elected a new president who feels it is important to encourage the 
pursuit of basic research. Let yj thousand dollars denote an allocation of Division j for
such basic research, and Tj(yj) million dollars be the best estimate the division has
on the ultimate return to the company from this expenditure on basic research. The
total allocation to basic research is drawn from the company’s overall R & D budget,
and the amount yj is drawn from the division’s budget. Show how the formulation (1)
through (4) must be modified to include the allocations for basic research. Indicate
the necessary changes in the recursion (5), and the maximization process (6) and (7).
Show how to appropriately modify the approach given by (8) through (11). (Be sure
to define your notation and symbols.)

16 Amazing Products Chemical Company Example (Sec. 10.4). Imagine that there is a
conservative member of the board of directors who insists that the total amount of money
the company allocates to exploratory and speculative projects does not exceed V
thousand dollars. Show how the formulation (1) through (4) must be modified.
Indicate the appropriate recursion, the corresponding minimization process in (6) and
(7), and how to modify the approach given by (8) through (11). (Be sure to define
your notation and symbols.)

*17 Explain how the linearity of the objective function and the constraints is exploited
in the recursion (4) of Sec. 10.5. [For example, comment on why (4) would be
inappropriate if the objective function were Σ_j Cj(yj), where at least one of the
functions Cj(yj) were nonlinear.]

*18 (a) Use the recursion (4) in Sec. 10.5 to solve the problem 

maximize Zy^ -j- 4^2 + 6^3 + 8y^ 6^5 

subject to 

+ 2y. 4- 2y, -f 3y, H- ]y, < 3 
2^1 + 1>'2 + 2>’3 4- 1^'4 4- 3^5 ^ 4 
every yj a nonnegative integer.

(b) By how much does the objective function decrease when the right-hand-side
constant in the first constraint is 2, instead of 3? When the right-hand-side
constant in the second constraint is 3, instead of 4? When the right-hand-side
constants in both the first and second constraints are decreased by 1?

(c) By how much does the objective function increase when the right-hand-side
constant in the first constraint is 4, instead of 3? When the right-hand-side constant
in the second constraint is 5, instead of 4? When the right-hand-side constants in
both the first and second constraints are increased by 1?

*19 Equipment Replacement Problem. In each part below, find an optimal replacement policy 
using recursion (v) in the special material at the end of Sec. 10.6. (Be sure to indicate 
the values for each and k^t.) Assume that a replacement must be made at the 
start of Period 1. Compare the algorithmic process with recursion (2). 

(a) The case of the Rhode-Bloch Trucking Company, given in exercise 15, part (d),
Chap. 6, and exercise 68 of Chap. 7.

(b) The case of the Goode-O’Toole Company, given in exercise 41 of Chap. 6 and
exercise 69 of Chap. 7.

*20 Consider the recursion (DPR) in the special material of Sec. 10.7. 

(a) Indicate how each of the points (i) through (iv) affects the appearance of (DPR).

(b) Show the correspondence between each symbol in (DPR) and the appropriate
symbol in the recursion (5) of Sec. 10.2. In the recursion (10) of Sec. 10.2. In the
recursion (5) of Sec. 10.3. In the recursion (5) of Sec. 10.4.

(c) Draw an acyclic network associated with the application of recursion (10) in
Sec. 10.2 to the data in Figs. 10.1 and 10.2. [Label the nodes and arc costs
appropriately; indicate the contents of each Dn(s), and display each Tn(s, dn).]

21 Explain your understanding of the following terms: 

distribution of effort problem *discrete variable problem 

nested problem strategy, 

multistage analysis 


FORMULATION AND COMPUTATIONAL EXERCISES 

In many of the exercises below, you are asked to formulate a model in terms of 
a dynamic programming recursion. Be sure to define all the symbols you use, and 
answer the five questions at the end of Sec. 10.1. Give the appropriate optimization
function when there is a single period remaining as well as the recursion for
Stage n. Explain how to initiate and when to terminate the calculations.

22 Knapsack Problem. Ben Dover, an exuberant mountain climber, is preparing for a lengthy
hike up a dangerous slope. He can manage up to W pounds in his knapsack, which
he carries on his back. He has N different types of items that he can include in his
pack, and each unit of Item j weighs wj pounds. For every Item j, he calculates a
numerical value Ej representing the essentiality of each unit of the item. For example,
if he packs five items of type 3 and seven items of type 9, the “value” to him of this
knapsack selection is 5E3 + 7E9. How many items of each type should he include
in his knapsack?

(a) Formulate the problem as a dynamic programming model. Explain why the 
decision problem would be trivial if it were possible to load fractional amounts 
of the items; indicate what would be an optimal selection in such a case. 

(b) Alter your answer in part (a) assuming that Dover places an upper limit on 
the number of units of each Item j that he packs. Explain why the decision 
problem is trivial if it is possible to load fractional amounts of the items; indicate 
an algorithm for finding an optimal selection in such a case. 

(c) Suppose Dover wants to select the lightest knapsack that has a value to him of at
least E. Show how to revise the formulations in parts (a) and (b).
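
One possible recursion for the integer version of the knapsack problem can be sketched as follows. This is a minimal illustration and not necessarily the formulation the text intends: it assumes positive integer weights, uses the remaining weight capacity as the only state variable, and runs on made-up data. The names `knapsack`, `weights`, and `values` are hypothetical.

```python
def knapsack(W, weights, values):
    """Integer knapsack with no upper limit on the units of each item.

    f[w] is the largest total value attainable with a weight limit of w.
    Assumes every weight is a positive integer (an assumption of this sketch).
    """
    f = [0] * (W + 1)            # f[0] = 0: an empty knapsack has no value
    choice = [None] * (W + 1)    # item added last in an optimal load for limit w
    for w in range(1, W + 1):
        for j, (wj, ej) in enumerate(zip(weights, values)):
            if wj <= w and f[w - wj] + ej > f[w]:
                f[w] = f[w - wj] + ej
                choice[w] = j
    # Trace back through the recorded choices to recover the item counts.
    counts = [0] * len(weights)
    w = W
    while w > 0 and choice[w] is not None:
        j = choice[w]
        counts[j] += 1
        w -= weights[j]
    return f[W], counts


if __name__ == "__main__":
    # Hypothetical data: three item types, weight limit of 10 pounds.
    best, counts = knapsack(10, [3, 4, 5], [4, 5, 7])
    print(best, counts)
```

If fractional amounts were allowed, no recursion would be needed: the whole capacity would go to the item with the largest value per pound, which is why the exercise calls that case trivial.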

23 Howie Kramms is a graduating senior and needs to do as well as possible on the final 
examinations in order to receive his degree. He divides his available weekend study 
time into 10 periods of equal length. He is taking four courses, two of which he judges 
are easy and two difficult. He estimates that for each “gut” course, he will earn no 
grade points if he does not study at all, two grade points if he studies either one or two 
periods, three grade points if he studies three periods, and four grade points if he studies 
four periods. Similarly, he estimates that for each difficult course, he will earn no grade
points if he does not study, only one grade point if he studies either one or two periods,
two grade points for studying three periods, three grade points for studying four periods, 
and four grade points if he studies five periods. Disregarding the consideration of 
whether he is overoptimistic in his assessments, how would you advise him to allocate 
his time to maximize the total grade points he receives? Formulate an appropriate 
dynamic programming model and find an optimal solution. How much would an 
extra study period be worth? What harm will be done if he goofs off for one study
period?

24 Libby Doe, a girl friend of Howie Kramms, whom you advised in exercise 23, has 
just arrived in town and has called Howie for a date. He reassesses his situation and 
decides that all he really needs are nine grade points to graduate. Now he wants to
allocate his time so that he spends the fewest number of study periods necessary to 
guarantee his receiving at least nine grade points. Formulate this decision problem as 
a dynamic programming model, and find an optimal solution. (If there is more than 
one optimal allocation, indicate the alternative solutions.) 

25 The United Fund wants to assign 10 of its volunteers to solicit contributions from 
the companies having offices in three large downtown buildings. The executive director 
estimates that if yj men are assigned to Building j, then total contribution pledges will
be Rj(yj) hundred dollars, where Rj(0) = 0 and

R1(1) = 5   R1(2) = 10  R1(3) = 15  R1(4) = 25  R1(5) = 35  R1(6) = 50  R1(7) = 55

R2(1) = 3   R2(2) = 6   R2(3) = 12  R2(4) = 18  R2(5) = 30

R3(1) = 20  R3(2) = 35  R3(3) = 45  R3(4) = 55  R3(5) = 60  R3(6) = 65.

(No additional pledges would be received by sending more than seven men to Building
1, more than five to Building 2, and six to Building 3.) How many volunteers should
be allocated to each of the buildings? Formulate an appropriate dynamic programming
model and find an optimal solution. Show how the solution changes if there are
eight volunteers. How does it change with 9, 11, or 12 volunteers?
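
A distribution of effort recursion of the kind used in Sec. 10.2 can be sketched for this exercise as follows. The sketch is illustrative only. It assumes the pledge table reads R1(7) = 55 and R2(5) = 30, treats the volunteer limit as an upper bound (harmless here because every Rj is nondecreasing), and uses the hypothetical names `allocate` and `RETURNS`.

```python
def allocate(returns, N):
    """Allocate at most N units among activities to maximize total return.

    returns[j][y] is the return from assigning y units to activity j, with
    returns[j][0] = 0; assigning more units than the table lists is not tried.
    """
    f = [0] * (N + 1)       # with no activities considered, the return is 0
    decisions = []          # decisions[j][n] = best y for activity j in state n
    for R in returns:
        g = [0] * (N + 1)
        d = [0] * (N + 1)
        for n in range(N + 1):
            best_y, best = 0, R[0] + f[n]
            for y in range(1, min(n, len(R) - 1) + 1):
                v = R[y] + f[n - y]
                if v > best:
                    best_y, best = y, v
            g[n], d[n] = best, best_y
        decisions.append(d)
        f = g
    # Recover one optimal allocation by tracing the stored decisions backward.
    plan, n = [], N
    for d in reversed(decisions):
        plan.append(d[n])
        n -= d[n]
    plan.reverse()
    return f[N], plan


# Pledges in hundreds of dollars, as read from the table in exercise 25.
RETURNS = [
    [0, 5, 10, 15, 25, 35, 50, 55],   # Building 1, up to seven men
    [0, 3, 6, 12, 18, 30],            # Building 2, up to five men
    [0, 20, 35, 45, 55, 60, 65],      # Building 3, up to six men
]

if __name__ == "__main__":
    for N in (8, 9, 10, 11, 12):
        print(N, allocate(RETURNS, N))
```

Because ties are broken in favor of the smaller allocation, the function reports only one optimal plan; finding alternative optima requires recording every maximizing y at each stage.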

26 Consider the case of the United Fund described in exercise 25. Suppose the executive
director is in short supply of volunteers and restates his allocation problem as follows.
He wants to use as few men as possible provided that they raise at least R hundred
dollars in the three buildings. Formulate this problem in terms of a dynamic programming
model, and find an optimal solution for

(a) R = 80.

(b) R = 90.

(c) R = 100. 

27 Consider the case of the United Fund described in exercise 25. Suppose the executive
director wants to allocate no more than M man-days to the solicitation in the three
buildings. He knows that if he assigns from one to three men to any building, they
require in toto 1 man-day to complete solicitation; if he assigns more than three men,
they will require 2 man-days. How many of the 10 volunteers should he allocate to
each of the buildings if

(a) M = 3?

(b) M = 4?

28 Consider the problem

maximize R1(y1) + R2(y2) + R3(y3)

subject to

2y1 + 3y2 + 4y3 ≤ W
each yj = 0, 1, 2, 3,

where each Rj(0) = 0 and

R1(1) = 3   R1(2) = 2   R1(3) = 8
R2(1) = 5   R2(2) = 11  R2(3) = 9
R3(1) = 9   R3(2) = 14  R3(3) = 15.

(a) Find an optimal solution for W = 8. Also, indicate solutions for W = 6 and
W = 7. Indicate any alternative optimal solutions.

*(b) Draw an acyclic network associated with the recursion that you employ. Label 
the nodes and arc values appropriately. 

29 Consider the Economic Order Quantity Problem for the case of a single item, as
described in Sec. 1.6. Suppose that the firm has N items, and is going to replenish
each one by means of a lot-size rule. Specifically, for each Item j, let Mj be the number
of units the company consumes per week, Kj the fixed setup cost of placing an order,
Cj the purchase cost per unit ordered, and hj the holding cost per unit per week.
Similarly, let Qj be the replenishment lot size for Item j. The average cost per week
for all the items is the summation of N expressions, each of the form (12) in Sec. 1.6.
Suppose the company places an upper limit L on the average value of all inventory
on hand, where each Item j contributes to this average. Show how a dynamic
programming formulation can be used to find optimal values for each lot size Qj.

*30 Consider the case of the Giant Electric Company, described in the special material
at the end of Sec. 10.3. Suppose s = 8, Kj = j, for j = 1, 2, ..., 8, and

R1 = 16  R2 = 28  R3 = 45  R4 = 48  R5 = 65  R6 = 66  R7 = 79  R8 = 80.

Find an optimal selection of projects for

(a) The project number limit N = 4 and the budget limit M = 10. Indicate a solution
for N = 2 and for N = 3. Also indicate a solution for N = 3 and M = 9.

(b) N = 3 and M = 11.

31 Consider the case of the Amazing Products Chemical Company described in Sec. 10.4. 
Let J = 4, each Lj = 5, N = 10, and 

Pj(vj) = ajP(vj) where P(0) = P(1) = P(2) = 0  P(3) = 1  P(4) = 8  P(5) = 17

Q^j{wj) = bjQ^{wj) where Q^(O) = 0 0,(2^) = 3 -f- for w; ^ 1 
Rj(xj) = cjR(xj) where R(0) = 0  R(1) = 8  R(2) = 9
R(3) = 9.5  R(4) = 9.75  R(5) = 10.

Find an optimal budget allocation when 

(a) All aj = bj = cj = 1.

(b) a1 = b1 = c1 = 1, a2 = b2 = c2 = 2, a3 = b3 = c3 = 3.


(c) a1 = a2 = a3 = 1, b1 = b2 = b3 = 2, c1 = c2 = c3 =

(d) Indicate the effect from letting N = 7, 8, 9 in each of the above parts. 

(e) Indicate the effect from letting N = 11 in part (a).

(f) Indicate the effect from letting each Lj = 4 in part (a). In part (b). In part (c).

32 Three brothers operate their father’s cattle ranch in Arizona called “Where the Sun’s 
Rays Meet.” At the beginning of a planning horizon of N years, the ranchers own S 
hundred cattle. The expense of caring and feeding for s head of cattle during Year n
is Cn(s). If the ranchers decide to send yn cattle to market at the end of Year n, they
receive Rn(yn) dollars of revenue. The herd not sent to market increases in size by
1.6-fold in the following year. (Thus, if none of the S cattle are sent to market at the
end of Year 1, then the herd size is 1.6S in Year 2.) Formulate a dynamic programming
recursion to determine how many cattle to send to market in each of the N periods. 

33 Capacity Expansion Problem. The Bill Deplant Company is planning an expansion
program over the next N years. At the beginning of Period 1, the firm has a capacity
of C0 units. The company estimates that it should have at least a capacity of Rt units
in Period t, for t = 1, 2, ..., N. If it expands its capacity by xt units in Period t,
the associated cost is Kt(c, xt), where c represents the capacity at the start of the period,
and xt is integer-valued. [If certain levels of xt are impossible to build, then the
corresponding value for Kt(c, xt) can be defined as arbitrarily large.] Assume that new
capacity becomes available early enough in a period that it can be used to meet the
requirement Rt. Suppose the company also incurs the cost Ht(C, R) in Period t of
operating a plant at the level R, when capacity at the end of the period is C units,
where R ≤ C. Finally, assume that if C units of capacity are available by the end of a
period, then only D(C) units are available in the beginning of the next period, due to
depreciation of the equipment in the plant. Formulate the expansion model in terms
of a dynamic programming recursion.

34 The ARKA Mutual Fund is unmatched in its ability to perform well on the stock 
exchange, because the fund is able to perfectly forecast the price of a given security. 
Despite this uncanny talent, ARKA must still resort to operations research to derive 
an optimal strategy for buying and selling. Specifically, assume that pt is the price of
the security in Period t, and that ARKA knows these prices over a horizon of T
periods. Let xt be the number of shares held by ARKA at the end of Period t,
where xt ≥ 0. Assume that at the start of Period 1, ARKA does not own any shares
of the stock. ARKA must pay a transaction cost C(xt − xt−1) for purchases and sales,
where C(0) = 0. The fund is also prohibited from speculating in that it cannot make
a purchase of shares that exceeds its cash on hand (where M is the cash on hand at
the start of Period 1), and it cannot sell short (that is, it cannot sell any shares it does
not own at the start of a period).

Formulate the problem in terms of a dynamic programming recursion, assuming that

(a) ARKA wants to maximize the total amount of cash it can accumulate by the
end of the horizon.

(b) ARKA places a value Vt(dt) on withdrawing the amount of cash dt in Period t
for distribution as dividend payments, and wants to maximize the sum of the 
Vt(dt) over the entire span of the horizon.

35 Van Fuller, the local dispatcher for a moving and storage firm, must arrange to
pick up X1 different shipments that require S1 units of space each, and X2 different
shipments that require S2 units of space each. (Thus the total space requirements for
the day are S1X1 + S2X2.) He has available N different trucks. Truck i has a capacity
of Ci units of space, and incurs an operating cost of Ei. (Assume that C1 + ... + CN ≥
S1X1 + S2X2, so that there is a feasible solution.) Fuller wants a minimum cost selection
of trucks for picking up all the loads. Formulate the decision problem as a dynamic
programming model, and indicate how the recursion optimally selects truck sizes and
allocates shipments to the trucks.

36 Great Auto Works Company (Sec. 2.6). Formulate the optimization problem in terms of 
a dynamic programming recursion. 

37 Consider the case of the Spartan army, described in exercise 31 of Chap. 2. Formulate 
the optimization problem in terms of a dynamic programming recursion. 

38 Consider the case of the Expando Manufacturing Company, described in exercise 36 
of Chap. 2. Formulate the optimization problem in terms of a dynamic programming 
recursion. 


MIND-EXPANDING EXERCISES 

In many of the exercises below, you are asked to formulate a model in terms of 
a dynamic programming recursion. Be sure to define all the symbols you use, and 
answer the five questions at the end of Sec. 10.1. Give the appropriate optimization 
function when there is a single period remaining and the recursion for Stage n. 
Explain how to initiate and when to terminate the calculations. 


39 Where-or-When Production Problem (Sec. 1.6). Consider the model discussed in Sec. 1.6: 

... Ax? 

iinimize > —^ 


subject to 


minimize • 


2 2 i^d each x^ > 0, 


where each Cj> 0, Use a dynamic programming formulation to establish the result 
(11), namely, that given the values for X|,X 2 , ..., the corresponding optimal 
value for Xt = Ct{D — — • • • — xt^^)l(ci 4 - ... + Of), 


40 Consider the problem 


maximize • 



Cn 


subject to 


2 = D and each x^ ^ 0,
where each C) > 0. Use a dynamic programming formulation to derive the form of
an optimal solution. Indicate how the optimal value of the objective function depends
on D.

41 Consider the problem 

s 

maximize ^ 

j-i 

subject to 

s 

N and each Xj nonnegative integer. 

Notice that all the functions in the objective function are identical.

(a) Suppose s is a power of 2 (such as 4, 8, 16, 32, ...). Devise a way to compute an
optimal solution using dynamic programming that does not require you to calculate
gj(n) for all j = 1, 2, ..., s.

*(b) How would you modify your approach in part (a) to accommodate any value for s?

42 Suppose a state’s legislature has R representatives. The state is sectioned into s districts,
where District j has a population pj and s < R. Under strictly proportional
representation, District j would receive Rjpj = rj representatives; this allocation is not
feasible, however, because rj may not be integer-valued. The objective is to allocate
yj representatives to District j, for j = 1, 2, ..., s, so as to minimize, over all of the
districts, the maximum difference between yj and rj, that is, minimize [maximum
(|y1 − r1|, ..., |ys − rs|)].

(a) Formulate the model in terms of a dynamic programming recursion.

(b) Apply your method to the data R = 4, s = 3 and r1 = .4, r2 = 2.4, and r3 = 1.2.
Discuss whether the solution seems reasonable, given the context of the problem.

43 Consider the problem of selecting yj for j = 1, 2, ..., s, so as to

maximize {minimum [R1(y1), ..., Rs(ys)]}

subject to

Σ_{j=1}^{s} Hj(yj) ≤ N

yj = 0, 1, 2, ... for each j,

where each Hj(yj) is a nondecreasing function, integer-valued for each yj = 0, 1, ...,
and satisfies Hj(0) = 0. Develop a dynamic programming recursion to solve the
problem.

44 Minimax Route in a Network. An exchange student from England, Red Bricker, has
purchased a not-so-new automobile and plans to spend his summer motoring from
New York to San Francisco. To keep the exposition simple, suppose he can choose a
route from the same map as did Mark Off, in Fig. 8.1. Red is concerned that if he
travels along a route between two cities where the temperature is very hot, the engine
in his automobile may explode. He estimates that cij is the highest temperature he will
encounter en route from State i to State j. He wants a routing that minimizes the
maximum temperature along the way.

(a) Formulate a dynamic programming approach, and apply it to the cij given in
Fig. 8.1.

(b) State the recursion in sufficient generality to apply to acyclic networks like those
considered in Sec. 7.7 [that is, express the recursion in a form analogous to (1)
and (2) in Sec. 7.7].
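
A recursion of the minimax type for an acyclic network can be sketched as follows. It rests on the observation that the worst temperature on a route from Node i equals the larger of the first arc's temperature and the worst temperature on the rest of the route. The network below is invented for illustration and is not the map of Fig. 8.1; the names `minimax_route` and `arcs` are hypothetical.

```python
from functools import lru_cache


def minimax_route(arcs, source, terminal):
    """Find a route in an acyclic network that minimizes the largest arc value.

    arcs maps each node to a list of (next_node, temperature) pairs.
    Returns (worst temperature, route as a tuple of nodes), or None when
    the terminal cannot be reached from the source.
    """
    @lru_cache(maxsize=None)
    def best(node):
        if node == terminal:
            # No arcs remain, so nothing can raise the maximum.
            return (float("-inf"), (terminal,))
        options = []
        for nxt, temp in arcs.get(node, []):
            rest = best(nxt)
            if rest is not None:
                options.append((max(temp, rest[0]), (node,) + rest[1]))
        return min(options) if options else None

    return best(source)


if __name__ == "__main__":
    # Hypothetical temperatures on a four-node network.
    arcs = {
        "NY": [("A", 90), ("B", 80)],
        "A": [("SF", 70)],
        "B": [("SF", 95)],
    }
    print(minimax_route(arcs, "NY", "SF"))
```

The only change from a shortest-route recursion is that the sum of the arc cost and the remaining cost is replaced by their maximum.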

45 Second-Best Route in Network. Consider the problem of finding a shortest route from a
source node to the terminal node, as discussed in Secs. 7.6 and 7.7. Devise a dynamic
programming formulation to find the second shortest route from the source to the
terminal. To keep the task unencumbered with details, assume that the network is
acyclic and the shortest route is unique. If you can think of more than one approach,
then indicate the alternatives. (Hints: a second shortest route must differ from the
shortest route in at least one arc. Alternatively, establish the validity and implication
of the observation that the second shortest route from Node j to the terminal must
consist of traveling along some arc (j, k) and then along either the shortest or second
shortest route from Node k to the terminal.) What complications arise in a cyclic
network?

46 Consider the problem of finding a shortest route from a source node to the terminal
node, as discussed in Secs. 7.6 and 7.7. Devise a dynamic programming formulation to
find a shortest route from the source to the terminal that involves exactly K arcs,
assuming that at least one such route exists. (Might such a route involve traveling
around a loop of arcs?)
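
One way to handle the arc-count requirement is to carry the number of arcs still to be traveled as a stage index. The sketch below is one possible formulation under that assumption, applied to an invented network that contains a loop; the names `shortest_with_k_arcs` and `arcs` are hypothetical.

```python
def shortest_with_k_arcs(arcs, source, terminal, K):
    """Length of a shortest route from source to terminal using exactly K arcs.

    arcs maps each node to a list of (next_node, length) pairs.
    Returns float("inf") when no route with exactly K arcs exists.
    """
    INF = float("inf")
    nodes = set(arcs) | {j for outs in arcs.values() for j, _ in outs}
    # Stage 0: with no arcs left to travel, only the terminal itself is acceptable.
    f = {i: (0 if i == terminal else INF) for i in nodes}
    for _ in range(K):
        # Stage k: take one arc (i, j), then a best route from j with k - 1 arcs.
        f = {
            i: min((c + f[j] for j, c in arcs.get(i, [])), default=INF)
            for i in nodes
        }
    return f[source]


if __name__ == "__main__":
    # Hypothetical network in which nodes a and b form a loop.
    arcs = {
        "s": [("a", 1), ("t", 10)],
        "a": [("b", 1), ("t", 1)],
        "b": [("a", 1)],
        "t": [],
    }
    for K in range(1, 5):
        print(K, shortest_with_k_arcs(arcs, "s", "t", K))
```

In this invented network the only route with exactly four arcs passes through Node a twice, which illustrates why a loop may be unavoidable when the arc count is fixed.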

47 Knapsack Problem. Consider the optimization problem described in exercise 22. Suppose
the largest ratio Ej/wj is for Item k. Prove that there is a value W*, such that
if the weight limit W > W*, then there is an optimal solution that includes at least one
unit of Item k.

48 Equipment Replacement and Overhaul. Consider the model in Sec. 10.6. Suppose you have 
the decision opportunities of overhauling as well as replacing the equipment each 
period after the initial procurement in Period 1 and prior to the salvage of the machine 
at the beginning of Period N. Assume that the length of time required to overhaul 
the machine is negligible in comparison with the length of the period; hence, a machine 
that is overhauled at the beginning of a period is still available for use during the 
period. Consider a machine that is bought at the beginning of Period n, overhauled at the
beginning of Periods t1 < t2 < ... < tp, where n < t1, and is replaced at the beginning
of Period k, where k > tp. Assume the associated cost is

an(n, t1) + an(t1, t2) + ... + an(tp−1, tp) + bnk(tp).

The quantity an(h, j), where n < j, represents the cost of operating a machine
during the Periods h through j, plus the expense of overhauling it at the beginning
of Period j, given that the machine was initially purchased in Period n and was last
overhauled in Period h (or was bought if h = n). The quantity bnk(tp) represents the
net total cost resulting from acquiring the machine in Period n, from salvaging it in
Period k, and from operating it during Periods tp through k between final overhaul and
sale. The quantity bnk(n) represents the cost of providing the machine during Periods n
through k when there are no overhauls.

Show how to use a dynamic programming recursion to calculate values for
that then can be appropriately employed in the recursion (2) of Sec. 10.6.

49 Consider a classical transportation problem like that in Sec. 6.2, where there are
m = 2 supply points, and total supply equals total demand (6). Suppose that the
shipping cost of sending xij units from Supply Point i to Demand Point j is given by
a nonlinear function Cij(xij). Assume all Si and Dj are integer-valued and restrict each
xij to be integer-valued.

(a) Formulate the optimization problem in terms of a dynamic programming
recursion. (Hint: you can use the property that total supply equals total demand to
yield a one-dimensional state variable.)

*(b) Show how to modify your formulation in part (a) for m = 3.

*50 Caterer Problem. Consider the case of the Cole Food Company, described in exercise 48
of Chap. 6. Formulate the optimization problem in terms of a dynamic programming
recursion.

*51 Trim Problem. Consider the case of the Fine-Webb Paper Company, described in 
exercise 25 of Chap. 2. Recall that a combination was defined as a way to slit a jumbo 
reel of width 70 inches into smaller reels of widths 22 inches, 20 inches, and 12 inches. 
Suppose in solving the linear programming model of part (b) that you include only
a subset of all the possible combinations. Let y1, y2, and y3 be the resultant optimal
values of the dual variables for the restrictions associated with the reels of widths 22
inches, 20 inches, and 12 inches, respectively. Use the Dual Theorem and a dynamic
programming recursion involving the yi to devise a test for whether the linear programming
solution obtained is truly optimal, or whether one of the combinations omitted
from the subset would improve the current linear programming solution.


CHAPTER 11


Decision-Making Over an Unbounded Horizon


11.1 MODELS WITH A LIMITLESS VISTA 

Unquestionably most, if not all, decision-making is part of an unending history 
of actions. Earlier choices have affected the present, current decisions will influence 
the future, and so on. In this light, all models must be viewed as imbedded in an 
unbounded horizon. Several of the dynamic models you have studied so far 
simply ignored the future beyond a designated horizon period, and sometimes 
(as in Chap. 9) a planning horizon theorem could be established to demonstrate 
that such a procedure might yield an optimal current decision. Other models 
attempted to account for the future by selecting certain “terminal” conditions 
(such as a specified minimum level of work force or productive capacity). In
contrast to these models, the illustrations in this chapter assume that the planning
horizon is limitless.

In order to derive any definite answers for models with an unbounded horizon, 
it is necessary to add a restrictive assumption: broadly, the hypothesis is termed an 
assumption of stationarity. In the simplest cases, you assume that all 
economic return functions, decision possibilities, and external phenomena (like 
demand requirements) are identical in every period. Each application below is of
this type, and you should keep in mind that the results are based on these stationarity
assumptions. Chapters 11 and 12 consider only deterministic outcomes. Probabilistically
determined outcomes will be treated in Chaps. 17 and 18, and there
stationarity of the probability laws is postulated. With advanced analysis you
can also solve situations with less stringent stationarity assumptions, including
cyclic phenomena such as seasonal demand (an example is given at the very end
of Chap. 12).

Since in reality stationarity rarely exists for an extended period of time, you well
may wonder whether models assuming stationarity over an unbounded horizon 
have much practical significance. They do. We will distinguish two types of 
application to show their importance. 

The first type pertains to situations in which dynamic optimization models 
are used to improve day-to-day operating decisions, such as replenishing inventory 
and scheduling production. For example, consider a firm that monitors its inventory
levels daily and reorders an item every few weeks when the stock level reaches
a critical point. Suppose, as is frequently true, that for at least twelve months the 
item’s demand rate and the ordering and holding cost functions are stable. Then 
it is reasonable for the firm to use the same inventory replenishment rule during 
three to six months, and at the end of that time to revise its inventory policy 
based on a new twelve-month forecast. (Another important consideration also 
justifies this mode of operation. If the firm stocks hundreds of different items, 
it would be too time-consuming and disruptive to recompute a new replenishment 
rule at frequent intervals for every item—the costs of doing so would far outweigh 
any efficiency savings from the improved decisions.) 

How, then, should the company set each specific replenishment rule? One possibility
is to use the methods of Chaps. 8 and 9: determine, for a horizon length
N = 365 days, the minimum cost $f_N(i)$ and optimal policy when entering inventory
is $i$, and for the first few months employ the replenishment policy associated with
N = 365. But given the stationarity assumptions, an unbounded horizon model
can provide equally excellent answers, and typically requires much less calculation.

Accordingly, one justification for employing a stationary, unbounded horizon
model is that in the context of daily operations, the approach is both an effective 
and relatively easy way to derive optimal decisions for the initial interval of time. 

The second type of application pertains to situations in which dynamic optimization
models are used to make recurring strategic investment decisions, as in the
replacement of expensive equipment. Large pieces of machinery may be replaced 
as seldom as every 15 or 20 years. Consequently, when the next replacement is 
necessary, completely new types of machinery are likely to be available. How, then, 
should the current investment decision be made? One possibility, of course, is 
simply to ignore the fact that the equipment eventually must be replaced. This 
can be misleading and hazardous, as witness the following illustration, based on an 
actual case. 

A food processing company facing production bottlenecks realized it could 
alleviate its problem by either prestocking inventory during the slack season for 
sales during the peak season, or by purchasing new equipment to expand its production
capacity. When the ever-recurring cost associated with increased early-season
inventory was compared with the initial cost of the added equipment, it
seemed preferable to purchase the machinery. However, as soon as the equipment
decision was analyzed to take account of ever-recurring future replacements,
it turned out to be far the less attractive alternative, even with optimistic projections
of subsequent replacement costs.

Therefore, a second justification for employing a stationary, unbounded horizon
model is that in situations of repeated investment it can lead to better current
decisions than those arrived at by ignoring the future.

Optimal stationary policies. Assuming stationarity, you can safely 
intuit the meaning of “making a current decision in the face of an unbounded 
horizon.” What may surprise you is that your intuition is of limited help in fathoming
how to make an optimal decision.

To illustrate, consider first a finite horizon dynamic programming model. 
At any period you need to know only the state of the system and the number of 
stages remaining. Optimality of any strategy is judged according to the sum of a 
finite stream of returns. Now let the horizon be unbounded, so by definition the 
“number of periods remaining” is always the same, and any strategy employed over 
the horizon results in an unending stream of returns. Since for every strategy this 
stream may grow without bound as the horizon lengthens, you need a way to 
compare the strategies. Of course, if one strategy accumulates more returns than 
another for every horizon length, no problem of comparison occurs. But typically 
one strategy looks better for certain finite horizons and worse for others, so the 
resolution is by no means obvious. Thus a limitless vista raises two pertinent 
questions about determining an optimal solution.

(i) What criterion is appropriate for judging the relative desirability of different
infinite streams of returns?

(ii) Is it optimal in a limitless vista to consider only stationary strategies, that 
is, ones depending solely on the current state of the system? 

This chapter focuses on answering (i). You will critically examine several frequently
used criteria for evaluating infinite streams. You also will see how to
apply these criteria to a simple but important regeneration model. In studying the 
solution of this model, you will discover several numerical methods of successive 
approximation that can be applied to more general problems of optimization in an 
unbounded horizon. 

Chapter 12 will consider a large class of models in which the answer to (ii) 
is affirmative, that is, where you can always find a stationary strategy that is optimal.
Such policies require that you know only the current state of the system and
not the historical sequence of events leading to this state. Thus you make the same 
decision each time you return to the same state. Specifically, in Chap. 12 you 
will study the techniques for obtaining optimal stationary strategies. Such policies 
that can be made available in the form of rules or tables provide the key to a 
workable dynamic programming application for operating decisions. 

11.2 SUBTLETIES OF INFINITE STREAMS 

You may be surprised to learn that businessmen, economists, and mathematicians
have argued for centuries over how to assess infinite streams of
returns. You will easily see why in the examples below, and you should try
to articulate the economic insights and managerial significance of these illustrations.

Experience has shown that most decision-makers cannot intuitively make
consistent judgments about the relative desirability of different infinite streams 
of returns. As a consequence, most businessmen turn to formulas for providing 
at least a preliminary screening or ranking of decision alternatives. We too will 
apply formulas that convert an infinite stream of returns into a single number so 
as to indicate the relative merit of the associated alternative. But before doing 
so, we want to make sure that you see some of the substantive issues involved in 
different methods for choosing among infinite streams. Only with this knowledge 
in mind can you appreciate both the strengths and limitations of the simple-to-apply
formulas.

The discussion to follow is not intended to provide an exhaustive survey, along 
with suggested solutions, of the conundrums of infinite streams. Hopefully, the 
examples will make you cautious about accepting uncritically what seem to be 
alluringly simple-sounding solutions. 

This section investigates two questions that are central to optimization in a 
dynamic setting: 

(i) When is an evaluation formula appropriate for comparing different 
strategies? 

(ii) Does such a formula always reduce an infinite stream to a single number that 
can be used as the basis for comparison? 

These points are treated in considerable detail below, since the associated problems 
can be quite subtle. 

We investigate three criteria of merit. The first is average return per period. 
Actually, this criterion arises most often when the economic measure is cost. Then 
the selection rule recommended is to choose an alternative having the least 
average cost per period. The second criterion is present discounted value. As you 
will soon see, these two criteria do not always select the same alternatives, and 
occasionally give rise to some nasty technical problems. 

The third criterion is called equivalent average return. The idea is probably new to
you, and is important in operations research models because it provides the mathematical
connection between the other two criteria. Often you can derive the form
of an optimal policy using this criterion; and then with this result you can calculate
specific numerical solutions for either the average return per period or the present
value criterion.

The “internal rate of return” is another criterion that, unfortunately, is used 
often in business practice. It is not discussed in this text because the approach 
frequently leads to incorrect decisions. 

As you study the numerical examples below, you will find that most of the algebraic
manipulations have been performed for you. At first reading, take it for
granted that the formulas are correct; the important insights have nothing to do
with the underlying algebra. Afterwards, you may want to go back to see whether
you can derive the formulas.


Utility of money. You often hear it said that a dollar today is worth 
more than a dollar a year from today. Why? This maxim is based on several 
considerations. The decision-maker may find the sheer utility or personal worth 
of a current dollar is greater now than later. For example, consider a company 
that has paid its stockholders a regular quarterly dividend for 25 years. The 
firm may be very reluctant to forgo paying out a current dividend, even if it can 
promise to pay it eventually, perhaps with interest added. This difference in 
the utility of money at distinct points in time is the heart of the problem of 
making commensurate several unending streams of returns. 

Actually, the same comparability problem exists in finite horizon models. In 
the dynamic programming examples of the previous chapters, the different policies 
certainly led to different time streams of profits or costs. But the previous discussion 
simply ignored the difficulty of comparability. Since it was always possible to sum 
the profits or costs over the bounded horizon, the resultant criterion function 
attached a unique finite number to each policy and made optimization straightforward.
Here you can no longer ignore the comparability difficulty, even if you
want to. Since the sum is now over an unbounded horizon, the total returns are
infinite for most strategies. As you might imagine, any naive approach for comparing
several infinite streams of returns can succeed in only the simplest of cases.
To illustrate, comparing policies in one way or another for each and every finite
horizon length does not always work. The following example shows why.

Suppose you must choose one of the profitable alternatives that are described
in Fig. 11.1 by the sequence of returns (= profits) for each period, starting with the
present. At the current period you receive a profit of 3 from A and B, 4 from G,
etc. In the next period, you obtain a profit of 2 from A, a profit of only 1 from B,
etc.

Figure 11.1. Illustration of Infinite Streams of Returns.

It is reasonable that you would rule out F right away, since you can do strictly 
better each period with A. In other words, A dominates F. It is also plausible to
argue you could just as well eliminate E, since D returns a greater cumulative 
profit at any period. 

But you cannot eliminate G. Although its cumulative return after Period 2 is 
not as good as that of A and B, its profit in Period 1 is strictly the best among all 
policies. If you require as large a return as possible in the current period, then G 
is the optimal choice. 

How would you select among A, B, C, and D? In the second period, C looks
most attractive. For every even-numbered period, D gives a better cumulative
return than B. In the third period, B provides a cumulative return of 7 as against
6 for A and C, and $6\tfrac{2}{3}$ for D. What is more, B is “ahead” of A, C, and D at every
Period 3 + 6n, for n = 0, 1, 2, . . . . Choosing among streams such as these is
more the rule than the exception when models with unbounded horizons are considered.
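The cumulative comparisons above can be checked by brute force. The following is a minimal sketch, assuming the streams are the ones implied by the text and by the present-value series (6) through (12): A repeats 3, 2, 1; B repeats 3, 1; C repeats 1, 6, -1; D pays 2 2/3 once and 2 thereafter; G repeats 4, 0, 0. The names `CYCLES`, `returns`, and `cumulative` are illustrative only.

```python
from fractions import Fraction
from itertools import accumulate, cycle, islice

# One cycle of each periodic stream (assumed from the text, not copied from Fig. 11.1).
CYCLES = {"A": [3, 2, 1], "B": [3, 1], "C": [1, 6, -1], "G": [4, 0, 0]}


def returns(policy, periods):
    """First `periods` returns of a policy, as exact fractions."""
    if policy == "D":  # 2 2/3 in the first period, then 2 in every later period
        return [Fraction(8, 3)] + [Fraction(2)] * (periods - 1)
    return [Fraction(r) for r in islice(cycle(CYCLES[policy]), periods)]


def cumulative(policy, periods):
    """Cumulative return after each of the first `periods` periods."""
    return list(accumulate(returns(policy, periods)))


cum = {p: cumulative(p, 12) for p in "ABCDG"}
for period in (2, 3, 9):
    print(period, {p: str(cum[p][period - 1]) for p in "ABCDG"})
```

No single stream leads at every horizon: C is ahead at Period 2, B at Periods 3 and 9, and D stays ahead of B at every even-numbered period.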

Average return. You must make further assumptions in order to state
that either A, B, C, D, or G is best. For example, you could make the additional
postulate that a unit of return received in any period is just as good as a unit
received in any other period. “Just as good” means there is no benefit of any sort
in having the return earlier instead of later. How would this assumption resolve the
problem?

It is reasonable now to look at the average return per period, letting
the number of periods grow without limit, and prefer the alternative with the
largest average. For A, you would compute 3/1, (3 + 2)/2, (3 + 2 + 1)/3, . . . ;
for B, you would similarly calculate 3/1, (3 + 1)/2, (3 + 1 + 3)/3, . . . . These
calculations are summarized in Fig. 11.2. As we show in the advanced material
below, the average profit per period tends toward 2 for A, B, C, and D, and
toward $1\tfrac{1}{3}$ for G. In other words, if you let the number of periods be large enough,
the average will be arbitrarily close to 2 for A, B, C, and D, and to $1\tfrac{1}{3}$ for G.
Therefore if you make the special added assumption that you have no time preference
for returns, the policies A, B, C, and D look equally attractive, even though
they are not equally good for each and every finite horizon, and Policy G looks
inferior.

Period    1       2       3       4       5       6       7       8       9      . . .
A         3       2 1/2   2       2 1/4   2 1/5   2       2 1/7   2 1/8   2      . . .
B         3       2       2 1/3   2       2 1/5   2       2 1/7   2       2 1/9  . . .
C         1       3 1/2   2       1 3/4   2 3/5   2       1 6/7   2 3/8   2      . . .
D         2 2/3   2 1/3   2 2/9   2 1/6   2 2/15  2 1/9   2 2/21  2 1/12  2 2/27 . . .
G         4       2       1 1/3   2       1 3/5   1 1/3   1 5/7   1 1/2   1 1/3  . . .

Figure 11.2. Average Return per Period.
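The limiting averages can also be observed numerically. This is a small sketch, again assuming the streams A, B, C, D, and G are the ones implied by the text (cycles 3, 2, 1; 3, 1; 1, 6, -1; the stream 2 2/3, 2, 2, ...; and the cycle 4, 0, 0); the function names are hypothetical.

```python
from fractions import Fraction

# Assumed streams: one cycle per policy; D is 2 2/3 followed by 2 forever.
CYCLES = {"A": (3, 2, 1), "B": (3, 1), "C": (1, 6, -1), "G": (4, 0, 0)}


def period_return(policy, n):
    """Return received in period n (n = 1, 2, ...)."""
    if policy == "D":
        return Fraction(8, 3) if n == 1 else Fraction(2)
    cyc = CYCLES[policy]
    return Fraction(cyc[(n - 1) % len(cyc)])


def average_return(policy, horizon):
    """Average return per period over the first `horizon` periods."""
    total = sum(period_return(policy, n) for n in range(1, horizon + 1))
    return total / horizon


for horizon in (1, 2, 3, 9, 3000):
    print(horizon, {p: float(average_return(p, horizon)) for p in "ABCDG"})
```

Exact fractions are used so that the short-horizon values can be compared directly with hand calculations; the long-horizon row shows the averages settling near 2 and 4/3.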
► As you can verify, the general terms in the sequences for the average return per period
are

(i) A: $\dfrac{3 + 6n}{1 + 3n},\ \dfrac{5 + 6n}{2 + 3n},\ \dfrac{6 + 6n}{3 + 3n}$ for $n = 0, 1, 2, \ldots$

(ii) B: $\dfrac{3 + 4n}{1 + 2n},\ \dfrac{4 + 4n}{2 + 2n}$ for $n = 0, 1, 2, \ldots$

(iii) C: $\dfrac{1 + 6n}{1 + 3n},\ \dfrac{7 + 6n}{2 + 3n},\ \dfrac{6 + 6n}{3 + 3n}$ for $n = 0, 1, 2, \ldots$

(iv) D: $\dfrac{\tfrac{2}{3} + 2n}{n}$ for $n = 1, 2, 3, \ldots$

(v) G: $\dfrac{4 + 4n}{1 + 3n},\ \dfrac{4 + 4n}{2 + 3n},\ \dfrac{4 + 4n}{3 + 3n}$ for $n = 0, 1, 2, \ldots$

Thus when $n \to \infty$, each term approaches 2 for A, B, C, and D, and $1\tfrac{1}{3}$ for G. ◄


The most obvious drawback of using average return per period as a selection 
criterion is its complete insensitivity to the level of returns over a finite number 
of periods. To illustrate, suppose you can select a policy that has returns identical 
to those in Policy A except that in Period 1 the return from the alternative policy 
is 100. Using solely the criterion of average return per period, you would judge 
that Policy A and the alternative policy are equally desirable, since over an
unbounded horizon, the first-period advantage of the alternative policy is rendered 
inconsequential. There are other limitations to this criterion as well, which we 
explore next. 

Assuming that you do want to employ the criterion, can you always be sure that 
a given stream will have a well-defined average return per period, as the number 
of periods grows without limit? For example, suppose the two streams for A and B 
are modified to have a multiplicative trend: 

(1)  Policy A*: 3, 2, 1, 3, 2, 1, 6, 4, 2, 6, 4, 2, 9, 6, 3, 9, 6, 3, . . .

     Policy B*: 3, 1, 3, 1, 3, 1, 6, 2, 6, 2, 6, 2, 9, 3, 9, 3, 9, 3, . . .

Although you may reason by analogy that the two streams should remain equally 
desirable, the rule of looking at average profit per period falters, since the averages 
grow beyond bound as the number of periods grows without limit. 
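To see the averages growing, the trended streams can be generated and averaged directly. This is a sketch under the assumption that each six-period block of A* and B* is the basic cycle multiplied by the block number (1, 2, 3, ...), as the listed terms suggest; `trended_stream` and `average_return` are hypothetical helper names.

```python
def trended_stream(cycle_values, repeats, periods):
    """Stream whose k-th block (k = 1, 2, ...) is the basic cycle, repeated
    `repeats` times and multiplied by k."""
    block = list(cycle_values) * repeats
    return [(n // len(block) + 1) * block[n % len(block)] for n in range(periods)]


def average_return(stream):
    """Average return per period over the whole (finite) stream."""
    return sum(stream) / len(stream)


a_star = trended_stream((3, 2, 1), repeats=2, periods=18)
b_star = trended_stream((3, 1), repeats=3, periods=18)
print(a_star)
print(b_star)

# Average at the end of m complete six-period blocks, for growing m.
for m in (1, 10, 100, 1000):
    print(m, average_return(trended_stream((3, 1), 3, 6 * m)))
```

Each block of either stream sums to 12 times its block number, so at the end of m complete blocks both averages equal m + 1, which has no finite limit.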

Therefore, if you want to rely on an average-per-period criterion for a measure 
of optimality, you must also assume that there exists a unique finite limiting average 
for the particular return streams you are comparing. In many applications this 
assumption is reasonable. 

Discounted return. An alternative approach for making different
infinite streams commensurate is to deal with the so-called present discounted
value (or present worth) of the returns.

If the stream of returns is

(2) $R_1, R_2, \ldots, R_n, \ldots,$

its merit should be judged, according to the present discounted value criterion,
in terms of the sum:

(3) Present Value $= R_1 + \alpha R_2 + \alpha^2 R_3 + \cdots + \alpha^{n-1} R_n + \cdots = \sum_{n=1}^{\infty} \alpha^{n-1} R_n,$

where i% is the interest rate per period and $\alpha = [1 + (i/100)]^{-1}$ is the single-period
discount factor. The higher the interest rate i, the smaller the value
of $\alpha$. The interest rate relevant for a firm’s decision making is an important subject
in its own right and is a lively topic of concern among scholars and practitioners
in finance. We assume here that a firm can specify the appropriate value
of i.

There is an environmental assumption you can make to justify the approach. 
Suppose you can borrow or lend as much money as you desire and whenever you 
want at a fixed compound rate of interest i% per period. (Of course, 
you are eventually required to pay any debts you incur.) For example, let the 
annual rate be 5%. Then if you borrow a dollar today, you must pay back either
$(1 + .05)$ dollars a year from today, or $(1 + .05)^2$ dollars two years from today,
or $(1 + .05)^n$ dollars n years from today. By the same token, a dollar received n
years from today is really only worth $(1 + .05)^{-n}$ dollars right now. If you presently
had $(1 + .05)^{-n}$ and lent it at 5% interest compounded, you would be paid
back a dollar n years from today.
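The round trip described in this paragraph is easy to reproduce. The sketch below assumes only the definition of the single-period discount factor given with formula (3); the function names are hypothetical.

```python
def discount_factor(interest_percent):
    """Single-period discount factor for an interest rate of i% per period."""
    return 1.0 / (1.0 + interest_percent / 100.0)


def value_today(amount, interest_percent, periods):
    """Worth today of `amount` received `periods` periods from now."""
    return amount * discount_factor(interest_percent) ** periods


def value_later(amount, interest_percent, periods):
    """What `amount` lent today grows to after `periods` periods of compounding."""
    return amount * (1.0 + interest_percent / 100.0) ** periods


worth_now = value_today(1.0, 5, 10)               # a dollar due in 10 years at 5%
print(round(worth_now, 3))
print(round(value_later(worth_now, 5, 10), 6))    # lending it out returns the dollar
```

Discounting and compounding are inverse operations, which is why the second printed value recovers the original dollar.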

Here is why this environmental assumption justifies employing present value. 
Consider the choice between two policies with different present values. For the 
moment, suppose you selected the policy with the smaller present value. Because 
your utility for money may differ from period to period, you may want to borrow 
and lend in various periods to redistribute the returns. 

For example, you may wish to have the benefit of $R_2$ right now rather than
wait a period. Consequently you can borrow $\alpha R_2$ at present, and then pay back
$R_2$ when it becomes available. Similarly, you may want to lend for several periods,
and later receive the payment with compound interest earned. As you think about 
it, you will see that the value of the entire stream really is summarized by the 
number representing how much you could obtain at present by committing the 
entire proceeds of the stream for repayment. 

Now suppose instead you selected the policy with the larger present value. By 
assumption, you can borrow and lend, committing the resources of this stream, 
so as to attain the same benefits you desired with the other alternative, and in so 
doing you would have some additional value left over. In other words, any pattern
you can obtain with the smaller present-value policy you can also obtain with the 
larger. The difference between the two values is a net benefit. To sum up, you do 
best by selecting a strategy that gives maximum present value, regardless of your 
personal time preference for money. 

Of course, rarely if ever is the environmental assumption about borrowing and
lending exactly satisfied, but it is often a fair enough approximation to provide
adequate answers. Other kinds of arguments can be advanced in support of a
present-value criterion. For example, in practical applications, a feature commending
formula (3) is that returns in the distant future are weighted by a small factor,
and consequently have less impact on the decision. You can see in Fig. 11.3 how
the values of $\alpha^n$ drop rapidly as the interest rate and n increase. But remember,
present value or any other formula can be justified only by making particular
assumptions regarding the decision-maker’s time value of money.

  n \ i%      5       10      20
    1       .952    .909    .833
    5       .783    .621    .402
   10       .614    .385    .161
   15       .481    .239    .065
   20       .377    .149    .026
   40       .142    .022    .001

Figure 11.3. Discount Factors.

Now you must examine the present-value formula to see whether any additional
assumptions have to be imposed for it to be a workable criterion. Start by checking
whether the sum of an infinite number of terms in (3) always yields a finite value.
To begin, suppose all the returns are identical:

(4) Present Value $= R + \alpha R + \alpha^2 R + \alpha^3 R + \cdots$

(5) $= \dfrac{R}{1 - \alpha}$ for $0 < \alpha < 1$.

The coefficients in (4) are simply a geometric series giving the value in (5). Notice
the restriction $\alpha < 1$. If $\alpha$ is close to 1, then the present value is a large number,
but it is finite. If $\alpha = 1$, then the sum in (4) is unbounded for $R \neq 0$, and (5) is
ill defined. This case will be discussed later on.
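For readers who want the step behind (5), here is the standard partial-sum argument. The symbol $S_N$ is a label introduced here (it does not appear in the text) for the sum of the first N terms of (4).

```latex
\begin{align*}
S_N &= R + \alpha R + \alpha^2 R + \cdots + \alpha^{N-1} R, \\
\alpha S_N &= \alpha R + \alpha^2 R + \cdots + \alpha^{N} R, \\
S_N - \alpha S_N &= R - \alpha^{N} R
  \quad\Longrightarrow\quad
  S_N = \frac{R\,(1 - \alpha^{N})}{1 - \alpha} \qquad (\alpha \neq 1), \\
\lim_{N \to \infty} S_N &= \frac{R}{1 - \alpha} \qquad \text{for } 0 \le \alpha < 1 .
\end{align*}
```

When $\alpha = 1$ the partial sum is simply $S_N = NR$, which is unbounded for $R \neq 0$.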


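Next evaluate Policies A, B, C, D, and G in Fig. 11.1: for $0 < \alpha < 1$,

(6) Policy A: $3 + 2\alpha + 1\alpha^2 + 3\alpha^3 + 2\alpha^4 + 1\alpha^5 + \cdots$

(7) $= (3 + 2\alpha + 1\alpha^2)(1 + \alpha^3 + \alpha^6 + \cdots) = \dfrac{3 + 2\alpha + 1\alpha^2}{1 - \alpha^3}$

(8) Policy B: $3 + 1\alpha + 3\alpha^2 + 1\alpha^3 + 3\alpha^4 + 1\alpha^5 + \cdots$

(9) $= (3 + 1\alpha)(1 + \alpha^2 + \alpha^4 + \cdots) = \dfrac{3 + 1\alpha}{1 - \alpha^2}$

(10) Policy C: $1 + 6\alpha - 1\alpha^2 + 1\alpha^3 + 6\alpha^4 - 1\alpha^5 + \cdots = (1 + 6\alpha - 1\alpha^2)(1 + \alpha^3 + \alpha^6 + \cdots) = \dfrac{1 + 6\alpha - 1\alpha^2}{1 - \alpha^3}$

(11) Policy D: $2\tfrac{2}{3} + 2\alpha + 2\alpha^2 + \cdots = 2\tfrac{2}{3} + \dfrac{2\alpha}{1 - \alpha}$

(12) Policy G: $4 + 4\alpha^3 + 4\alpha^6 + \cdots = \dfrac{4}{1 - \alpha^3}$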
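The closed forms for Policies A, B, C, D, and G can be sanity-checked against long truncated sums of formula (3). This is a minimal sketch, assuming the streams are the ones the series in (6) through (12) spell out; `returns`, `present_value`, and `CLOSED_FORM` are hypothetical names.

```python
from itertools import cycle, islice

# One cycle of each periodic stream; D is 2 2/3 followed by 2 forever (assumed).
CYCLES = {"A": (3, 2, 1), "B": (3, 1), "C": (1, 6, -1), "G": (4, 0, 0)}


def returns(policy, periods):
    """First `periods` returns of a policy."""
    if policy == "D":
        return [8 / 3] + [2.0] * (periods - 1)
    return list(islice(cycle(CYCLES[policy]), periods))


def present_value(stream, alpha):
    """Sum of alpha**(n-1) * R_n over the given (finite) stream."""
    return sum(r * alpha**k for k, r in enumerate(stream))


CLOSED_FORM = {
    "A": lambda a: (3 + 2 * a + a**2) / (1 - a**3),
    "B": lambda a: (3 + a) / (1 - a**2),
    "C": lambda a: (1 + 6 * a - a**2) / (1 - a**3),
    "D": lambda a: 8 / 3 + 2 * a / (1 - a),
    "G": lambda a: 4 / (1 - a**3),
}

alpha = 0.9
for policy in "ABCDG":
    truncated = present_value(returns(policy, 2000), alpha)
    print(policy, round(truncated, 6), round(CLOSED_FORM[policy](alpha), 6))
```

With 2000 periods the neglected tail is far below floating-point precision for the discount factors used here, so the two columns should agree to the printed digits.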



To compare A and B you can look at the difference between their present values:

(13) $\text{P.V.}[A] - \text{P.V.}[B] = \dfrac{3 + 2\alpha + 1\alpha^2}{1 - \alpha^3} - \dfrac{3 + 1\alpha}{1 - \alpha^2} = \dfrac{\alpha}{(1 + \alpha)(1 + \alpha + \alpha^2)}$ for $0 < \alpha < 1$.

Thus, even though B has a greater cumulative return than A in Period 3 + 6n,
for n = 0, 1, 2, . . . , the discounted value of A is larger for all $0 < \alpha < 1$. Consequently,
the present-worth criterion rules out B. Use the same procedure to
verify that
(14) $\text{P.V.}[A] - \text{P.V.}[C] = \dfrac{2(1 - \alpha)}{1 + \alpha + \alpha^2}$

(15) $\text{P.V.}[A] - \text{P.V.}[D] = \dfrac{(1 - \alpha)(1 + 2\alpha)}{3(1 + \alpha + \alpha^2)}$

What do you conclude about the relative merit of A versus C and D?

If you compare B and D, you obtain

(16) $\text{P.V.}[B] - \text{P.V.}[D] = -2\tfrac{2}{3} + \dfrac{3 + 2\alpha}{1 + \alpha} = \dfrac{1 - 2\alpha}{3(1 + \alpha)}.$

Verify that B is more advantageous when $\alpha < \tfrac{1}{2}$ and D is better when $\alpha > \tfrac{1}{2}$.
Consider the situation when $\alpha = \tfrac{1}{2}$. Then check that according to (16), the two
policies are equally good. As we show in the advanced material below, for any
finite horizon n, the present value of Policy B is strictly better than that for Policy
D if n is odd (1, 3, 5, . . .), and the reverse is true if n is even (2, 4, 6, . . .). Therefore
you would be indifferent to these policies only when the horizon is unbounded.
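The alternation between B and D at a discount factor of one half can be watched horizon by horizon. This is a sketch with exact fractions, assuming B is the stream 3, 1, 3, 1, ... and D is 2 2/3, 2, 2, ...; `pv_b` and `pv_d` are hypothetical names.

```python
from fractions import Fraction


def pv_b(horizon, alpha):
    """Present value of B (3, 1, 3, 1, ...) over the first `horizon` periods."""
    return sum((3 if k % 2 == 0 else 1) * alpha**k for k in range(horizon))


def pv_d(horizon, alpha):
    """Present value of D (2 2/3, 2, 2, ...) over the first `horizon` periods."""
    return sum((Fraction(8, 3) if k == 0 else 2) * alpha**k for k in range(horizon))


alpha = Fraction(1, 2)
for horizon in range(1, 9):
    diff = pv_b(horizon, alpha) - pv_d(horizon, alpha)
    print(horizon, diff, "B ahead" if diff > 0 else "D ahead")
```

The sign of the difference flips with the parity of the horizon while its size shrinks geometrically, which is why the two policies tie only in the unbounded case.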

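► Let n = 2t, where t = 0, 1, 2, . . . . Then the present values of Policies B and D for
the finite horizons n (even) and n + 1 (odd) are

(i) $\text{P.V.}[B \mid 2t] = (3 + 1\alpha)\left(\dfrac{1 - \alpha^{2t}}{1 - \alpha^2}\right) \qquad \text{P.V.}[D \mid 2t] = 2\tfrac{2}{3} + 2\alpha\left(\dfrac{1 - \alpha^{2t-1}}{1 - \alpha}\right)$

for t = 1, 2, 3, . . . ,

(ii) $\text{P.V.}[B \mid 2t + 1] = \text{P.V.}[B \mid 2t] + 3\alpha^{2t} \qquad \text{P.V.}[D \mid 2t + 1] = \text{P.V.}[D \mid 2t] + 2\alpha^{2t}$

for t = 0, 1, 2, . . . .

Consequently, for $\alpha = \tfrac{1}{2}$ the comparison is

(iii) $\text{P.V.}[B \mid 2t] - \text{P.V.}[D \mid 2t] = -\tfrac{2}{3}\left(\tfrac{1}{2}\right)^{2t} < 0$

(iv) $\text{P.V.}[B \mid 2t + 1] - \text{P.V.}[D \mid 2t + 1] = \tfrac{1}{3}\left(\tfrac{1}{2}\right)^{2t} > 0.$ ◄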


When you compare A and G you find

(17) $\text{P.V.}[A] - \text{P.V.}[G] = \dfrac{3 + 2\alpha + 1\alpha^2}{1 - \alpha^3} - \dfrac{4}{1 - \alpha^3} = \dfrac{\alpha^2 + 2\alpha - 1}{1 - \alpha^3} > 0$

for $\alpha > \sqrt{2} - 1 \approx .414$.

Thus A is better only if $\alpha > \sqrt{2} - 1$, otherwise G is preferred. If the interest
rate is very high, so that $\alpha$ is correspondingly small, then receiving a return of 4
in the first period of Policy G outweighs the later gains available from Policy A.
In general, as you let $\alpha$ become small, the early returns are the most important,
and when $\alpha = 0$ in the limit, all that matters is the first-period return $R_1$.

So far you have seen that using a present-value criterion may occasionally
distinguish two streams in a surprising way. But the approach has always given
a definite answer, because all the summations in the present-value formula (3)
yielded a finite number. Was this merely the result of a felicitous selection of return
streams? The answer is yes.

The kinds of difficulties occurring with the average-return criterion have their
counterparts for present worth. You can see such examples in the advanced material
below. On the other hand, troublesome cases for the average-return criterion
may not cause difficulties for a discounted stream. Consider the example of the
upward trending return stream Policy B* in (1), which did not have a finite
average return per period. The present-value calculation can be shown to give

(18) Policy B*: $(3 + 1\alpha + 3\alpha^2 + 1\alpha^3 + 3\alpha^4 + 1\alpha^5)[1 + 2(\alpha^6) + 3(\alpha^6)^2 + \cdots]$

(19) $= (3 + 1\alpha)(1 + \alpha^2 + \alpha^4)\left(\dfrac{1}{1 - \alpha^6}\right)^2,$

which is finite for $0 \le \alpha < 1$. Verify that the present value for Policy A* in (1)
is also finite.
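A numerical check makes the finiteness concrete. The sketch below assumes the six-period blocks of A* and B* are scaled by the block number and uses the identity $1 + 2x + 3x^2 + \cdots = 1/(1 - x)^2$ with $x = \alpha^6$; the closed form for A* is derived here by the same argument and is not taken from the text. All function names are hypothetical.

```python
def trended_stream(cycle_values, repeats, periods):
    """Stream whose k-th block is the basic cycle, repeated `repeats` times, times k."""
    block = list(cycle_values) * repeats
    return [(n // len(block) + 1) * block[n % len(block)] for n in range(periods)]


def present_value(stream, alpha):
    """Sum of alpha**(n-1) * R_n over the given (finite) stream."""
    return sum(r * alpha**k for k, r in enumerate(stream))


def pv_b_star(alpha):
    """Closed form that the series for Policy B* sums to."""
    return (3 + alpha) * (1 + alpha**2 + alpha**4) / (1 - alpha**6) ** 2


def pv_a_star(alpha):
    """Analogous closed form for A* (derived here; an assumption, not from the text)."""
    return (3 + 2 * alpha + alpha**2) * (1 + alpha**3) / (1 - alpha**6) ** 2


alpha = 0.9
print(present_value(trended_stream((3, 1), 3, 3000), alpha), pv_b_star(alpha))
print(present_value(trended_stream((3, 2, 1), 2, 3000), alpha), pv_a_star(alpha))
```

Although the returns grow linearly from block to block, the discount weights shrink geometrically, so the truncated sums settle on the closed-form values.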


Equivalent average return. Before concluding the discussion on how 
to attach a value to an infinite stream of returns, you need to study one other 
approach that relates the two notions of average and discounted values. 

The idea is to construct an infinite stream of returns that has the same present 
value as the original stream. The return (before discounting) in each period will 
be identical, so that this constant value can be interpreted as the equivalent average 
return of the stream. Specifically, suppose $P(\alpha)$ is the present value of Policy X for
a specified value of $\alpha$. Then consider a new stream of returns

(20) $R_n = (1 - \alpha)P(\alpha)$ for all n.

For this stream

(21) $R_1 + R_2\alpha + R_3\alpha^2 + \cdots = (1 - \alpha)P(\alpha)(1 + \alpha + \alpha^2 + \cdots) = P(\alpha).$

The stream in (20) has the same present value as Policy X, then, and $(1 - \alpha)P(\alpha)$
is the equivalent average return. For a fixed value of $\alpha$, $0 \le \alpha < 1$, a criterion of
best equivalent average return always leads to the same decision as does best
present value because the equivalent average returns are simply the present values
of all the alternatives multiplied by the same constant, $(1 - \alpha)$.
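The two properties just stated can be illustrated numerically: a constant stream paying the equivalent average return has the original present value, and scaling by the common factor leaves the ranking unchanged. This is a sketch assuming the closed-form present values of Policies A and B given earlier; the function names are hypothetical.

```python
def present_value(stream, alpha):
    """Sum of alpha**(n-1) * R_n over the given (finite) stream."""
    return sum(r * alpha**k for k, r in enumerate(stream))


def equivalent_average_return(pv, alpha):
    """Constant per-period return whose infinite stream has present value `pv`."""
    return (1 - alpha) * pv


alpha = 0.9
pv_a = (3 + 2 * alpha + alpha**2) / (1 - alpha**3)   # Policy A
pv_b = (3 + alpha) / (1 - alpha**2)                   # Policy B

ear_a = equivalent_average_return(pv_a, alpha)
ear_b = equivalent_average_return(pv_b, alpha)

# A long constant stream at the equivalent average reproduces the present value.
print(pv_a, present_value([ear_a] * 2000, alpha))
# The ranking by equivalent average return matches the ranking by present value.
print(pv_a > pv_b, ear_a > ear_b)
```

The constant stream is truncated at 2000 periods, which is long enough at this discount factor for the neglected tail to be negligible.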

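Applying the idea to the earlier examples, you obtain the equivalent average
returns:

(22)
Policy A: $\dfrac{(1 - \alpha)(3 + 2\alpha + 1\alpha^2)}{1 - \alpha^3} = \dfrac{3 + 2\alpha + 1\alpha^2}{1 + \alpha + \alpha^2}$

Policy B: $\dfrac{(1 - \alpha)(3 + 1\alpha)}{1 - \alpha^2} = \dfrac{3 + 1\alpha}{1 + \alpha}$

Policy C: $\dfrac{(1 - \alpha)(1 + 6\alpha - 1\alpha^2)}{1 - \alpha^3} = \dfrac{1 + 6\alpha - 1\alpha^2}{1 + \alpha + \alpha^2}$

Policy D: $2\tfrac{2}{3}(1 - \alpha) + 2\alpha$

Policy G: $\dfrac{4(1 - \alpha)}{1 - \alpha^3} = \dfrac{4}{1 + \alpha + \alpha^2}$

The significant point is that whenever average return per period is well defined, you
always obtain it by letting $\alpha$ converge from below to 1 in the formula for equivalent average
return.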
Thus letting $\alpha = 1$ in (22) yields

Policy A: 2    Policy B: 2    Policy C: 2    Policy D: 2    Policy G: $1\tfrac{1}{3}$

which are the average values for A, B, C, D, and G obtained previously. Sometimes
equivalent average return is well defined for $\alpha = 1$ when average return per period
is not well defined. (You can see such an example for Policy H in the advanced
material below.) Unfortunately, equivalent average return is not always well
defined for $\alpha = 1$; an example is Policy B*.

The criterion of equivalent average return ranks A, B, C, and D as being equally
desirable when $\alpha = 1$. Are they? This is a question of personal opinion and not
scientific fact; the decision-maker alone must provide the answer. A strong case
can be made, however, for saying that for $\alpha = 1$, A is optimal, C and D are
nearly optimal, and B should be eliminated. From (13), (14), and (15),
you can conclude that for $\alpha$ close to 1, the present value of A is greater than that
for B, C, and D. For $\alpha = 1$ in (14) and (15), the difference between the present
values is zero, so you might say that C and D are almost as good. But for B, when
$\alpha = 1$ in (13), the difference is $\tfrac{1}{6}$. Thus, even though A and B have the same
equivalent average return, their present values differ by $\tfrac{1}{6}$ as $\alpha$ approaches 1. For
this reason, you may want to discard B.
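The contrast between the two gaps can be tabulated as the discount factor approaches 1. This is a sketch assuming the closed-form present values of Policies A and B from (7) and (9); `pv_a` and `pv_b` are hypothetical names.

```python
def pv_a(alpha):
    """Present value of Policy A (3, 2, 1, 3, 2, 1, ...)."""
    return (3 + 2 * alpha + alpha**2) / (1 - alpha**3)


def pv_b(alpha):
    """Present value of Policy B (3, 1, 3, 1, ...)."""
    return (3 + alpha) / (1 - alpha**2)


for alpha in (0.9, 0.99, 0.999, 0.9999):
    pv_gap = pv_a(alpha) - pv_b(alpha)      # difference in present values
    ear_gap = (1 - alpha) * pv_gap          # difference in equivalent average returns
    print(alpha, round(ear_gap, 6), round(pv_gap, 6))
```

The equivalent-average gap shrinks toward zero while the present-value gap settles near one sixth, which is the basis for preferring A over B even though both have the same limiting average.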

In summary, the criterion of equivalent average return gives the same average 
return per period when the latter is well defined. Equivalent average return 
frequently will be adopted as a criterion function in the models of this and later 
chapters; but do not forget that its relevance for selecting an optimal policy in an 
unbounded horizon is assumed. (In the above examples, Policy B does offer a greater
cumulative return every sixth period, starting with the third. The equivalent average
return completely discounts this advantage.) Furthermore, keep in mind that
additional postulates are required to ensure that the present value of each policy
is always a unique finite number. And, finally, remember that if several policies
have the same equivalent average return when $\alpha = 1$, there still may be good
reason for preferring one of these policies to the others.

► The above examples were straightforward, and only Policies A* and B* in (1) were
really troublesome. But it is easy to devise other illustrations to show why the subject
of infinite streams is full of booby traps. We give a few of these pathological examples
here.

Suppose the revenue stream appears as

Policy H: 4, -4, 12, -12, 20, -20, 28, -28, . . . , 4n, -4n, . . . (n odd integers).

What would you guess to be a reasonable average return per period for this oscillating
stream? To see what is happening more clearly, look at the cumulative returns

4, 0, 12, 0, 20, 0, 28, 0, . . . , 4n, 0, . . . (n odd integers),

and the corresponding sequence of average return per period

4, 0, 4, 0, 4, 0, 4, 0, . . . , 4, 0, . . . .

Here the problem is that even though the finite horizon averages are all bounded, they
oscillate without converging. Therefore, in contrast with the behavior in Fig. 11.2, you
cannot say for the Policy H that the average is arbitrarily close to a fixed value when
the number of periods is large enough. The average at any fixed horizon is either 4 or 0.
It is possible to modify the average return per period criterion to encompass such an
oscillating sequence. For example, you could seek a policy that has the largest (or
smallest) limit point of a sequence. For Policy H, 4 and 0 are the largest and smallest
limit points in the sequence of average return per period.

Next consider the present-value calculation for Policy H

(i) $4 - 4\alpha + 12\alpha^2 - 12\alpha^3 + 20\alpha^4 - 20\alpha^5 + \cdots = \dfrac{4(1 + \alpha^2)}{(1 - \alpha)(1 + \alpha)^2}.$

The present value is a well-defined finite number for $0 \le \alpha < 1$. The corresponding
equivalent average return is

(ii) $\dfrac{4(1 + \alpha^2)}{(1 + \alpha)^2},$

giving the well-defined value 2 at $\alpha = 1$. Do you think this is a reasonable average
value for the stream?
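Both behaviors of Policy H can be reproduced in a few lines. This is a sketch assuming the stream is the pair (4n, -4n) for n = 1, 3, 5, ...; the equivalent average return is approximated by a long truncated sum, and all function names are hypothetical.

```python
def policy_h(periods):
    """4, -4, 12, -12, 20, -20, ...: the pair (4n, -4n) for n = 1, 3, 5, ..."""
    out = []
    n = 1
    while len(out) < periods:
        out.extend([4 * n, -4 * n])
        n += 2
    return out[:periods]


def running_averages(stream):
    """Average return per period after each period of the stream."""
    total, averages = 0, []
    for count, r in enumerate(stream, start=1):
        total += r
        averages.append(total / count)
    return averages


def equivalent_average_return(stream, alpha):
    """(1 - alpha) times the present value of a (long, truncated) stream."""
    return (1 - alpha) * sum(r * alpha**k for k, r in enumerate(stream))


print(running_averages(policy_h(8)))
for alpha in (0.9, 0.99, 0.999):
    print(alpha, equivalent_average_return(policy_h(40000), alpha))
```

The running averages keep alternating between 4 and 0, while the equivalent average return moves smoothly toward 2, the midpoint of the two limit points, as the discount factor approaches 1.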

Consider the stream

$R_n = R^{n-1}$ for n = 1, 2, 3, . . . ,

where $R > 1$, and where the present value of this stream is

(iii) $1 + \alpha R + (\alpha R)^2 + (\alpha R)^3 + \cdots,$

which grows beyond bound as soon as $R \ge 1/\alpha$. To rely on a present-value approach,
you must assume that the revenue stream does not grow fast enough to outweigh the
discounting.

Finally, suppose $\alpha = \bar{\alpha}$ is specified, and that the stream is

(iv) $R_n = (-1/\bar{\alpha})^{n-1}$ for n = 1, 2, 3, . . . .

Then the present-value formula (3) yields

(v) $1 - 1 + 1 - 1 + 1 - 1 + \cdots.$

Clearly (v) seems finite, but what does it equal?

In answering, you must be wary of plausible looking arguments that really lead to
absurdities. To illustrate, you might consider the present value of (v) for $\alpha < \bar{\alpha}$, and
see what happens in such a formula when you let $\alpha = \bar{\alpha}$. This approach yields

(vi) $1 - (\alpha/\bar{\alpha}) + (\alpha/\bar{\alpha})^2 - (\alpha/\bar{\alpha})^3 + \cdots = \dfrac{1}{1 + (\alpha/\bar{\alpha})}$ for $\alpha < \bar{\alpha}$.


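Thus you would be tempted to state that (v) sums to $\tfrac{1}{2}$, since this is the value on the
right of (vi) when $\alpha = \bar{\alpha}$. The flaw in this “approach by analogy” can be seen by looking
at the series:

(vii) $1 - (\alpha/\bar{\alpha})^2 + (\alpha/\bar{\alpha})^3 - (\alpha/\bar{\alpha})^5 + (\alpha/\bar{\alpha})^6 - \cdots = \dfrac{1 + (\alpha/\bar{\alpha})}{1 + (\alpha/\bar{\alpha}) + (\alpha/\bar{\alpha})^2}.$

Although (vi) and (vii) are different when $0 < \alpha < \bar{\alpha}$, both series on the left are identical
when $\alpha = \bar{\alpha}$. But in (vii) the value on the right is $\tfrac{2}{3}$ if $\alpha = \bar{\alpha}$.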

There is a variety of ways to solve the puzzle of attaching a unique value to (v),
but by necessity they require you to make an explicit assumption about how you intend
to treat such series. [Incidentally, most of the commonly made assumptions lead to the
value $\tfrac{1}{2}$ for (v).] ◄

Conclusions. The discussion in this section reached the following conclusions
about selecting an optimization criterion for dynamic models:

(i) A method of comparing streams of returns must include an assumption 
about the time value of money. 

(ii) A technique that attempts to reduce any infinite stream to a unique finite 
number may not work for all such streams. 

(iii) Even when a technique does reduce two different streams to the same 
number, the two policies may not be equally desirable if other economic 
considerations are examined. 

At this point, you may wonder how realistic the preceding specific numerical 
examples really were. Of course, they were contrived, but you should not discredit 
them on that account. As tax experts and professional investment analysts 
can assure you, every corporation does face critical policy decisions that in effect 
are choices among alternative infinite streams exhibiting behavior similar to that 
in the examples. Such situations give rise to full-blown versions of the perplexities 
you studied, and commonly occur in firms undergoing rapid growth or facing 
steadily rising costs. 

To illustrate, consider some issues that arise in the mere accounting treatment 
of cash flows and corporate assets. A company has several choices of how it depreciates 
its equipment, evaluates its current inventory, and takes a tax allowance for 
natural resource depletion. In each case, the strategy chosen can sharply affect 
the firm’s reported earnings year after year. 

Or consider a company deciding on a major equipment replacement expenditure. 
Sometimes it can postpone such an expense by temporary, but sizeable, 
maintenance outlays. If the investment is large, the firm may also want to assess 
strategies that involve leasing the equipment. The financing itself may be undertaken 
by several possible combinations of retaining earnings, incurring debt, and 
raising equity capital. To make the choice, the management must compare these 
strategies in terms of their impact on the company’s cash flow and earnings streams. 

For the remainder of this chapter and in the next, the infinite stream paradoxes 
are put aside (except for an occasional brief reminder). From here on, you will 
concentrate on applying the selection criteria and on learning the techniques for 
finding policies that are optimal according to the given criterion. 

11.3 TIMBER HARVESTING MODEL 

The next illustration serves a two-fold purpose. You will use the ideas in the 
previous section to solve a particular optimization problem. At the same time you 
will analyze a model structure that possesses wide applicability; it is discussed 
in the subsequent sections of this chapter. 

The Tiny Timber Company is planning the forestation of a new area of land. 
The firm has estimated that a tree felled at the start of the $k$th period of growth 
yields a net return $N_k > 0$. To keep the discussion simple, assume that all expenditures 
on planting and maintaining a forest are negligible as compared with the 
cost of harvesting and transporting trees at the beginning of Period $k$. Further 
suppose that all the trees are to be cut in the same period. Consequently, $N_k$ 
represents the revenue received less the costs of cutting all the trees. Assume $N_k$ 
is available at the start of Period $k$, and $k = 1$ refers to the current period when the 
forestation commences. 


Single decision. The discount factor $\alpha$ is applied to obtain the present 
value of the return, so the optimization problem is 

(1) $\underset{k = 1, 2, 3, \ldots}{\text{maximize}}\ \alpha^{k-1} N_k$. 

The formulation (1) assumes that after the forest is harvested, replanting does 
not take place. In other words, the problem as stated so far involves a single decision: 
when to cut down the trees. 

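Suppose $K$ is a value of $k$ that solves (1), and assume that the sequence 
$\alpha^0 N_1, \alpha^1 N_2, \alpha^2 N_3, \ldots$ has the property 

(2) $\alpha^0 N_1 < \alpha^1 N_2 < \cdots < \alpha^{K-2} N_{K-1} < \alpha^{K-1} N_K \ge \alpha^K N_{K+1} > \cdots$, 

so that the present value is increasing as $k$ goes from 1 to $K$ and then is decreasing 
for larger $k$. Then the inequalities $\alpha^{K-2} N_{K-1} < \alpha^{K-1} N_K$ and 
$\alpha^{K-1} N_K \ge \alpha^K N_{K+1}$ simplify to the conditions 

(3) $\dfrac{N_{K-1}}{N_K} < \alpha \le \dfrac{N_K}{N_{K+1}}$. 

Therefore the value $K$ can be found by calculating the ratios $N_k / N_{k+1}$, starting with 
$k = 1$, and terminating as soon as a ratio is at least as large as $\alpha$. [Interpret (2) 
and the stopping criterion (3) for $\alpha = 1$.] 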


Illustration. Consider the case in which 

(4) $N_k = a - b^k$, where $0 < b < 1$ and $a > b$. 

(Draw a graph of $N_k$ for $a = .75$ and $b = .5$.) Then, as you can verify after a small 
amount of algebraic manipulation, (3) can be simplified to 

(5) $b^{K-1} > \dfrac{a(1 - \alpha)}{1 - \alpha b} \ge b^K$. 

This case points up a new difficulty. Let $\alpha = 1$, so that the middle term in (5) 
equals 0. Then there is no finite value for $K$ satisfying (5), since $b^k > 0$ for all $k$. 
However, $b^k$ is arbitrarily close to 0 for $k$ sufficiently large. Thus when $\alpha = 1$, 
it is not meaningful to write “maximize” in (1) if $N_k$ is specified by (4). Explain 
why a finite maximum does not exist in this circumstance (you will find it helpful 
to refer to the graph of $N_k$). 

To illustrate (5) by a numerical example, suppose 

(6) $N_k = .85 - (.78)^k$ and $\alpha = .8$. 

Verify that $K = 4$ satisfies (5). Let P.V.[$k$] denote the present value $\alpha^{k-1} N_k$; 
then, as you may want to verify for at least $k = 1$ and 2, 

(7) P.V.[1] = .070   P.V.[2] = .193   P.V.[3] = .240   P.V.[4] = .246   P.V.[5] = .230. 

These values will be used later. 
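
The stopping rule in (3) and the present values in (7) are easy to check numerically. The following is a minimal sketch using the data in (6); the function names and the guard `k_max` are illustrative assumptions.

```python
def net_return(k, a=0.85, b=0.78):
    """N_k = a - b**k, the net return from harvesting at Period k, as in (4)."""
    return a - b ** k


def present_value(k, alpha=0.8):
    """P.V.[k] = alpha**(k-1) * N_k for the data in (6)."""
    return alpha ** (k - 1) * net_return(k)


def single_decision_period(alpha=0.8, k_max=50):
    """Smallest k with N_k / N_{k+1} >= alpha, the stopping rule in (3).

    Returns None if no ratio reaches alpha by k_max (an arbitrary guard).
    """
    for k in range(1, k_max + 1):
        if net_return(k) / net_return(k + 1) >= alpha:
            return k
    return None


K = single_decision_period()
print("K =", K)
print([round(present_value(k), 3) for k in range(1, 6)])
```

Calling `single_decision_period(alpha=1.0)` returns `None` within the guard, which mirrors the observation that no finite maximizer exists when $\alpha = 1$.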


Unbounded horizon. Now consider what happens when the forest is 
replanted in the period following the harvest, and thereafter the process is repeated 
infinitely often. If, each time, the forest is cut at the beginning of $k$ periods growth, 
the present discounted value of the return stream over an unbounded horizon is 

(8) $$\begin{aligned} F(k) &= \text{P.V.}[k]\,(1 + \alpha^k + \alpha^{2k} + \cdots) \\ &= \text{P.V.}[k] + \text{P.V.}[k]\,(\alpha^k + \alpha^{2k} + \cdots) \\ &= \text{P.V.}[k] + \alpha^k\,\text{P.V.}[k]\,(1 + \alpha^k + \cdots) \\ &= \text{P.V.}[k] + \alpha^k F(k) \end{aligned}$$ 

so that 

(9) $F(k) = \dfrac{\text{P.V.}[k]}{1 - \alpha^k}$ for $k \ge 1$. 


An optimal policy over the unbounded horizon is one that maximizes $F(k)$. Let 
the maximal value of $F(k)$ be denoted by $F$, which you can find according to 

(10) $F = \underset{k = 1, 2, 3, \ldots}{\text{maximum}} \left[\dfrac{\text{P.V.}[k]}{1 - \alpha^k}\right]$, 

given the formula for $F(k)$ in (9). An immediate implication of (10) is 

(11) $F \ge \dfrac{\text{P.V.}[k]}{1 - \alpha^k}$ for every $k$, or, equivalently, 

(12) $F \ge \alpha^k F + \text{P.V.}[k]$ for every $k$, 

where the equality holds in (11) and (12) for an optimal policy. This in turn 
implies that the value for $F$ must satisfy 

(13) $F = \underset{k = 1, 2, 3, \ldots}{\text{maximum}}\ (\alpha^k F + \text{P.V.}[k])$. 


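Suppose (10) yields $k = k'$ as an optimal policy and assume that the sequence 
$\text{P.V.}[1]/(1 - \alpha)$, $\text{P.V.}[2]/(1 - \alpha^2)$, $\ldots$, has the property 

(14) $\dfrac{\text{P.V.}[1]}{1 - \alpha} < \cdots < \dfrac{\text{P.V.}[k'-1]}{1 - \alpha^{k'-1}} < \dfrac{\text{P.V.}[k']}{1 - \alpha^{k'}} \ge \dfrac{\text{P.V.}[k'+1]}{1 - \alpha^{k'+1}} > \cdots$. 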

Then the inequalities for $k' - 1$, $k'$, and $k' + 1$ in (14) lead to conditions analogous 
to (3), namely 

(15) $\dfrac{N_{k'-1}}{N_{k'}} \cdot \dfrac{1 + \alpha + \cdots + \alpha^{k'-1}}{1 + \alpha + \cdots + \alpha^{k'-2}} < \alpha \le \dfrac{N_{k'}}{N_{k'+1}} \cdot \dfrac{1 + \alpha + \cdots + \alpha^{k'}}{1 + \alpha + \cdots + \alpha^{k'-1}}$. 

The value $k'$ can be found by calculating the right-hand side of (15), starting with 
$k = 1$, until the ratio is at least as large as $\alpha$. For each $k$, the ratio will be larger 
than $N_k / N_{k+1}$, which was calculated to obtain $K$ in (3). Hence 

(16) $k' \le K$, 

which means that the forest is usually harvested more often in the unbounded 
horizon case (and never less often). 

Verify that when $\alpha = 1$ in (15), the inequalities can be written as 

(17) $\dfrac{N_{k'-1}}{k'-1} < \dfrac{N_{k'}}{k'} \ge \dfrac{N_{k'+1}}{k'+1}$, 

implying that $k'$ is a policy yielding the maximum average return per period. 
For the numerical example in (6), an optimal policy is $k' = 2$ and 

(18) $F(1) = .350 \quad F(2) = .536 \quad F(3) = .492 \quad F(4) = .417 \quad F(5) = .342$, 

where $F(k)$ is the value of (9). Thus in the unbounded horizon situation, if you 
erroneously employed the solution $K = 4$ from the single-decision case, you would 
receive only $.417/.536 = .78$ of the truly optimal present value. 
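
The comparison between the single-decision and unbounded-horizon policies can be reproduced with a few lines. This is a sketch under the data of (6); the names `cycle_value`, `values`, and `shortfall`, and the search range 1 to 10, are assumptions made for illustration. Third-decimal differences from (18) can arise because the text appears to divide the rounded present values.

```python
def present_value(k, a=0.85, b=0.78, alpha=0.8):
    """P.V.[k] = alpha**(k-1) * (a - b**k), using the data in (6)."""
    return alpha ** (k - 1) * (a - b ** k)


def cycle_value(k, alpha=0.8):
    """F(k) in (9): value of harvesting every k periods over an unbounded horizon."""
    return present_value(k, alpha=alpha) / (1 - alpha ** k)


values = {k: cycle_value(k) for k in range(1, 11)}
k_best = max(values, key=values.get)

# Fraction of the optimal value obtained by reusing the single-decision answer K = 4.
shortfall = values[4] / values[k_best]

print("k' =", k_best)
print({k: round(v, 3) for k, v in values.items() if k <= 5})
print("fraction of optimum with K = 4:", round(shortfall, 2))
```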


11.4 INFINITE STAGE REGENERATION MODEL 

You have already seen a few applications of so-called regeneration 
models. One such example is the timber harvesting and replanting problem in 
the previous section. Each time the forest is cut, the process regenerates itself 
in the sense that the Tiny Timber Company must again decide how long to wait 
until the next harvest period. Another illustration is the problem of equipment 
replacement discussed in Sec. 10.6. There a regeneration period occurred each 
time a machine was replaced. Consequently, the decision variables are really the 
successive intervals between replacements. Instead of pursuing any particular 
example in further detail, we will now treat them all in the context of a general 
model. 

Suppose each time the decision process regenerates itself, the decision-maker 
can choose among $N$ alternatives, which are indexed $k = 1, 2, \ldots, N$. Assume 
that if Alternative $k$ is selected at a regeneration Period $t$, then the next regeneration 
occurs at Period $t + k$, and let 

(1) $R_k$ = cost of Alternative $k$ valued at the start of its regeneration period. 

Note that (1) embodies a stationarity assumption: $R_k$ does not depend on the particular 
period when the regeneration occurs. Also observe that since $R_k$ is to be 
interpreted as a cost, the sense of optimization will be to minimize. 

Finite horizon. Define 

(2) $f_n$ = present value of an optimal regeneration policy in which an alternative 
must be chosen when $n$ periods remain until the end of the 
planning horizon. 

Suppose that you choose Alternative $k$. Then you immediately incur the cost $R_k$, 
and assuming that you act optimally at the next regeneration point, $n - k$, 
you subsequently incur the cost $\alpha^k f_{n-k}$, where the factor $\alpha^k$ properly discounts the 
future cost to the present. Hence, an optimal choice when there are $n$ periods 
remaining until the end of the horizon is a policy that minimizes the sum 
$\alpha^k f_{n-k} + R_k$, and the corresponding minimum value is $f_n$. Assuming that $n \ge N$, you can 
characterize $f_n$ recursively by the relation 

(3) $f_n = \underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k f_{n-k} + R_k]$, $\quad f_0 = 0$, for $0 \le \alpha \le 1$. 

(If $n < N$, then the minimum is restricted to $k = 1, 2, \ldots, n$.) Actually, if the 
costs of Alternative $k$ occur throughout $k$ periods, then each $R_k$ would also depend 
on $\alpha$; but we let this fact remain implicit and use the abbreviated symbol $R_k$ instead 
of $R_k(\alpha)$. Figure 11.4 illustrates a network representation of recursion (3) for $\alpha = 1$. 



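To see how (3) works, suppose $k = 1$ is optimal for all horizon lengths $n$. Then 
(3) yields 

(4) $$\begin{aligned} f_n &= \alpha f_{n-1} + R_1 = \alpha^2 f_{n-2} + \alpha R_1 + R_1 \\ &= \alpha^3 f_{n-3} + \alpha^2 R_1 + \alpha R_1 + R_1 \\ &= \cdots = R_1 + \alpha R_1 + \cdots + \alpha^{n-1} R_1. \end{aligned}$$ 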

Unbounded horizon. Now suppose the planning horizon for the regeneration 
process is unbounded. Each time a regeneration occurs, the decision-maker 
continues to face an unlimited horizon. It can be proved by rigorous argument 
that there exists an optimal strategy (or policy) that is stationary: choose the same 
Alternative $k$ at each regeneration point. Then if $\alpha < 1$, the appropriate generalization 
of (3) is 

(5) $f = \underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k f + R_k]$ for $0 \le \alpha < 1$. 

[Recall that in (13) of the previous section, you studied an example of a similar 
optimization relation, the only important difference being the sense of the optimization. 
There P.V.[$k$], being a present value, did depend on $\alpha$. Note also that here 
the largest value possible for $k$ is assumed a priori to be $N$.] 

The relation (5) is an example of what is called a functional or 
extremal equation. It is the value of $f$ that is unknown, and (5) states the 
optimization relation that $f$ must satisfy, given that a stationary strategy is used. 
When dealing with extremal equations, you must always determine: 

(i) Does the equation possess a finite solution? 

(ii) If so, is the solution unique? 

(iii) If so, is f the maximal discounted return among all (not necessarily 
stationary) policies? 

To see the relevance of these questions, suppose $\alpha = 1$, contrary to the restriction 
on the right in (5). If you assume all $R_k > 0$, then no finite value for $f$ satisfies 
(5). Explain why. But if instead you assume that all $R_k = 0$, then any finite value 
for $f$ satisfies (5). Explain why. Therefore, the functional equation (5) is not appropriate 
for $\alpha = 1$. 

You can view (5) as stating that $f$ must satisfy 

(6) $f \le \alpha^k f + R_k$ or $f \le \dfrac{R_k}{1 - \alpha^k}$ for all $k$, 

and equality in (6) must hold for at least one value of $k$. It follows that a unique 
finite solution to the extremal equation (5) does exist and equals 

(7) $f = \underset{k = 1, 2, \ldots, N}{\text{minimum}} \left[\dfrac{R_k}{1 - \alpha^k}\right]$. 

An optimal stationary policy corresponds to any Alternative $k$ that yields the 
optimal value for $f$. 

You can also derive (7) on the basis of stationarity. Since it is optimal to employ 
the same Alternative $k$ every time a regeneration occurs, the present value of the 
policy is 

(8) $R_k + \alpha^k R_k + \alpha^{2k} R_k + \cdots = \dfrac{R_k}{1 - \alpha^k}$. 

Thus an optimal policy is one that minimizes this quantity, as indicated in (7). 

So far, the infinite stage problem has been solved assuming $\alpha < 1$, and you 
discovered that (5) is not appropriate for $\alpha = 1$. However, you can extend the 
analysis to $\alpha = 1$ by employing the criterion of equivalent average return suggested 
in Sec. 11.2. Specifically, use $g = (1 - \alpha) f$ in place of $f$. Verify that after some 
algebraic manipulation (7) then becomes 

(9) $g = \underset{k = 1, 2, \ldots, N}{\text{minimum}} \left[\dfrac{R_k}{1 + \alpha + \cdots + \alpha^{k-1}}\right]$. 

Given $0 \le \alpha < 1$, an optimal policy in (7) is an optimal policy in (9), and vice 
versa. By letting $\alpha = 1$ in (9), you can check that the solution corresponds to 

(10) $g = \underset{k = 1, 2, \ldots, N}{\text{minimum}} \left[\dfrac{R_k}{k}\right]$ for $\alpha = 1$. 

As we would expect, an Alternative $k$ is optimal if it minimizes average cost per 
period. 
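
The closed forms (7), (9), and (10) can be compared side by side. The sketch below uses hypothetical costs $R_1 = 10$, $R_2 = 16$, $R_3 = 27$ (not from the text) and illustrative function names; it checks that the discounted and equivalent-average criteria pick the same alternative and that $g = (1 - \alpha) f$.

```python
def discounted_optimum(costs, alpha):
    """Formula (7): minimize R_k / (1 - alpha**k); requires 0 <= alpha < 1."""
    values = {k: r / (1 - alpha ** k) for k, r in enumerate(costs, start=1)}
    k = min(values, key=values.get)
    return k, values[k]


def equivalent_average_optimum(costs, alpha):
    """Formulas (9) and (10): minimize R_k / (1 + alpha + ... + alpha**(k-1)).

    Unlike (7), this form remains well defined at alpha = 1.
    """
    values = {k: r / sum(alpha ** j for j in range(k))
              for k, r in enumerate(costs, start=1)}
    k = min(values, key=values.get)
    return k, values[k]


costs = [10.0, 16.0, 27.0]  # hypothetical R_1, R_2, R_3
k_f, f = discounted_optimum(costs, 0.9)
k_g, g = equivalent_average_optimum(costs, 0.9)
print(k_f, round(f, 3), k_g, round(g, 3))
print(equivalent_average_optimum(costs, 1.0))
```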

To test your understanding, state how you would alter (6), (7), (9), and (10) 
if the optimization indicates “maximum” in (5). 

Because the structure of this regeneration model is so simple, you can easily 
obtain an optimal stationary policy by the calculation in (7) or (10). For all practical 
purposes, then, the solution technique is trivial. Since problems of a more 
general structure do not solve so readily, however, more sophisticated approaches 
are required. In the context of dynamic programming, these are usually referred 
to as techniques of successive approximation. To help you grasp the fundamental 
ideas underlying each of these alternative approaches, the methods will be illustrated 
in the next three sections with the simple regeneration model. You will then 
find that only revised notation is required to extend them to more general problems. 

► One application of (10) is an important special case of the “inventory model with 
concave costs” that you studied in Sec. 9.5. Assume the production and inventory cost 
functions are stationary and of the form 

(i) $C_t(x_t) = \begin{cases} s + c x_t & \text{for } x_t > 0 \quad \text{(setup + unit production cost)} \\ 0 & \text{for } x_t = 0 \end{cases}$ 

$h_t(i_t) = h \cdot i_t$ for all $t$ (linear holding cost), 

where $s \ge 0$, $c \ge 0$, and $h > 0$; $x_t$ is the amount produced or ordered at the beginning 
of Period $t$, and $i_t$ the ending inventory for Period $t$. Further assume demand is stationary, 
so that $D_t = D$ for all $t$, and that demand requirements must be met on time. 

Let $Q$ be the amount produced at a regeneration point, when entering inventory is 
zero. When the horizon is finite, you know from the form of an optimal policy given 
in Sec. 9.5 that 

(ii) $Q = kD$ ($k$ a positive integer). 

Assume this form of policy for the unbounded horizon solution. The optimization 
problem is to find a multiple $k$ that minimizes average cost per period. 

Observe that 

(iii) $R_k = s + ckD + hD[(k - 1) + \cdots + 2 + 1]$ (setup + production + holding 
costs over $k$ periods) 

(iv) $\phantom{R_k} = s + ckD + hD\,\dfrac{k(k-1)}{2}$. 

Consequently, 

(v) $\dfrac{R_k}{k} = \dfrac{s}{k} + cD + \dfrac{hD(k-1)}{2}$ (average cost per period). 

An approximately optimal $k$ can be found by differentiating $(R_k/k)$ with respect to 
$k$ and setting the quantity equal to 0. Verify that the result is 

(vi) $k = \sqrt{\dfrac{2s}{hD}}$, 

so that 

(vii) $Q = kD = \sqrt{\dfrac{2sD}{h}}$. 

This expression for $Q$ is often called the economic lot-size formula. Notice that (vi) and (vii) 
are approximate because in general the values are not integers. However, when (vi) 
is not an integer, an optimal policy can be found by calculating (v) for the next smallest 
and largest integers and selecting the better answer. ◄ 
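
The rounding step described above can be sketched as follows. The data (setup cost 90, unit cost 2, holding cost 1, demand 10) are hypothetical, and the function names are illustrative; the code assumes the average cost expression in (v) and the continuous solution in (vi).

```python
import math


def average_cost(k, s, c, h, D):
    """Average cost per period (v): s/k + c*D + h*D*(k - 1)/2."""
    return s / k + c * D + h * D * (k - 1) / 2


def best_integer_multiple(s, c, h, D):
    """Evaluate (v) at the integers on either side of the continuous solution (vi)."""
    k_continuous = math.sqrt(2 * s / (h * D))
    candidates = {max(1, math.floor(k_continuous)), max(1, math.ceil(k_continuous))}
    k = min(candidates, key=lambda j: average_cost(j, s, c, h, D))
    return k, k_continuous


# Hypothetical data, chosen only for illustration.
k, k_continuous = best_integer_multiple(s=90, c=2, h=1, D=10)
print("continuous k:", round(k_continuous, 3), "integer k:", k, "lot size Q:", k * 10)
```

A brute-force scan over $k = 1, 2, \ldots$ is a simple way to confirm the rounded answer for any particular data set.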

11.5 SUCCESSIVE APPROXIMATIONS 

In this section, we initiate the discussion of numerical techniques for solving 
extremal equations that arise in dynamic programming models having an 
unbounded horizon. We use for a prototype the functional equation in the preceding 
section 

(1) $f = \underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k f + R_k]$ for $0 \le \alpha < 1$. 

Remember, saying that you want a solution to (1) really means that you want a 
value for the unknown $f$ that satisfies the equation; in addition, you would like 
to have an Alternative $k$ that yields this value of $f$. Three solution approaches are 
frequently suggested. 

The first emanates from the dynamic context of the underlying model. The idea 
is to see whether a policy that is optimal for a very long, but finite, horizon yields 
a solution value for f when used over an unbounded horizon. The second idea is 
to guess a value for /. Then compute the quantity on the right-hand side of (1) 
using this guess, and see whether the equation is satisfied. If not, let the result of 
the computation be a revised guess, and repeat the process. The third idea is to 
guess a policy that may be optimal over an unbounded horizon. Then solve for the 
corresponding present value, and use it as a trial value for $f$. See whether the equation 
is satisfied. If not, let the new guess be the policy that gives a minimum on the 
right-hand side of (1) and repeat the process. We consider the first approach in 
this section and the other two in the sections to follow. 

In all of these methods, each guess can be viewed as an approximation to the 
solution. If the guess satisfies the extremal equation, you are done. If not, you must 
guess again. This iterative process is given the label successive approximation. 

Finite horizon in the limit. Perhaps the most obvious approach for 
finding a policy that yields a solution to the functional equation (1) is to solve the 
finite horizon model 

(2) $f_n = \underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k f_{n-k} + R_k]$ for $0 \le \alpha < 1$ 

for a very large value of $n$. The examples you analyzed in Chaps. 8 and 9 should 
have made you cautious in your forecast of whether such an approach can be 
relied on to work. Can you be sure that for any $n$ large enough, a $k$ that results 
from (2) will also satisfy (1)? As you try each successively larger $n$, does a single 
$k$ remain optimal? If the horizon $n$ is long enough, is an optimal unbounded 
horizon policy also optimal as the initial decision? It is significant that for the 
regeneration model these questions have affirmative answers. 

Regeneration Model Horizon Theorem. There exists a finite value $n^*$ such 
that for any finite horizon $n > n^*$, 

(3) if $f_n = \alpha^k f_{n-k} + R_k$, then $f = \alpha^k f + R_k$; 

(4) if $f = \alpha^k f + R_k$, then $f_n = \alpha^k f_{n-k} + R_k$. 

Thus (3) asserts that any strategy that is optimal for the current decision when 
the horizon $n$ is large enough (greater than $n^*$) is also an optimal stationary 
strategy for an unbounded horizon. And (4) asserts the reverse proposition. By 
performing the calculations of (2) according to a certain computation format, 
you can ascertain $n^*$. The details of the approach are extraneous to the purpose 
of this discussion, and therefore are omitted here. 
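
The theorem can be observed numerically by running the finite horizon recursion (2) and recording the first decision for each $n$. The sketch below borrows the costs $R_1, \ldots, R_5$ and $\alpha = .8$ from the example in Sec. 11.6; the function name and the horizon length 60 are illustrative assumptions. Exact powers of $\alpha$ are used here, so the limiting value is about 30.12 rather than the 30.00 printed in Sec. 11.6, where the discount factors appear to be rounded to two decimals.

```python
def finite_horizon(costs, alpha, n_max):
    """Recursion (2) with f_0 = 0.

    Returns (f, first), where f[n] is the optimal value with n periods remaining
    and first[n] is a minimizing alternative k for the current decision.
    """
    f, first = [0.0], [None]
    for n in range(1, n_max + 1):
        candidates = {k: alpha ** k * f[n - k] + costs[k - 1]
                      for k in range(1, min(n, len(costs)) + 1)}
        k_star = min(candidates, key=candidates.get)
        f.append(candidates[k_star])
        first.append(k_star)
    return f, first


costs = [8.7, 12.7, 14.7, 19.7, 28.7]  # R_1..R_5 from the example in Sec. 11.6
alpha = 0.8
f, first = finite_horizon(costs, alpha, 60)

# Unbounded horizon value from the closed form f = min_k R_k / (1 - alpha**k).
f_unbounded = min(r / (1 - alpha ** k) for k, r in enumerate(costs, start=1))

print("first decisions for n = 1..15:", first[1:16])
print("f_60 =", round(f[60], 4), " unbounded f =", round(f_unbounded, 4))
```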

11.6 SUCCESSIVE APPROXIMATIONS IN FUNCTION 
SPACE (value iteration) 

The guiding idea of the preceding method was to find an optimal stationary 
policy, $K$, for an unbounded horizon by examining an increasing sequence of values 
of $n$. In contrast, the notion below is to successively approximate the function value $f$ 
in the extremal equation. Accordingly, the process is termed value iteration. 

Let $f^0$ be an initial guess for $f$. Then the technique is to compute a sequence of 
approximations $f^1, f^2, f^3, \ldots$, according to the recursion 

(1) $f^{n+1} = \underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k f^n + R_k]$ for $0 \le \alpha < 1$ (value iteration), 

where $f^n$ is the trial value for $f$ from iteration $n$. [If the optimization in the extremal 
equation indicates “maximum,” then the corresponding change is made in (1).] 
An example of the method is given below. 

Although the algorithm (1) is well specified, three questions arise about its 
application: 

(i) Does the value of $f^n$ always approach the value of $f$ that satisfies the 
extremal equation? 

(ii) If so, is there a finite $n$ such that $f^n$ equals $f$? 

(iii) If Alternative $k$ is chosen in (1) for two successive approximations, is it 
optimal? 

To answer these, suppose for the moment all $R_k > 0$. If you let $f^0 = 0$, then 
it can be proved that $f^{n+1} > f^n$, so that the $f^n$ are a monotonically 
increasing sequence of approximations. And for $n$ sufficiently large, $f^n$ is 
arbitrarily close to the optimal value $f$. In general, however, there is no finite $n$ 
such that $f^n$ equals $f$; and further, an alternative may be chosen on the right-hand 
side of (1) for two or more successive approximations but need not be optimal in 
an unbounded horizon. (For the regeneration model, you never return to a policy 
once it has been discarded in a function space iteration.) 





Example. The following illustrates how the approximation method works 
when $R_k > 0$. Let $N = 5$ and 

(2) $R_1 = 8.7 \quad R_2 = 12.7 \quad R_3 = 14.7 \quad R_4 = 19.7 \quad R_5 = 28.7 \quad \alpha = .8$. 

Then you can determine that the solution is 

(3) $f = \underset{k}{\text{minimum}} \left[\dfrac{R_k}{1 - \alpha^k}\right] = \text{minimum}\,[43.50,\ 35.28,\ 30.00,\ 33.39,\ 42.84] = 30.00$ 

so that $k = 3$ is optimal. 

The function space calculations in recursion (1) yield, for $n = 1, 2, 3$ and 
$f^0 = 0$: 

(4) 
$f^1 = \text{minimum}\,[\alpha^k \cdot 0 + R_k] = 8.7$ for $k = 1$ 

$f^2 = \text{minimum}\,[.8(8.7) + 8.7,\ .64(8.7) + 12.7,\ .51(8.7) + 14.7,\ .41(8.7) + 19.7,\ .33(8.7) + 28.7] = 15.66$ for $k = 1$ 

$f^3 = \text{minimum}\,[.8(15.66) + 8.7,\ .64(15.66) + 12.7,\ .51(15.66) + 14.7,\ .41(15.66) + 19.7,\ .33(15.66) + 28.7] = 21.23$ for $k = 1$. 

Check the individual terms in (4) to make sure you understand the computational 
process. For iterations $n > 3$, 

(5) 
$f^4 = 25.53 \quad f^5 = 27.57 \quad f^6 = 28.76 \quad f^7 = 29.37 \quad f^8 = 29.68 \quad f^9 = 29.84$ 
$f^{10} = 29.91 \quad f^{11} = 29.95 \quad f^{12} = 29.97 \quad f^{13} = 29.98 \quad f^{14} = 29.99 \quad f^{15} = 29.99$, 

all for $k = 3$. (Calculate $f^{16}$ and verify $k = 3$.) 
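
The function space calculations can be replayed with a short loop. This sketch assumes the two-decimal discount factors printed in the example (.8, .64, .51, .41, .33) and starts from a zero initial guess; the names `FACTORS`, `COSTS`, and `value_iteration_step` are illustrative. The first four iterates agree with the printed values, while later iterates may differ somewhat from the printed table.

```python
FACTORS = [0.8, 0.64, 0.51, 0.41, 0.33]  # alpha**k as printed (two decimals)
COSTS = [8.7, 12.7, 14.7, 19.7, 28.7]    # R_1..R_5 from (2)


def value_iteration_step(trial_value):
    """One pass of the value iteration recursion: returns (next value, minimizing k)."""
    candidates = {k: a * trial_value + r
                  for k, (a, r) in enumerate(zip(FACTORS, COSTS), start=1)}
    k = min(candidates, key=candidates.get)
    return candidates[k], k


f = 0.0  # initial guess f^0 = 0
history = []
for n in range(1, 41):
    f, k = value_iteration_step(f)
    history.append((n, round(f, 2), k))

print(history[:4])
print("after 40 iterations:", round(f, 4), "with k =", history[-1][2])
```

Note that $k = 1$ is selected for the first three iterations even though it is not optimal over the unbounded horizon.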

Commentary. The example shows in (4) that a policy ($k = 1$) can be 
selected for several successive approximations but not be an optimal solution for the 
unbounded horizon. You can alter the example so that $k = 1$ is selected for an 
arbitrarily large number of approximations by reducing $R_1$ close enough to 6. The 
calculations in (5) indicate that there is a fast rate of convergence of $f^n$ to $f$, but 
that $f^n$ does not equal $f$ for any finite $n$. 

Observe that for $\alpha = 1$, the process breaks down. For every $n$, a $k$ is selected if it 
produces the minimum $R_k$, and such a $k$ does not usually agree with the solution 
that minimizes the average cost per period $R_k/k$. 

The value iteration method given in (1) actually works for any values of $R_k$ 
and initial guess $f^0$. But then the sequence of $f^n$ values is not always monotonic. 
An alternative approach for selecting $f^0$ does always result in a monotonically 
decreasing sequence of approximations, that is, $f^{n+1} \le f^n$. The 
idea is to guess an optimal policy, and let $f^0$ be the corresponding present value 
for this policy. 

If the policy you guessed proves optimal in calculating $f^1$, then $f^1 = f^0$. 
But if a new policy is strictly better in calculating $f^1$, then the recursion (1) 
proceeds as before and $f^{n+1} < f^n$. The method is illustrated next. 

Convergence from above. Consider the example in (2) and assume 
your initial guess is $k = 1$. Then 

(6) $f^0 = \dfrac{R_1}{1 - \alpha} = 43.50$, 

and for $n = 1$, 

(7) $f^1 = \text{minimum}\,[.8(43.50) + 8.7,\ .64(43.50) + 12.7,\ .51(43.50) + 14.7,\ .41(43.50) + 19.7,\ .33(43.50) + 28.7] = 36.88$ for $k = 3$. 

For iterations $n > 1$, 

(8) 
$f^2 = 33.50 \quad f^3 = 31.78 \quad f^4 = 30.90 \quad f^5 = 30.45 \quad f^6 = 30.22$ 
$f^7 = 30.11 \quad f^8 = 30.05 \quad f^9 = 30.02 \quad f^{10} = 30.01$, 

all for $k = 3$. You can alter the example, by reducing $R_4$ close to 17.7, so that $k = 4$ 
is selected for an arbitrarily large number of approximations. Verify that had you 
started the process by guessing $k = 3$, then $f^1 = f^0 = 30.00$ for $k = 3$. 

The motivation for letting $f^0 = 0$ in the application of recursion (1) was 
mainly numerical convenience. You would not gain much insight from a verbal 
description of the approximation process with this starting point. However, letting 
$f^0$ be the present value of an initially guessed policy does lead to a key idea. The 
amount $f^0$ in (6) represents the present value of adopting the policy $k = 1$ over 
an unbounded horizon. Suppose that instead of $k = 1$, your immediate decision 
is $k = 3$, and thereafter you always let $k = 1$. The present value of this strategy is 
$f^1$ in (7). Analogously, $f^2$ in (8) actually represents the present value of letting 
$k = 3$ for the first two regeneration decisions, and letting $k = 1$ subsequently. 
This observation suggests another mode of approximation, discussed in the next 
section. 

►What does it mean to say that the value iteration recursion (1) always works? Three 
points must be demonstrated. Two of them have already been mentioned, namely, that 
the extremal equation has a finite solution and that it is unique. Because the regeneration 
model has such a simple structure, the existence and uniqueness argument embodied 
in Sec. 11.4 suffices. More general models require additional care to establish these two 
points. 

The third proposition is that the sequence of $f^n$ converges to the solution value, for 
any $R_k$ and initial guess $f^0$. The argument here is as follows. Let $K$ be an optimal stationary 
policy for an unbounded horizon and let $k$ be selected for a given iteration $n$ according 
to the recursion (1). Then 

(i) $f = \alpha^K f + R_K$, 

since $K$ is optimal in the unbounded horizon, and 

(ii) $f^n \le \alpha^K f^{n-1} + R_K$, 

since $k$ gives rise to $f^n$. Thus 

(iii) $f^n - f \le \alpha^K (f^{n-1} - f) \le \alpha\,|f^{n-1} - f|$, 

because $0 \le \alpha < 1$. 

Similarly, 

(iv) $f^n = \alpha^k f^{n-1} + R_k$ 

(v) $f \le \alpha^k f + R_k$ 

so that 

(vi) $f - f^n \le \alpha^k (f - f^{n-1}) \le \alpha\,|f^{n-1} - f|$. 

Combining (iii) and (vi) gives 

(vii) $|f^n - f| \le \alpha\,|f^{n-1} - f| \le \cdots \le \alpha^n\,|f^0 - f|$. 

Since $\alpha < 1$, the right-hand side of (vii) is arbitrarily small for sufficiently large $n$. 
Therefore $f^n$ approaches $f$ in the limit. Notice that the convergence is at least at an 
exponential rate. 
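
The exponential bound can be checked on the running example. The sketch below assumes the two-decimal discount factors used in Sec. 11.6 and starts from the present value of the policy $k = 1$ (43.50), so it also illustrates convergence from above; the names `step`, `iterates`, and `bound_holds`, and the small tolerance, are illustrative assumptions.

```python
FACTORS = [0.8, 0.64, 0.51, 0.41, 0.33]  # alpha**k as printed (two decimals)
COSTS = [8.7, 12.7, 14.7, 19.7, 28.7]    # R_1..R_5


def step(trial_value):
    """Right-hand side of the value iteration recursion for a trial value."""
    return min(a * trial_value + r for a, r in zip(FACTORS, COSTS))


# Solution of the extremal equation with these factors, and the starting guess.
f_star = min(r / (1 - a) for a, r in zip(FACTORS, COSTS))
f0 = COSTS[0] / (1 - FACTORS[0])  # present value of always choosing k = 1

iterates = [f0]
for _ in range(30):
    iterates.append(step(iterates[-1]))

# Check |f^n - f| <= alpha**n * |f^0 - f| with alpha = .8 (tiny slack for rounding).
bound_holds = all(
    abs(f_n - f_star) <= FACTORS[0] ** n * abs(f0 - f_star) + 1e-12
    for n, f_n in enumerate(iterates)
)
print([round(v, 2) for v in iterates[:5]], bound_holds)
```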

11.7 SUCCESSIVE APPROXIMATIONS IN POLICY 
SPACE (policy iteration) 

Suppose, in calculating the right-hand side of the recursion (1) in the previous 
section, you find a policy that makes a strict improvement over the one associated 
with $f^n$. This means that using this policy is an improvement over using the previous 
policy for the immediate decision. It is plausible, and correct, that using the new 
policy throughout the entire unbounded horizon would be even better than 
employing it only for the immediate decision. Then $f^{n+1}$ can be calculated as 
the present value of repeatedly choosing the new policy. This process is known as 
approximation in policy space, or simply as policy iteration, 
since each iteration considers a new trial stationary policy for the unbounded 
horizon. 

The resultant sequence of $f^n$ is monotonically decreasing, and a strict improvement 
occurs at every iteration; therefore you never return to a policy once it has 
been discarded. Since there is a finite number $N$ of distinct stationary policies, 
the approach must terminate in a finite number of iterations. As soon as a policy 
remains optimal for two successive approximations, you may stop the calculations, 
and $f^n$ equals the optimal value $f$ satisfying the extremal equation. As you will 
see, the price you pay to obtain a finite algorithm is the effort involved in calculating 
$f^n$ for a new policy at each iteration. 

The algorithm is 

Step 1. Select an arbitrary initial policy, and let $n = 0$. 

Step 2. Given the trial policy $k$, calculate the associated 

(1) $f^n = \dfrac{R_k}{1 - \alpha^k}$ (present value of trial $k$ over an unbounded horizon). 

Step 3. Test for an improvement by calculating 

(2) $\underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k f^n + R_k] = \alpha^{k'} f^n + R_{k'}$ (select $k'$). 

Step 4. Terminate the iterations if $\alpha^{k'} f^n + R_{k'} = f^n$. Otherwise, revise the 
policy to $k'$. Increase $n$ to $n + 1$, and return to Step 2 with the new trial policy. 

Observe that whereas the very process of approximation in function space leads 
immediately to successive trial values for $f$, now these must be computed separately 
from (1). Notice also that the test for termination in Step 4 is satisfied if $k'$ is the 
same as the trial policy used in Step 2. That is, the calculations cease whenever 
$k'$ is the same for two successive iterations. [How would you modify (2) if the optimization 
in the extremal equation indicates “maximum”?] 


Example. To illustrate the approach, consider 

(3) $R_1 = 8.7 \quad R_2 = 12.7 \quad R_3 = 14.7 \quad R_4 = 19.7 \quad R_5 = 28.7$, 

which is example (2) of the previous section. 

As before, take your initial policy guess to be $k = 1$, so that $f^0 = 43.50$. The 
test calculation in (2) is the same as (7) in the previous section. Thus in the value 
formula (1) you now find $f^1 = 30.00$ for $k = 3$. The second application of the test 
quantity (2) yields 

(4) $\text{minimum}\,[.8(30.00) + 8.7,\ .64(30.00) + 12.7,\ .51(30.00) + 14.7,\ .41(30.00) + 19.7,\ .33(30.00) + 28.7] = 30.00$ for $k' = 3$, 

so that the process terminates. 
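
Steps 1 through 4 can be sketched as a short loop. The code assumes the two-decimal discount factors of the example, and it replaces the exact equality test of Step 4 with a small numerical tolerance; the function names and the tolerance are illustrative assumptions.

```python
FACTORS = [0.8, 0.64, 0.51, 0.41, 0.33]  # alpha**k as printed (two decimals)
COSTS = [8.7, 12.7, 14.7, 19.7, 28.7]    # R_1..R_5 from (3)


def policy_value(k):
    """Step 2: present value of always choosing Alternative k."""
    return COSTS[k - 1] / (1 - FACTORS[k - 1])


def improvement_test(trial_value):
    """Step 3: minimize alpha**k * f^n + R_k over k; returns (k', test quantity)."""
    candidates = {k: FACTORS[k - 1] * trial_value + COSTS[k - 1]
                  for k in range(1, len(COSTS) + 1)}
    k_new = min(candidates, key=candidates.get)
    return k_new, candidates[k_new]


def policy_iteration(k, tol=1e-9):
    """Steps 1-4, starting from the trial policy k."""
    trace = []
    while True:
        trial_value = policy_value(k)
        trace.append((k, round(trial_value, 2)))
        k_new, test = improvement_test(trial_value)
        if test >= trial_value - tol:  # Step 4: no strict improvement remains
            return k, trial_value, trace
        k = k_new


k_opt, f_opt, trace = policy_iteration(1)
print(k_opt, round(f_opt, 2), trace)
```

Starting from $k = 1$, the loop visits only two policies, in contrast to the many iterations needed by value iteration on the same data.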


Average return per period. As usual, to obtain the corresponding 
method for $\alpha = 1$, it is helpful to recast the procedure in terms of equivalent 
average return. The analogy to the value formula (1) is simply 

(5) $g^n = \dfrac{R_k}{1 + \alpha + \cdots + \alpha^{k-1}} = \dfrac{R_k}{k}$ for $\alpha = 1$. 

Note that if $R_k$ depends on $\alpha$, then the value of $R_k$ at $\alpha = 1$ is used in calculating 
the ratio on the right-hand side of (5). 

The expression on the left in the test quantity (2) becomes 

(6) $\underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [\alpha^k g^n + (1 - \alpha) R_k]$, 

but when you let $\alpha = 1$, the bracketed expression is independent of $k$. To rectify 
the situation, observe the following. If $k'$ minimizes a function $g(k)$, then $k'$ also 
minimizes $a\,g(k) + b$, for $a > 0$. Let $g(k)$ be the expression in the brackets of (6), 
$a = (1 - \alpha)^{-1}$, and $b = -g^n (1 - \alpha)^{-1}$. Make this transformation in (6), and 
verify that the following optimization, analogous to the test in (2), is appropriate: 

(7) $\underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [-(1 + \alpha + \cdots + \alpha^{k-1})\,g^n + R_k] = \underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [-k g^n + R_k]$ for $\alpha = 1$. 

Once again, you must use the value of $R_k$ appropriate for $\alpha = 1$, if in fact $R_k$ 
depends on $\alpha$. Give a verbal interpretation of the criterion in (7) for $\alpha = 1$. 
To summarize, the technique is 


Step 1. Select an arbitrary initial policy, and let $n = 0$. 

Step 2. Given the trial policy $k$, calculate the associated 

(8) $g^n = \dfrac{R_k}{k}$ (average cost per period of $k$). 

Step 3. Test for an improvement by calculating 

(9) $\underset{k = 1, 2, \ldots, N}{\text{minimum}}\ [-k g^n + R_k] = -k' g^n + R_{k'}$ (select $k'$). 

Step 4. Terminate the iterations if $-k' g^n + R_{k'} = 0$. Otherwise, revise the 
trial policy to $k'$. Increase $n$ to $n + 1$, and return to Step 2 with the new trial 
policy. 


To see how the method works, apply the algorithm to the example (3). The 
sequence of calculations is 

(i) $g^0 = R_1/1 = 8.7$ for $k = 1$ as the initial policy, 

(ii) $\text{minimum}\,[-1(8.7) + 8.7,\ -2(8.7) + 12.7,\ -3(8.7) + 14.7,\ -4(8.7) + 19.7,\ -5(8.7) + 28.7] = -15.1$ for $k' = 4$, 

(iii) $g^1 = R_4/4 = 4.925$, 

(iv) $\underset{k = 1, \ldots, 5}{\text{minimum}}\ [-k(4.925) + R_k] = -.075$ for $k' = 3$, 

(v) $g^2 = R_3/3 = 4.9$, 

(vi) $\underset{k = 1, \ldots, 5}{\text{minimum}}\ [-k(4.9) + R_k] = 0$ for $k' = 3$. 

Notice the same policy $k' = 3$ is indicated for two successive iterations, thereby 
causing termination at Step 4, and $k' = 3$ is optimal. 
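
The average-cost version of the algorithm can be traced in the same way. This sketch follows Steps 1 through 4 for $\alpha = 1$ with the costs of example (3); the function name, the trace layout, and the tolerance that stands in for the exact test "equals 0" are illustrative assumptions.

```python
COSTS = [8.7, 12.7, 14.7, 19.7, 28.7]  # R_1..R_5 from example (3)


def average_cost_policy_iteration(k, tol=1e-9):
    """Policy iteration for alpha = 1.

    Returns (optimal k, average cost g, trace); each trace entry is
    (trial k, g for that k, selected k', test quantity -k'*g + R_k').
    """
    trace = []
    while True:
        g = COSTS[k - 1] / k                              # Step 2
        tests = {j: -j * g + COSTS[j - 1]                 # Step 3
                 for j in range(1, len(COSTS) + 1)}
        k_new = min(tests, key=tests.get)
        trace.append((k, g, k_new, tests[k_new]))
        if tests[k_new] >= -tol:                          # Step 4
            return k, g, trace
        k = k_new


k_opt, g_opt, trace = average_cost_policy_iteration(1)
for trial_k, g, k_new, test in trace:
    print(f"k = {trial_k}: g = {g:.3f}, test = {test:.3f} at k' = {k_new}")
```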

►You may not really be indifferent between two policies thatlook equally good accord¬ 
ing to the test in (9). For example, if you add another decision A = 6 with return R^ = 
2 R 3 = 29.4, this policy has the same value of g when OC = 1 as does policy A = 3. But 
for a < I, 

^6 Rj _ ^3 > 

1 - a® ■" 2 

So for 0 ’^ < 1 the present value from A = 6 always exceeds the present value for 

A = 3. Thus, although A = 3 is optimal according to the test in (9), you may really 
prefer A = 6. 
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*11.8 A LINEAR PROGRAMMING EQUIVALENT 

In Sec. 10.7* you studied the connection between dynamic recursive relations 
and linear programming. In brief, the dynamic programming values corresponded 
to the lengths of the shortest routes to the terminal from the nodes of an acyclic 
network. In unbounded horizon models, the network analogy remains valid, 
except that an optimal stationary policy implies traversing an associated cycle 
an infinite number of times. The network interpretation will be described in Chap.
12, but as a preview, the primal and dual linear programming problems for the
simple infinite stage regeneration model are developed here. 

The functional equation

(1)  $f = \min_{k=1,2,\ldots,N} [\alpha^k f + R_k]$  for $0 \le \alpha < 1$,

can be expressed as

(2)  maximize f

subject to

(3)  $f \le \alpha^k f + R_k$  for k = 1, 2, ..., N,

where f is unconstrained in sign. Rearranging terms in (3) gives the constraints

(4)  $(1 - \alpha^k) f \le R_k$  for k = 1, 2, ..., N.

It is convenient to transform the problem into one involving equivalent average
return. To do so, as usual, let $g = (1 - \alpha) f$. Equivalent to (2) and (4) is the
optimization

(5)  maximize g  (dual objective)

subject to

(6)  $g \le \frac{1 - \alpha}{1 - \alpha^k} R_k$  for k = 1, 2, ..., N  (dual constraints),

where g is unconstrained in sign.

If you call (5) and (6) the dual problem, verify that the primal problem is

(7)  minimize $\sum_{k=1}^{N} \frac{1 - \alpha}{1 - \alpha^k} R_k x_k$  (primal objective)

subject to

(8)  $\sum_{k=1}^{N} x_k = 1$  (primal constraint),

where $x_k \ge 0$. Note that when $\alpha = 1$, the objective function is

(9)  minimize $\sum_{k=1}^{N} \frac{R_k}{k} x_k$.

Since the primal model has only one constraint (8), an optimal basic solution
consists of a single $x_K = 1$, corresponding to a best regeneration policy, K, and
all other $x_k = 0$.
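
Because the primal has a single constraint, it can be solved by inspection. The
Python sketch below is a minimal illustration under stated assumptions: the
primal is taken in the form "minimize the sum of $(1 - \alpha) R_k x_k / (1 - \alpha^k)$
subject to the $x_k$ summing to 1", the function names are invented here, and the
costs in `R` are illustrative values rather than data from this section.

```python
def primal_coefficients(R, alpha):
    """Primal objective coefficients (1 - alpha) R_k / (1 - alpha**k),
    which become R_k / k in the limiting case alpha = 1."""
    if alpha == 1:
        return {k: R[k] / k for k in R}
    return {k: (1 - alpha) * R[k] / (1 - alpha**k) for k in R}


def solve_regeneration_lp(R, alpha):
    """Solve the one-constraint primal by inspecting its basic solutions.

    With the single constraint sum(x_k) = 1 and x_k >= 0, each basic
    solution sets one x_k = 1, so the optimum is the smallest coefficient.
    """
    coeff = primal_coefficients(R, alpha)
    K = min(coeff, key=coeff.get)
    x = {k: 1.0 if k == K else 0.0 for k in R}
    g = coeff[K]                      # common optimal value of primal and dual
    return K, g, x


R = {1: 8.7, 2: 12.7, 3: 14.7, 4: 19.7, 5: 28.7}   # illustrative costs
alpha = 0.9
K, g, x = solve_regeneration_lp(R, alpha)
f = g / (1 - alpha)                   # recover f from g = (1 - alpha) f

# g is dual feasible, and f satisfies the functional equation (1).
dual_feasible = all(g <= c + 1e-12
                    for c in primal_coefficients(R, alpha).values())
residual = f - min(alpha**k * f + R[k] for k in R)
print(K, round(g, 4), dual_feasible, abs(residual) < 1e-9)
```

The equality of the primal and dual objective values at the selected K is what
ties the linear programming solution back to the functional equation.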

11.9 SHORTEST-ROUTE PROBLEM REVISITED

In the next chapter you will study much more general dynamic models than the 
regeneration problem in the preceding sections. And to find policies that are 
optimal over an unbounded horizon, you will employ the techniques of successive 
approximation discussed in Secs. 11.6 and 11.7, so this material has served to 
prepare you for what is ahead. As further preparation, it will be helpful for you to 
consider once again the problem of finding a shortest route in a general network. 
Here we will generalize the statement of this problem so as to encompass
discounting, and take a second look at algorithmic techniques, now emphasizing the
ideas of successive approximation that you have learned. 

As usual, the network structure consists of p nodes and a collection of arcs.
As before, let (i, j) denote the arc from Node i to Node j, and $c_{ij}$ denote the
associated expense of traversing (i, j). You can let the amount $c_{ij}$ be a present
value that depends on the discount factor $\alpha$, but such dependence will not be shown
explicitly in the notation. The terminal is denoted by Node r.

For an example, consider the network in Fig. 11.5, which is the same as that in
Fig. 7.16, p. 233. The terminal is Node 1. Assume the following time orientation
of a path between the terminal and Node k: let Node k be the current state and
Node 1 the state at the end of the final stage. To illustrate, a path along the arcs
(7, 2), (2, 3), (3, 4), (4, 1) represents the current decision to go from Node 7 to
Node 2, the subsequent decision to go from Node 2 to Node 3, etc.

Figure 11.5. Shortest-Route Example.

Suppose that it takes only one period to traverse each arc. (A more general
case where it can take more than one period to traverse certain arcs is discussed
in the advanced material below.) Let the one-period discount factor be $\alpha$, where
$0 \le \alpha \le 1$. We generalize the definition of $y_i$, which in Sec. 7.6 denoted the length
of a shortest route from Node i to the Terminal Node r. Here we let

(1)  $y_i$ = present value of an optimal path from Node i to the Terminal Node r,

where optimal means a path having minimum present value. Thus, if a best route
from Node i to Node r starts by going to Node j, then

(2)  $y_i = \alpha y_j + c_{ij}$

and

(3)  $y_i \le \alpha y_k + c_{ik}$  for all k.

[Give a verbal interpretation of (2) and (3).] Since (2) and (3) must hold for every
Node $i \ne$ Terminal Node r, the $y_i$ must satisfy a set of extremal equations,

(4)  $y_i = \min_{(i,j) \text{ in network}} (\alpha y_j + c_{ij})$  for all $i \ne r$  and  $y_r = 0$.

In other words, for each Node i, the minimization is taken over every arc (i, j)
that exists in the network. [When $\alpha = 1$, (4), of course, represents the same
relations as in Sec. 7.6.]

In one way this is a conceptually more difficult extremal-equation problem
than that for the regeneration model in the preceding sections. In the regeneration
model there was only a single unknown f; here there are $p - 1$ unknowns
$y_i$ ($i \ne r$). But in another way, the shortest-route model is simpler because there is
a terminal point, and hence $y_r = 0$. That is, although the actual time span to go
from any Node k to Node r varies with the particular route taken, this span is
always finite. You will see next how to use the methods of successive approximation
to calculate all the $y_i$.

Value iteration. The same approaches you studied in the analysis of
the regeneration model can be applied to successively approximate the $y_i$. The
function space or value iteration algorithm is

(5)  $y_i^{n+1} = \min_{(i,j) \text{ in network}} (\alpha y_j^n + c_{ij})$  for all $i \ne r$  and  $y_r^{n+1} = 0$  (value iteration).

Terminate the calculations at iteration n + 1 if $y_i^{n+1} = y_i^n$ for every Node i.
Alternatively, stop the iterations when for every Node i

(6)  $y_i^n \le \alpha y_j^n + c_{ij}$  for all (i, j) in network,

and equality holds for at least one arc (i, j). A convenient initial guess for the $y_i$ is

(7)  $y_i^0 = 0$  for every Node i.





If all the $c_{ij} > 0$, then by starting with (7) the $y_i^n$ increase monotonically. Using
(7), you can interpret the amount $y_i^n$ as the minimal present value of any path
from Node i that contains exactly n arcs, unless the terminal is on the path, in which
case the path terminates at Node r. (Some arcs on a path may be traversed more
than once.)

Does the value iteration algorithm (5) and (7) always work? If $\alpha = 1$ and the
cost around every loop of the network is positive, then there exists an n* such
that $y_i^n = y_i$ for all $n \ge n^*$, and finite termination does occur. The reason is that
as n increases, eventually every path of n arcs reaches the terminal. If $\alpha < 1$,
however, then also assume that for each Node k the present value of traversing
every loop that starts and ends at Node k is larger than the present value of the
best loopless path from Node k to the terminal. With this assumption, the
convergence is finite.
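
The recursion (5) with the starting guess (7) can be sketched in a few lines of
Python. This is an illustration under stated assumptions: the function name
`value_iteration` is invented here, and the four-node network `ARCS` is
hypothetical (it is not the network of Fig. 11.5, whose arc costs are not
listed in the text).

```python
def value_iteration(arcs, terminal, alpha=1.0, max_iter=1000):
    """Value iteration (5), started from the initial guess (7).

    arcs maps (i, j) to the cost c_ij; every node other than the terminal
    must have at least one outgoing arc. Returns the final values and the
    iteration number at which two successive approximations agree.
    """
    nodes = {i for i, _ in arcs} | {j for _, j in arcs}
    y = {i: 0.0 for i in nodes}                       # (7): y_i^0 = 0
    for n in range(1, max_iter + 1):
        y_next = {terminal: 0.0}
        for i in nodes - {terminal}:
            y_next[i] = min(alpha * y[j] + c
                            for (tail, j), c in arcs.items() if tail == i)
        if y_next == y:                               # termination test
            return y_next, n
        y = y_next
    raise RuntimeError("no termination within max_iter iterations")


# Hypothetical four-node network; Node 1 is the terminal.
ARCS = {(2, 1): 10, (2, 3): 2, (3, 2): 2, (3, 4): 3, (3, 1): 8, (4, 1): 3}

y, n = value_iteration(ARCS, terminal=1)
print(y, n)
```

With these costs the early approximations for Nodes 2 and 3 correspond to paths
that shuttle between the two nodes; the values stop changing only once that loop
has become more expensive than heading for the terminal.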

Example. The value iteration algorithm (5) and (7) is illustrated in Fig.
11.6 for the example shown in Fig. 11.5, where $\alpha = 1$. To test your understanding,
be sure to perform the detailed calculations on Fig. 11.5 (copy the figure on a
piece of paper or write lightly with pencil on the page); verify the results for each
n. Notice that at iteration 9 all $y_i^9 = y_i^8$, so the computations stop.

Figure 11.6. Approximation from Below in Function Space
(Value Iteration).

Observe

(8)  $y_2^{n+1} = y_7^n + c_{27}$  and  $y_7^{n+1} = y_2^n + c_{72}$  for n = 0, 1, ..., 5.

Thus for $n \le 6$, the best path of n arcs from Node 2 involves a loop with Node 7.
Similarly, the best path of n arcs from Node 7 involves a loop with Node 2. Note
also that the set of arcs at iteration n = 7 is optimal, but $y_8^7$ is not the correct
value for $y_8$, so the procedure does not terminate at this iteration.

You can also employ the function space approximation (5) to converge
monotonically on the $y_i$ from above. For your initial guess, select an arc for each node so
that every node is connected by a path to the terminal. Let $y_i^0$ be the corresponding
present value of the path from Node i to the terminal. As before, there exists an
n* such that $y_i^n = y_i$ for all $n \ge n^*$, so that finite convergence occurs. This
approach is most closely related to the shortest-route algorithm in Sec. 7.6. There,
no special attention was given to the specific order in which the $y_i$ were
approximated. Here, a literal interpretation of (5) requires you to calculate a new
approximation $y_i^{n+1}$ for every node before returning to any previous node.
Hence (5) may not converge as quickly as other possible techniques.

The procedure (5) converging from above is demonstrated by the illustration in
Fig. 11.5, with $\alpha = 1$. Let the starting arcs be (8, 7), (7, 3), (6, 4), (5, 4),
(4, 1), (3, 4), (2, 3). Verify that the corresponding values for $y_i^0$ are those given
for n = 0 in Fig. 11.7. Carry out the approximation process (5) on Fig. 11.5
and verify the other values of $y_i^n$ appearing in Fig. 11.7. Notice that the set of
arcs for n = 1 is optimal, but the iterations do not terminate at the next iteration
n = 2, because $y_8^2 \ne y_8^1$. The calculations terminate at n = 3, for then all
$y_i^3 = y_i^2$.

As the illustration shows, the approximation in function space procedure may
determine an optimal policy at iteration n, but the associated values for $y_i^n$
may not be optimal. When the function space approximation is from below,
the $y_i^n$ need not correspond to any realizable path from Node i to the terminal.
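
Convergence from above can be illustrated in the same spirit. In the Python
sketch below, the helper names `path_values` and `iterate_from` and the
four-node network `ARCS` are hypothetical choices made for illustration; the
starting guess is built from one chosen arc per node, as the text describes,
and the loop relies on the finite convergence that the text asserts under its
assumptions.

```python
def path_values(successor, arcs, terminal, alpha=1.0):
    """Present value of following the chosen arc out of each node until the
    terminal is reached (assumes the chosen arcs contain no loop)."""
    def value(i):
        if i == terminal:
            return 0.0
        j = successor[i]
        return alpha * value(j) + arcs[(i, j)]
    values = {i: value(i) for i in successor}
    values[terminal] = 0.0
    return values


def iterate_from(y0, arcs, terminal, alpha=1.0):
    """Apply (5) repeatedly from the guess y0; return every approximation."""
    history = [dict(y0)]
    while True:
        y = history[-1]
        y_next = {terminal: 0.0}
        for i in y:
            if i != terminal:
                y_next[i] = min(alpha * y[j] + c
                                for (tail, j), c in arcs.items() if tail == i)
        history.append(y_next)
        if y_next == y:                      # two successive guesses agree
            return history


# Hypothetical four-node network; Node 1 is the terminal.
ARCS = {(2, 1): 10, (2, 3): 2, (3, 2): 2, (3, 4): 3, (3, 1): 8, (4, 1): 3}
start = {2: 1, 3: 1, 4: 1}          # every node goes straight to the terminal
history = iterate_from(path_values(start, ARCS, terminal=1), ARCS, terminal=1)
for n, y in enumerate(history):
    print(n, [y[i] for i in (2, 3, 4)])
```

Every approximation in `history` is at least as large as the one that follows
it, which is the monotone behavior claimed for a start from above.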

Figure 11.7. Approximation from Above in Function Space
(Value Iteration).

Policy iteration. Turning to the approximation in policy space approach, at
every iteration n you must evaluate the actual $y_i$ associated with the policy being
tested. Specifically, perform the following:

Step 1. Select an arbitrary initial set of $p - 1$ arcs such that each node is
connected by a path to the terminal, and let n = 0.

Step 2. Given the trial set, calculate the associated $y_i^n$, where $y_r^n = 0$.

Step 3. Test for an improvement by calculating

(9)  $\min_{(i,j) \text{ in network}} (\alpha y_j^n + c_{ij}) \equiv Y_i^n$  for each $i \ne r$.

Step 4. Terminate the iterations if $Y_i^n = y_i^n$ for every Node i, as the set of arcs
is optimal. Otherwise, using an arc yielding $Y_k^n$ in (9), revise the set at each
Node k where $Y_k^n < y_k^n$. Increase n to n + 1, and return to Step 2 with the new
trial set of arcs.
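
The four steps of the policy space approach can be sketched as follows. The
function names, the tolerance `tol`, and the small network `ARCS` are
assumptions made for illustration; in particular the network is hypothetical
and is not the one in Fig. 11.5.

```python
def evaluate(successor, arcs, terminal, alpha):
    """Step 2: present value of following the trial arcs to the terminal."""
    values = {terminal: 0.0}

    def value(i, seen=()):
        if i not in values:
            if i in seen:
                raise ValueError("trial arcs contain a loop")
            j = successor[i]
            values[i] = alpha * value(j, seen + (i,)) + arcs[(i, j)]
        return values[i]

    for i in successor:
        value(i)
    return values


def policy_iteration(successor, arcs, terminal, alpha=1.0, tol=1e-12):
    """Policy iteration for the shortest-route extremal equations."""
    successor = dict(successor)                  # Step 1: initial trial arcs
    n = 0
    while True:
        y = evaluate(successor, arcs, terminal, alpha)          # Step 2
        improved = False
        for i in list(successor):                               # Step 3
            best_value, best_j = min((alpha * y[j] + c, j)
                                     for (tail, j), c in arcs.items()
                                     if tail == i)
            if best_value < y[i] - tol:                         # Step 4
                successor[i] = best_j
                improved = True
        if not improved:
            return successor, y, n
        n += 1


# Hypothetical four-node network; Node 1 is the terminal.
ARCS = {(2, 1): 10, (2, 3): 2, (3, 2): 2, (3, 4): 3, (3, 1): 8, (4, 1): 3}
policy, y, n = policy_iteration({2: 1, 3: 1, 4: 1}, ARCS, terminal=1)
print(policy, y, n)
```

The separate evaluation in `evaluate` is what distinguishes this sketch from
value iteration: the values tested in Step 3 always belong to an actual set of
arcs leading to the terminal.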

To test your understanding, apply the algorithm to the example of Fig. 11.5.
Start with the same set of arcs that was used in Fig. 11.7 at n = 0, and consequently
the same values for $y_i^0$. Check that at n = 1, all $Y_i^1 = y_i^1$, which are the correct
values for $y_i$. Hence the iterations terminate at n = 1, as the optimal policy has
been identified.

In comparing the function space and policy space approaches, observe that the
same type of calculations are indicated by the right-hand side of (5) and by the
left-hand side of (9). But in the function space approach, the resultant value
in (5) is then used as the new approximation to $y_i$ at the next iteration, whereas,
in the policy space approach, a separate and distinct calculation for the next
approximation to $y_i$ is indicated in Step 2. It is this additional calculation that
ensures the iterations terminate as soon as the correct policy has been found.
See exercise 58 for more detail about Step 2.

Summary. The methods of successive approximation in both function space 
and policy space are applicable to solving the extremal equation for the problem 
of finding a best route from each Node k to the terminal. You should note two 
points of difference in the application of value iteration to the regeneration model 
and to the network model. In the regeneration model there was only one unknown, 
and therefore only one extremal relation, whereas in the network model there are 
$p - 1$ unknowns and extremal equations. In contrast, the horizon in the regeneration
model was unbounded, and value iteration does not necessarily converge in
a finite number of iterations, whereas finite convergence does occur for a shortest 
route in the network model. Chapter 12 will combine the network formulation 
above with the process of optimization over a limitless vista. 

► The preceding discussion assumed that only one period elapses in traversing any
single arc. But sometimes you want to permit several periods to elapse in traversing an
arc. For example, in the regeneration model, illustrated in Fig. 11.4, an arc (i, j) implies
i − j periods elapse. The solution techniques above are easily modified to permit this
generalization.

Let

(i)  $h_{ij}$ = number of periods required to go from Node i to Node j along arc (i, j).

Then associated with each arc (i, j) is a discount factor

(ii)  $\alpha_{ij} = \alpha^{h_{ij}}$  for $0 \le \alpha \le 1$.

Instead of (4), the extremal equations for the present values are

(iii)  $y_i = \min_{(i,j) \text{ in network}} (\alpha_{ij} y_j + c_{ij})$  for all $i \ne r$  and  $y_r = 0$,

and in the same fashion, all the other formulas in this section are modified by writing
$\alpha_{ij}$ wherever $\alpha$ appears.
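
A small Python sketch may help to fix the idea of arc-dependent discount
factors. It is written under assumptions that go beyond the text: the nodes
are numbered by the periods remaining, an arc from Node i to Node i − k is
taken to cost `R[k]` and to last k periods, and the costs themselves are
illustrative. Because such a network has no loops, a single pass from the
terminal outward solves the extremal equations.

```python
def regeneration_network_values(R, horizon, alpha):
    """Solve extremal equations with arc discount factors alpha**h.

    Nodes are horizon, ..., 1, 0 with terminal 0. Arc (i, i - k) stands for
    the choice k: it takes h = k periods, costs R[k], and carries the
    discount factor alpha**k.
    """
    y = {0: 0.0}                                   # y_r = 0 at the terminal
    for i in range(1, horizon + 1):                # acyclic: one pass suffices
        y[i] = min(alpha**k * y[i - k] + R[k] for k in R if k <= i)
    return y


R = {1: 8.7, 2: 12.7, 3: 14.7, 4: 19.7, 5: 28.7}   # illustrative costs
y = regeneration_network_values(R, horizon=6, alpha=0.9)
print({i: round(v, 3) for i, v in y.items()})
```

The only change from the one-period case is that the single factor `alpha` in
the minimization is replaced by `alpha**k`, the factor belonging to the arc.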


REVIEW EXERCISES 

1 Discuss the sense in which the numerical illustrations in Fig. 11.1, the timber harvesting 
model for an unbounded horizon as described in Sec. 11.3, and the infinite stage
regeneration model as described in Sec. 11.4, assume stationarity. 

2 Section 11.1 states that in comparing two strategies that operate over an unbounded 
horizon, there is no problem in determining their relative merit if one accumulates 
more returns than the other for every horizon length. Do you agree? Why? (Are there 
any implicit assumptions in the text’s assertion ?) 

3 Verify in Fig. 11.1 that 

(a) D gives a better cumulative return than B for every even-numbered period. 

(b) B is “ahead” of A, C, and D at every Period 3 + 6n, for n = 0, 1, 2, ....

*(c) The general terms in the sequences for the average return per period are those 
given by (i) through (v) in the advanced material at the beginning of Sec. 11.2. 
*(d) Find a value n* such that for every Period n, where n ≥ n*, the average return per
period is within 2 ± .01 for Policy A. Do the same for Policy B. For Policy C. 
For Policy D. Similarly find a value for n* associated with ensuring that the 
average return per period is within 1^ ± .01 for Policy G. How does the value of 
n* change if the tolerance is .001 instead of .01 for Policy A? 

4 Consider a choice between Policy A in Fig. 11.1 and a policy with returns identical
to those in Policy A except that in Period 1 the return from the alternative policy is
100. Suggest one or more general rules you might apply to a choice between policies
where both alternatives give the same average return per period over an unbounded
horizon. Would your rules always select a unique policy from among alternatives
having the same average return per period over an unbounded horizon? Justify your
answer.

5 Calculate the average return per period for Policy E in Fig. 11.1. Do you agree with 
the argument in the text that it is plausible to eliminate Policy E from further
consideration, given that you can select Policy D? Suppose Policy D were not available for
selection. Would you still eliminate Policy E? Explain the reasoning behind your 
answers. 

6 In each part below, assume that the specified policy is added to Fig. 11.1. State whether 
the policy dominates or is dominated by any policies now listed in the table, and 
compute the policy’s average return per period. 

(a) 3, 4, −1, 3, 4, −1, ....

(b) 6, −1, 1, 6, −1, 1, ....

(c) 0, 0, 8, 0, 0, 0, 8, 0, ....

(d) 0, 8, 0, 0, 0, 8, 0, 0, ....

(e) 4, 0, 4, 0, 4, 0, ....

(f) 0, 4, 0, 4, 0, 4, ....

(g) 2, 21,2,2,2,...,2,.... 

(h) 2 + 2 + 2 + 2 + f S 2 + 2 + , where $\epsilon$ is a constant in the
interval $0 < \epsilon < 1$. (Hint: your answer should depend on the specific value for $\epsilon$.)

*7 For each part in exercise 6, derive general terms in the sequence for the average return
per period, analogous to those given by (i) through (v) in the advanced material at the 
beginning of Sec. 11.2. 

8 Consider the finite stream of returns /?i = 4, /?2 = ^2, = 24, R^ = 40; suppose 

that the discount factor o' = J, and assume that you can borrow or lend as much 
money as you desire at the associated compound rate of interest. 

(a) What is the present value of this stream? What is the associated compound rate 
of interest? 

(b) If you wanted to redistribute the returns so as to have all the returns in Period 1 
and none in other periods, how large an amount could you obtain? (Indicate how 
much you borrow or lend in each period, when these amounts must be repaid, 
and the associated amount of interest.) 

(c) Answer the question in part (b), but assume you want to have all the returns in 
Period 2. In Period 3. In Period 4. 

(d) Explain in detail why the stream $S_1 = 2$, $S_2 = 16$, $S_3 = 20$, $S_4 = 56$ is better
than the stream $R_j$.

(e) Explain in detail why the stream $T_1 = 4$, $T_2 = 8$, $T_3 = 24$, $T_4 = 48$ is worse.

*9 Consider the return streams in exercise 8. At what value of $\alpha$ is the stream containing
$R_j$ as good as the stream containing $S_j$? At what value of $\alpha$ is the stream containing $R_j$
as good as the stream containing $T_j$? At what value of $\alpha$ are the streams containing
$S_j$ and $T_j$ equally good?

10 Derive the present-value formula (5) in Sec. 11.2, and perform all the intermediate 
algebraic steps to obtain the results shown in (6) through (12). 

11 Determine expressions [analogous to those shown in (6) through (12) of Sec. 11.2] 
for the present value of the streams in exercise 6. 

12 (a) Perform the intermediate algebraic steps to obtain the results shown in (13)
through (17) of Sec. 11.2.

(b) What do you conclude about the relative merit of Policy A versus Policies C and 
D ? Verify that Policy B is more advantageous than Policy D when ct < i, and is 
less advantageous when ot > 

13 In each part below, determine expressions analogous to those shown in (13) through 
(17) of Sec. 11.2 for the indicated streams in Fig. 11.1 and the indicated part in 
exercise 6. State a range of values for $\alpha$ over which each of the policies is preferable.

(a) Policy B and Policy E. 

(b) Policy B and Policy C.

(c) Policy C and Policy D.

(d) Policy A and the policy in part (a). 

(e) Policy B and the policy in part (a). 

(f) Policy C and the policy in part (a). 

(g) Policy C and the policy in part (b).

(h) Policy A and the policy in part (c). 

(i) Policy D and the policy in part (h). 

*14 Verify the results (i) through (iv) in the advanced material following (16) in Sec. 11.2. 

15 (a) Perform the intermediate algebraic steps to obtain the results for Policy B* shown 

in (18) and (19) of Sec. 11.2. 

(b) Derive the corresponding result for Policy A*, and verify that this present value
is finite for $0 \le \alpha < 1$.

*(c) Determine an expression analogous to those shown in (13) through (17) of Sec.
11.2 for Policies A* and B*. State a range of values for $\alpha$ over which each of these
policies is preferable to the other.

*(d) Derive the present value for Policy C*, defined as

1, 6, −1, 1, 6, −1, 2, 12, −2, 2, 12, −2, 3, 18, −3, 3, 18, −3, ....

Answer the question in part (c) but for Policies B* and C*.

*(e) Derive the present value for Policy D*, defined as

2|, 2,2,2,2,2, 4,4,4,4,4, 4, 12, 12,12, 12, 12, 12,.... 

Answer the question in part (c) but for Policies B* and D*. 

16 Verify the results in (22) of Sec. 11.2. Show why equivalent average return is not well
defined for $\alpha = 1$ for Policy B*.

17 Determine the equivalent average return for each of the streams in exercise 6. Calculate
the value for $\alpha = 1$.

18 The text states in Sec. 11.2 that although the criterion of equivalent average return
ranks Policies A, B, C, and D in Fig. 11.1 as being equally desirable for $\alpha = 1$, a
resolution must rest on the decision-maker’s personal opinion and not on scientific
fact. The text then goes on to argue that a case can be made for saying that, for $\alpha = 1$,
Policy A is optimal, Policies C and D are nearly optimal, and Policy B should be
eliminated. Discuss the issues involved and whether you agree with the argument
offered. Give reasons for your position.

19 (a) In Sec. 11.3, the text assumes that the sequence of satisfies the inequalities 

in (2). Discuss what complications arise if this simplifying assumption is dropped. 

(b) Show that the inequalities in (2) imply that an optimal value for K can be 
determined from (3). When there are alternative optimal policies, does (3) always 
lead to a unique value for K? Justify your answer. 

(c) Interpret (2) and the stopping criterion (3) for $\alpha = 1$.

20 (a) Draw a graph for (4) in Sec. 11.3, where a = .75 and h = .5. 

(b) Perform the intermediate algebraic steps to verify that when $N_k$ is given by (4),
the stopping criterion (3) simplifies to (5).

*(c) Construct an example of a sequence for $N_k$ such that, given a value for the
discount factor $\alpha$, where $0 < \alpha < 1$, there is no finite value for K that is optimal.

(d) Given the numerical example (6) in Sec. 11.3, verify that K = 4 satisfies (5).

(e) Verify the present values in (7) for k = 1 and 2.

21 Consider the timber harvesting model in Sec. 11.3. Suppose 

Nil a kb where a -f- ^ ^ 0 and ^ > 0. 

(a) Determine a stopping criterion analogous to that implied by (5). 

(b) Let $\alpha$ = §, a = 3, and b = 9. What is an optimal value for K? Calculate the
present value for k = 1, 2, 3, 4.

22 Consider the timber harvesting model in Sec. 11.3. Suppose 

iVjt = a 4- k!^b where a ^ ^ 0 and 6 > 0. 

(a) Determine a stopping criterion analogous to that implied by (5). 

(b) Let $\alpha$ = I, a = 3, and b = 9. What is an optimal value for K?

(c) Calculate the present value for k= 1, 2,..., iC Ij where K is the 

optimal value you found in part (b). 

23 (a) In Sec. 11.3, the text assumes that the sequence of P.V.[k]/(1 − $\alpha^k$) satisfies the
inequalities in (14). Discuss what complications arise if this simplifying assumption
is dropped.

(b) Show how the inequalities in (14) imply that an optimal value for k' can be
determined from (15). If there are alternative optimal policies, does (15) always
lead to a unique value for k'? Justify your answer.

(c) Explain why the ratio on the right-hand side of (15) is larger than NidN^^i for
each k.

(d) Verify that when $\alpha = 1$ in (15), the inequalities can be written as (17).

24 Consider the numerical example (6) in Sec. 11.3.

(a) Use (15) to verify that k' = 2 is an optimal policy for an unbounded horizon.

(b) Verify the values of F(1) and F(2) in (18).

25 Consider the data in exercise 21. Find an optimal policy, k', for an unbounded horizon.
Calculate F(k' − 1), F(k'), and F(k' + 1). What is the loss in present value from using
the optimal single decision policy, K, instead of k'?

26 Consider the data in exercise 22. Find an optimal policy, k', for an unbounded horizon.
Calculate F(k' − 1), F(k'), and F(k' + 1). What is the loss in present value from using
the optimal single decision policy, K, instead of k'?

27 Consider the infinite stage regeneration model in Sec. 11.4.

(a) Assume all $R_k > 0$. Explain why there is no finite value for f that satisfies (5) for
$\alpha = 1$.

(b) Assume all $R_k = 0$. Explain why any finite value for f satisfies (5) for $\alpha = 1$.

(c) Explain why (5) implies the inequalities in (6), and why they in turn imply (7).

(d) Show the intermediate algebraic steps to obtain the equality on the right-hand
side of (8).

(e) Show the intermediate algebraic steps to derive (9), and verify that (9) becomes
(10) for $\alpha = 1$.

(f) Show how to alter (6), (7), (9), and (10) if the optimization indicates “maximum”
in (5).

*28 Consider the inventory model with concave costs, discussed in the special material at
the end of Sec. 11.4.

(a) Show that k in (vi) minimizes the cost expression in (v).

(b) Derive the formula for $R_k(\alpha)$, when there is a discount factor $\alpha$, where $0 < \alpha < 1$,
and each period’s costs are discounted. How will the optimal lot-size quantity
depend on the value of $\alpha$? How, if at all, will this quantity depend on the value
of the purchase cost c? Explain in detail.

29 Consider the Regeneration Model Horizon Theorem in Sec. 11.5. Assuming that 
K = 5 and n* = 8, give a verbal interpretation of 

(a) (3) for ^20 = 4. For kzo = 4. For kzo = 6. 

(b) (4) for n = 10. For n = 20. 

(c) If Policy 12 is optimal for all n > 22, does k^Q = 12? Is Policy 12 optimal over an 
unbounded horizon? [Justify your answers with reference to (3) and (4).] 

*(d) If Policy 9 is optimal over an unbounded horizon and k^ = 9, is Policy 9 optimal
for all n > 3?

30 In each part below, apply the method of “finite horizon in the limit,” and find the 
smallest n such that the policy that is optimal for an unbounded horizon is also optimal 
for finite horizons of length n, n + 1, and n + 2. (Remember, the objective is to 
“maximize.”) 

(a) Timber harvesting illustration (6) and (18) in Sec. 11.3, where $R_k$ = P.V.[k].

(b) Problem in exercises 21 and 25. 

(c) Problem in exercises 22 and 26. 

31 Consider the numerical example (2) used to illustrate the value iteration algorithm 
(1) in Sec. 11.6, 

(a) Check the individual terms in (4), and calculate the values inside the brackets 
to verify that k = \ yields a minimum. 

(b) Calculate/*®, given the value for/*® in (5), and verify that k = 3, 

(c) Let $f^0 = 20$ and perform the value iteration algorithm. Stop the iterations at n = 3.

(d) Let $f^0 = 35$ and perform the value iteration algorithm. Stop the iterations at
n = 3.

*(e) Let Ri = 7. Perform the value iteration algorithm with $f^0 = 0$. At what value of
n does k = 3 for the first time? What is the corresponding value $f^n$? How does it
compare with the value in (5)?

*(f) Let Ri = 10. Perform the value iteration algorithm with $f^0 = 0$. At what value
of n does k = 3 for the first time? What is the corresponding value $f^n$? How does
it compare with the value in (5)?

*(g) Let Ri = 17.9. Perform the value iteration algorithm with $f^0 = 43.50$. At what
value of n does k = 3 for the first time? What is the corresponding value $f^n$?
How does it compare with the value in (7)?

*(h) Suppose at every iteration n ≥ n' the same optimal infinite horizon policy is
selected. Derive a formula for $f^n$, given the value of $f^{n'}$.

32 Graphical Representation of Value Iteration. For each k, graph the line $y = \alpha^k f + R_k$,
where y is the ordinate value and f the abscissa value. For each value of f, one (or
more) of these lines is “lowest.” Mark in red pencil (for example) the piecewise linear
function consisting of all these “lowest” segments. Next graph the line y = f. The
intersection of this line and the piecewise linear function gives the solution value for f.
The value iteration technique starts with $f^0$. Draw a vertical line at $f^0$, and select a k
corresponding to the piecewise linear function. The associated value on this function
is the next trial $f^1$ and can be indicated by drawing a horizontal line from the point on
the piecewise linear function to the line y = f. As the process repeats itself, the trial
points move ever closer to the intersection of the line y = f and the piecewise linear
function.

In each part below, draw the line for each k, mark the piecewise linear function,
draw the line y = f, show the solution, and display how the value iteration technique
progresses when $f^0 = 0$ and when $f^0 = R_1/(1 - \alpha)$. [Only show the iterative technique
graphically and do not perform any of the underlying computations. Remember
in parts (b), (c), and (d), the objective is to “maximize.”]

(a) Numerical example (2) in Sec. 11.6. 

(b) Timber harvesting illustration (6) and (8) in Sec. 11.3. 

(c) Problem in exercises 21 and 25. 

(d) Problem in exercises 22 and 26. 

33 In each part below, perform the value iteration technique in Sec. 11.7. Letting n'
denote the first iteration at which the corresponding k is optimal for the unbounded
horizon, terminate the calculations at n' + 2. First use $f^0 = 0$ as a starting value.
Then use $f^0 = R_1/(1 - \alpha)$. (Remember the objective is to “maximize.”)

(a) Timber harvesting illustration (6) and (8) in Sec. 11.3, where $R_k$ = P.V.[k].

(b) Problem in exercises 21 and 25.

(c) Problem in exercises 22 and 26.

(d) In each part, describe a strategy that actually yields the value $f^{n'+2}$. Is this policy
stationary? (Note: such a strategy may not exist.)

*(e) In each part, verify that $|f^n - f| \le \alpha^n |f^0 - f|$ for n = 1, 2, ..., n' + 2.

34 In each part below, perform the policy iteration algorithm of Sec. 11.7. First use k = 1
as the initial policy at n = 0. Then use K + 1 as the initial policy, where K is an
optimal policy for an unbounded horizon.

(a) Timber harvesting illustration (6) and (8) in Sec. 11.3, where $R_k$ = P.V.[k].

(b) Problem in exercises 21 and 25. 

(c) Problem in exercises 22 and 26. 

35 Consider employing the criterion of average return per period in the policy iteration
algorithm of Sec. 11.7.

(a) Demonstrate that (5) corresponds to (1) when expressed in terms of equivalent 
average return. 

(b) Similarly, demonstrate that (6) corresponds to the test quantity (2). 

(c) Let g(k) be the expression in the brackets of (6). Let $a = (1 - \alpha)^{-1}$, $b =
-g^n (1 - \alpha)^{-1}$. Form the expression a g(k) + b and show the algebraic
manipulations yielding the simplified quantity on the left-hand side of (7).

(d) Give a verbal interpretation of the test criterion in (7) for $\alpha = 1$.

36 Use the policy iteration algorithm of Sec. 11.7 to find an optimal solution to the timber
harvesting illustration (6) in Sec. 11.3, for $\alpha = 1$. Let k = 1 be the initial policy.
(Remember the objective is to “maximize.”)

37 Consider the shortest-route problem in Sec. 11.9 and give a verbal interpretation of (2)
and (3).

38 Apply the value iteration algorithm in Sec. 11.9 to the example shown in Fig. 11.5
with $\alpha = 1$ and $y_i^0 = 0$.

(a) Show all the detailed computations, and verify the results in Fig. 11.6.

(b) Show the best path of n arcs from Node 2 to the terminal for $n \le 6$. Do the same
from Node 7.

39 Apply the value iteration algorithm in Sec. 11.9 to the example shown in Fig. 11.5
with $\alpha = 1$ and $y_i^0$ corresponding to the arcs (8, 7), (7, 3), (6, 4), (5, 4), (4, 1), (3, 4),
and (2, 3).

(a) Verify the values for $y_i^0$ are those shown in Fig. 11.7.

(b) Show all the detailed computations, and verify the results given in Fig. 11.7.
[Carry out the successive approximation process (5) on Fig. 11.5.]

40 Apply the value iteration algorithm in Sec. 11.9 to the example shown in Fig. 11.5
with $\alpha = 1$ and $y_i^0 = 10$ for every i ($i \ne r$).

41 Apply the value iteration algorithm in Sec. 11.9 to the example shown in Fig. 11.5
with $\alpha = 1$, and $y_i^0$ corresponding to the arcs (8, 5), (7, 3), (6, 4), (5, 1), (4, 1), (3, 1),
and (2, 1).

*42 In each part below, apply the value iteration algorithm in Sec. 11.9 to the example
shown in Fig. 11.5 with $\alpha = .8$. Stop the iterations at n = 9. Let $y_i^0$

(a) = 0.

(b) be calculated from the policy in exercise 39.

(c) be calculated from the policy in exercise 41.

43 Apply the policy iteration algorithm in Sec. 11.9 using the data in exercise 41.

*44 Apply the policy iteration algorithm in Sec. 11.9 using the data in exercise 42.

(a) Part (b).

(b) Part (c).

45 Explain your understanding of the following terms:

unbounded horizon (limitless vista)
stationarity
stationary policy (or strategy)
infinite stream of returns
utility of money
average return per period
interest rate and compound interest
discount factor
present discounted value (or present worth)
equivalent average return
nearly optimal
regeneration model
regeneration period (or point)
functional (or extremal) equation
successive approximation
value iteration (successive approximation in function space)
policy iteration (successive approximation in policy space)
monotonically increasing (decreasing) sequence
convergence from above (below).


COMPUTATIONAL EXERCISES 

46 Equipment Replacement Problem. Consider the case of the Goode-O’Toole Company, 
described in exercise 41 of Chap. 6. Suppose a replacement policy is desired for an 
unbounded horizon. Assume the cost of a new model in any period is $p$, the number of 
policies $N = 7$, and the salvage values and operating costs are as given in exercise 41 
of Chap. 6. In parts (a) and (b) below, find an optimal replacement policy for $\alpha = 1$. 

(a) $p = 100$. 

(b) $p = 90$. 

(c) Find the largest and smallest value of $p$ such that $k = 3$ is optimal for $\alpha = 1$. 

(d) Suppose $\alpha < 1$ and that purchase and operating costs occur at the “beginning” 
of a period and salvage value is obtained at the “end” of a period. Hence, if a 
drill press is purchased at the beginning of the current period and kept for three 
periods, the resultant present value over these three periods is 
$p + r_1 + \alpha r_2 + \alpha^2 r_3 - \alpha^3 v_3$. For parts (a) and (b) above, determine the smallest value of $\alpha$ such 
that the policy for $\alpha = 1$ remains optimal, and calculate the corresponding 
interest rate. 

(e) Does increasing both the purchase cost to $p + d$ and each salvage value to $v_t + d$ 
yield the same optimal policy? Explain and be sure to make explicit all your 
assumptions. 

47 A commonly employed method of numerically obtaining the square root of a positive 
number $A$ is the following technique of successive approximations (suggested by 
Newton): $f^n = .5\,[f^{n-1} + (A/f^{n-1})]$. Give a graphical description of this technique. 
Indicate how the sequence of $f^n$ approaches the correct value $f = \sqrt{A}$ starting with 
any initial $f^0 > 0$. Also show that $f^n$ converges to $f$ from above (for $n \ge 1$). 

48 Consider the infinite stage regeneration model in Sec. 11.4 with $N = 5$ and $R_1 = 5$, 
$R_2 = 8$, $R_3 = 10$, $R_4 = 12$, and $R_5 = 18$. 

(a) Determine an optimal policy for every value of $\alpha$ in the interval $0 \le \alpha \le 1$. 

(b) Let $\alpha = .9$. Apply the “finite horizon in the limit” approach in Sec. 11.5. Terminate 
the iterations after you select the same policy for five iterations. 

(c) Let $\alpha = .9$. Apply the value iteration algorithm in Sec. 11.6 with $f^0 = 0$. Do the 
same with $f^0 = R_1/(1 - \alpha)$. Letting $n'$ denote the first iteration at which the 
corresponding $k$ is optimal for the unbounded horizon, terminate the calculations 
at $n' + 2$. 

*(d) Using the graphical representation described in exercise 32, plot the iterations in 
part (c). 

(e) Let $\alpha = .9$. Apply the policy iteration algorithm in Sec. 11.7 with $k = 1$ as the 
initial guess. 

(f) Let $\alpha = 1$. Apply the policy iteration algorithm in Sec. 11.7 with $k = 1$ as the 
initial guess. 


49 For each exercise in Chap. 7 listed below, apply the value iteration algorithm for a 
shortest-route problem as given in Sec. 11.9, with $\alpha = 1$ and $y_i^0 = 0$. 


(a) Exercise 27, part (a). 

(b) Exercise 28, part (a). 

(c) Exercise 60, part (a). 

(d) Exercise 60, part (b). 

(e) Exercise 61, part (a). 

(f) Exercise 61, part (b). 


(g) Exercise 62, part (a). 

(h) Exercise 62, part (b). 

(i) Exercise 63, part (a). 

(j) Exercise 63, part (b). 

(k) Exercise 65, part (a). 


50 For each exercise in Chap. 7 listed below, first apply the value iteration algorithm for 
a shortest-route problem, as given in Sec. 11.9, with $\alpha = 1$ and $y_i^0$ calculated from the 
indicated initial set of arcs. Then apply the policy iteration algorithm, as given in Sec. 
11.9, starting with the same initial set of arcs. 

(a) Exercise 27, part (a), with arcs (2, 1), (3, 1), (4, 1), (5, 1), (6, 4), (7, 6), and 
(8, 5). 

(b) Exercise 28, part (a), with arcs (1, 4), (3, 2), (4, 3), (5, 1), (6, 5), (7, 3), and 
(8, 6). 

(c) Exercise 60, part (a), with arcs (2, 1), (3, 2), (4, 3), (5, 4), and (6, 5). 

(d) Exercise 60, part (b), with arcs (1, 2), (3, 2), (4, 3), (5, 4), and (6, 5). 

(e) Exercise 61, part (a), with arcs (2, 1), (3, 2), (4, 3), (5, 4), and (6, 5). 

(f) Exercise 61, part (b), with arcs (1, 2), (3, 2), (4, 3), (5, 4), and (6, 5). 

(g) Exercise 62, part (a), with arcs (2, 1), (3, 2), (4, 1), (5, 4), and (6, 5). 

(h) Exercise 62, part (b), with arcs (1, 2), (2, 3), (4, 1), (5, 4), and (6, 5). 

(i) Exercise 63, part (a), with arcs (2, 1), (3, 2), (4, 3), (5, 4), and (6, 5). 

(j) Exercise 63, part (b), with arcs (1,2), (3, 2), (4, 3), (5, 4), and (6, 5). 

(k) Exercise 65, part (a), with arcs (2, 1), (3, 2), (4, 5), (5, 6), and (6, 3). 


MIND-EXPANDING EXERCISES 

Exercises 51 through 56 refer to the regeneration model in Sec. 11.4. 

51 (a) Show that $f$ determined by (5) is the maximal discounted return among all (not 
necessarily stationary) policies. 

(b) Show that a stationary policy is optimal among all policies. (Hint: you may wish 
to prove and use the result in exercise 55.) 

52 Prove the Regeneration Model Horizon Theorem in Sec. 11.5. 

*53 Knapsack Problem. Show the connection between the Regeneration Model Horizon 
Theorem in Sec. 11.5 and the proposition about the Knapsack Problem described in 
exercise 47 of Chap. 10. 

54 Consider the value iteration algorithm in Sec. 11.6. 

(a) Show that if all 72* > 0, then >/" at every iteration n if/® = 0. 

(b) Show that if/° = Rkl(\ — o'*) for a selected k, then ^/” at every iteration n, 

(c) Show that if</”, then /” < (Do the same for both inequalities in 
the reverse direction.) 

(d) Show that if Policy $k$ is selected at iteration $n - 1$ but is discarded at iteration $n$, 
Policy $k$ will never be selected again. 

55 Consider the policy iteration algorithm in Sec. 11.7. Suppose you have any strategy 
(not necessarily a stationary one) and an associated trial value/”. Show that if Policy k' 
satisfies the test 0^*/” + R^f < /”, then using Policy k' as a stationary strategy over an 
unbounded horizon yields an associated value /”■*■* </”• Give a verbal interpretation 
of this computational test. 

56 Graphical Representation of Policy Iteration. Devise a graphical approach, analogous to 
that described in exercise 32, to display the computational process of policy iteration. 
Apply your approach to the numerical example (3) in Sec. 11.7. Would any other 
starting policy ($k \ne 1$) have required more iterations? 

57 Consider the shortest-route problem as characterized by the extremal equations (4) in 
Sec. 11.9. 

(a) Suggest a direct algorithm (that is, a method that computes the correct value for 
each $y_i$ once and for all) when the network is acyclic. (Hint: modify the approach 
in Sec. 7.7.) 

(b) Apply your method to Mark Off’s routing problem in Fig. 8.1, letting $\alpha = 1$ 
(for simplicity). 

58 Consider the policy iteration algorithm for solving the shortest-route problem in 
Sec. 11.9. 

(a) Show that when $\alpha = 1$, at every iteration the trial set of arcs in Step 2 has the 
property that each node is connected by a path to the terminal. 

(b) Construct an example that shows the property in part (a) need not hold when 
$\alpha < 1$. Suggest a way to calculate $y_i^n$ when the trial set of arcs contains loops; 
give a verbal interpretation to the values for $y_i^n$ that are determined by your 
method. (Be certain your approach ensures that the algorithm eventually finds 
shortest routes.) 

CHAPTER 12 


Optimization Methods for 
An Unbounded Horizon 


12.1 DISCRETE DYNAMIC PROGRAMMING 

You are now ready to integrate the various ideas you have learned about optimization 
in an unbounded horizon and solution by successive approximations. 
You should study this chapter with two purposes in mind: 

(i) To discover how to characterize mathematically an optimal solution to a 
model with an unbounded horizon. 

(ii) To see how such a solution can be obtained by techniques of successive 
approximation. 

It is less important that you master the details of the numerical techniques. 
Furthermore, in real applications you would not manually calculate answers but 
would rely on electronic computers. To make effective managerial use of these 
answers, however, you must be aware of the models’ underpinnings so as to realize 
the true applicability of any particular numerical solution. 

Inventory model. We begin with a concrete example that will illustrate 
the appearance of a dynamic programming optimization recursion appropriate 
to an unbounded horizon. Specifically, we consider a stationary inventory model 
that closely resembles the one you previously studied in Sec. 8.3. We use notation 
here to reflect the assumption of stationarity. The decision variable is 

$x$ = production quantity, 

which we require to be a nonnegative integer. Let the associated production cost 
function in each period be $C(x)$, and the holding cost function be $h(j)$, where $j$ 
is the inventory level at the end of the period. Assume that the demand requirement 
is the same every period, namely $d$ (a positive integer), and that all demand 
must be met on time. As usual, let $\alpha$ be the one-period discount factor, where 
$0 \le \alpha < 1$. 

Define 

$f_n(i)$ = present value of an optimum production policy when 
entering inventory is $i$ and $n$ periods remain. 

Then the appropriate dynamic programming recursion for a finite horizon model is 

(1) $f_n(i) = \min_{x}\,[C(x) + h(i + x - d) + \alpha f_{n-1}(i + x - d)]$, 

where the minimization is taken over nonnegative integer values $x \ge d - i$. 
A plausible infinite stage analogue of (1) is 

(2) $f(i) = \min_{x}\,[C(x) + h(i + x - d) + \alpha f(i + x - d)]$ $(0 \le \alpha < 1)$. 

Here $f(i)$ is interpreted as the present value of an optimal policy over an unbounded 
horizon, given the inventory entering the current period is $i$. 

Notice several points of difference between (1) and (2). The most important is 
that the same function $f(i)$ appears on both sides of the relation (2). For this reason 
(2) is called a functional or extremal equation (actually, a set of 
equations, one for each possible value of the state variable, entering inventory $i$). 
Hence (2) deals with an entire function $f(i)$ of the state variable. As you should 
recall from the discussion of the simple functional equation for the regeneration 
model in Sec. 11.4, you must determine whether the equations in (2) have a unique 
finite solution. 

You could find out by analyzing this specific model. But you can make further 
headway by analyzing the extremal equations of a more general model for which 
the inventory example is merely a special case (after imposing a few additional 
mild restrictions). In preparation for this general model, observe how (2) can be 
characterized in terms of a network optimization problem. 

Assume the number of possible levels for entering inventory is finite, and let 
each node of the network correspond to one of these levels. Given a Node $i$, 
the decision to produce $x$ can be indicated by drawing an arc from Node $i$ to 
Node $i + x - d$. The associated arc cost is $C(x) + h(i + x - d)$. Substitute the 
symbol $y_i$ for $f(i)$ in (2), and so $y_i$ is a present value associated with Node $i$. The 
network now closely resembles that in Sec. 11.9, except here there is no terminal, 
so the entire path implied by the value of $y_i$ is really endless. 
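The construction just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `inventory_arcs`, the cap `max_inv` on entering inventory, and the particular cost functions and demand are hypothetical choices made here, not data from the text.

```python
def inventory_arcs(d, max_inv, C, h):
    """Return {(i, j): cost} for the inventory network.

    Node i is the entering inventory level; producing x units leads to
    Node j = i + x - d, at arc cost C(x) + h(i + x - d).
    """
    arcs = {}
    for i in range(max_inv + 1):
        for j in range(max_inv + 1):
            x = j - i + d            # production needed to end the period at level j
            if x >= 0:               # production must be a nonnegative integer
                arcs[(i, j)] = C(x) + h(j)
    return arcs


# Hypothetical data: demand of 2 per period, a setup cost of 5 plus 2 per unit
# produced, and a holding cost of 1 per unit of ending inventory.
def production_cost(x):
    return 0 if x == 0 else 5 + 2 * x


def holding_cost(j):
    return 1 * j


arcs = inventory_arcs(d=2, max_inv=3, C=production_cost, h=holding_cost)
print(arcs[(0, 0)])   # cost of producing exactly the demand from an empty stock
```

Every node has at least one arc leading out, and no node plays the role of a terminal, which is the feature the paragraph above points to.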

Network characterization. Consider a network comprised of a set 
of $p$ nodes and a set of directed arcs connecting some of the nodes. Assume that 
every node has at least one arc leading out. In other words, assume it is possible 
to go from any node to at least one other node. Each arc $(i, j)$ has an associated 
return or cost $c_{ij}$ and a traversal time of one period. The discount factor is designated 
$\alpha$ as before. 

Imagine starting a path at any arbitrarily selected Node $i$. Suppose you travel 
from Node $i$ to Node $j$ and thereby incur a cost $c_{ij}$. When you get to Node $j$, 
suppose you then go on to Node $k$, which adds $\alpha c_{jk}$ to the discounted cost. If you 
continue the process indefinitely, the path never ends. More and more costs are 
added, but with higher and higher powers of the discount factor $\alpha < 1$. Let $y_i$ 
denote the present value of an optimal unending path starting at Node $i$. 
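As a worked illustration of why such a present value is finite, consider a path that visits Nodes $i_0, i_1, i_2, \ldots$ in turn. The node labels $i_t$ and the bound $M$ on the absolute arc costs are notation introduced here for the illustration, not symbols from the text.

```latex
\[
  c_{i_0 i_1} + \alpha\, c_{i_1 i_2} + \alpha^2 c_{i_2 i_3} + \cdots
  = \sum_{t=0}^{\infty} \alpha^{t} c_{i_t i_{t+1}},
  \qquad
  \Bigl|\sum_{t=0}^{\infty} \alpha^{t} c_{i_t i_{t+1}}\Bigr|
  \le \sum_{t=0}^{\infty} \alpha^{t} M = \frac{M}{1-\alpha}
  \quad (0 \le \alpha < 1).
\]
```

Since the network has finitely many arcs, such a bound $M$ always exists, so every unending path has a finite present value when $\alpha < 1$.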

If you adopt a stationary policy, then every time you come back to a node, you 
take the same arc out that you did on the previous visit. If a stationary policy 
exists that is optimal, then the corresponding $y_i$ satisfy the functional equations 

(3) $y_i = \min_{(i,j)\ \text{in network}} (\alpha y_j + c_{ij})$ for each Node $i$ $(0 \le \alpha < 1)$. 

(The problem of finding a path that minimizes average return per period is treated 
in Sec. 12.2.) Notice that no node is distinguished as a terminal. In this model, 
it is convenient to allow arcs $(i, i)$, because an optimal path in an unbounded 
horizon may lead from Node $i$ back to itself repeatedly. (For example, the network 
corresponding to the infinite stage regeneration model of Sec. 11.4 has only a single 
node and $N$ arcs leading out of and back into this node.) 

In stating the extremal equations (3), we assumed the existence of an optimal 
stationary policy and the corresponding $y_i$. To be able to use (3) for all actual 
applications, however, you must know the answers to the questions: 

(i) Can the extremal equations (3) always be solved by unique finite values 
for every $y_i$? 

(ii) If so, is a corresponding stationary strategy truly optimal? 

In other words, before applying numerical techniques, you must ascertain whether 
the functional equations in (3) invariably have a solution. When they do, an arc 
$(i, j)$ giving the minimum for Node $i$ is a component of a stationary policy 
or strategy: traverse $(i, j)$ whenever you are at Node $i$. Since there is only a 
finite number of nodes, eventually you will be repeatedly coursing a loop of arcs. 
The particular cycle or circulation can depend on where you are at the 
initial period. The second question, then, is whether it is impossible to do better 
by another stationary strategy or by a nonstationary strategy, that is, one that 
keeps track of the previous events that lead you to Node $i$. 

Both (i) and (ii) have affirmative answers. It can be rigorously demonstrated 
that: 

Stationary Policy Theorem. There always exist unique finite $y_i$ that satisfy 
the extremal equations (3), and a corresponding stationary policy is optimal 
among all possible policies. 

The existence of a stationary policy that is truly optimal is also demonstrable if 
the optimality criterion is minimum equivalent average return per period for 
$\alpha = 1$. However, (3) is not the appropriate set of functional equations; this case 
is taken up in the next section. 

With almost trivial modifications, the model can be generalized to include the 
situation in which more than a single period is needed to traverse an arc. The 
necessary modification in the notation makes the formulas look complicated (an 
$\alpha_{ij}$ would appear instead of $\alpha$ in almost all the formulas). Rather than clutter the 
expressions with extra subscripts, we treat this more general case only in the 
advanced material. 

► At the end of Sec. 10.7, we gave a fairly general canonical form for a finite horizon 
model. If we now include a discount factor $\alpha$, the recursion is 

(i) $f_n(s) = \operatorname{optimum}_{d_n\ \text{in}\ D_n(s)} \{R_n(s, d_n) + \alpha f_{n-1}[T_n(s, d_n)]\}$ for every $s$ in $S_n$, and $f_0(s) \equiv 0$, 

where $s$ is the state of the system, $S_n$ is the set of all possible states at Stage $n$, $d_n$ is the 
decision at Stage $n$, $D_n(s)$ is the set of all the feasible values for $d_n$, $R_n(s, d_n)$ is the immediate 
economic return of decision $d_n$ given that the system is in State $s$, and $T_n(s, d_n)$ is 
the transformed state of the system at the next stage. 

Assume that the structure of the model is stationary, so that the terms in (i) can be 
simplified to $S$, $D(s)$, $R(s, d)$, and $T(s, d)$ for every $n$. Further suppose there is an optimal 
strategy that is stationary. Then a natural infinite stage analogue of (i) is 

(ii) $f(s) = \operatorname{optimum}_{d\ \text{in}\ D(s)} \{R(s, d) + \alpha f[T(s, d)]\}$ for every $s$ in $S$ $(0 \le \alpha < 1)$. 

We also postulate that the system can be in only one of a finite number of states, which 
for convenience can be numbered $s = 1, 2, \ldots, p$, and similarly, that $D(s)$ contains 
only a finite number of decisions. Given these two structural assumptions, we refer to 
the model (ii) as a discrete dynamic programming problem. 

There is a unique correspondence between the network extremal equations (3) and 
formulation (ii) provided that $T(s, d) = T(s, d')$ only if $d = d'$. Each State $s$ corresponds 
to a Node $s$, and $f(s) = y_s$. The decision $d$ represents an arc from Node $s$ to Node $T(s, d)$, 
and $R(s, d)$ is the associated arc cost. 

Consider a network in which each arc $(i, j)$ has an associated traversal time of $t_{ij}$ 
periods, and all $t_{ij}$ do not necessarily equal 1. Let the corresponding discount factor 
be defined as 

(iii) $\alpha_{ij} = \alpha^{t_{ij}}$. 

Then instead of (3), the extremal equations for this more general problem are written as 

(iv) $y_i = \min_{(i,j)\ \text{in network}} (\alpha_{ij} y_j + c_{ij})$ for each Node $i$ $(0 \le \alpha < 1)$. ◄ 

12.2 SUCCESSIVE APPROXIMATIONS 

We discussed many of the important ideas motivating the several techniques 
of successive approximations in Sec. 11.5, where we dealt with the regeneration 
model. These comments are equally applicable here, and you may want to quickly 
review them before getting into the details below. 

In brief, we examine two approaches. In the first, we guess at the unknown 
function values and at each iteration improve on the current guess or approximation. 
In the second, we guess at an optimal policy. If our guess leads to a 
solution of the extremal equations, we stop; if not, we investigate an improved 
trial policy. The second method as compared to the first requires more computation 
at each iteration, but always converges in a finite number of steps. 

Value iteration. A solution to the functional equations can be sought by 
successive approximations in function space. Following the fashion 
in Chap. 11, let $y_i^0$ be arbitrarily chosen guesses for $y_i$ and apply the so-called 
value iteration algorithm 

(1) $y_i^{n+1} = \min_{(i,j)\ \text{in network}} (\alpha y_j^n + c_{ij})$ for each $i$ (value iteration). 

Then each $y_i^n$ tends to a limit $y_i$, but as you already saw in the elementary regeneration 
model of Sec. 11.4, convergence of the $y_i^n$ need not occur in a finite number of 
iterations. Furthermore, that a strategy is the same for several iterations of (1) 
does not indicate that it is truly optimal for an unbounded horizon. In fact, even as 
the number of iterations $n$ gets arbitrarily large, there need not be convergence 
in policy, that is, there need not be a single policy that is optimal for all finite 
$n \ge N$, no matter how large the value for $N$. 

By starting with every $y_i^0 = 0$, the process is the same as looking at a certain 
finite horizon model. Specifically, $y_i^n$ represents the present value of an optimal 
path starting from Node $i$ and traversing $n$ arcs. Analogous to the Regeneration 
Model Horizon Theorem of Sec. 11.5, there is an $n^*$ such that for each $n > n^*$, 
any optimal strategy for the current decision in the $n$-period finite horizon model 
is also an optimal stationary strategy for the unbounded horizon model. Further, 
it is possible to calculate $n^*$ given the data $c_{ij}$ and $\alpha$, so that the approach can be 
applied. 
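A minimal Python sketch of value iteration on a network may help fix ideas. The function name `value_iteration`, the dictionary representation of arcs, and the two-node example are assumptions made for illustration; they are not taken from the text.

```python
def value_iteration(arcs, alpha, y0, n_iter):
    """Apply y_i <- min over arcs (i, j) of (alpha * y_j + c_ij), n_iter times.

    arcs maps (i, j) to the arc cost c_ij; y0 maps each node to its initial guess.
    Returns the final trial values and the arc chosen at each node in the last pass.
    """
    nodes = sorted({i for i, _ in arcs})
    y = dict(y0)
    policy = {}
    for _ in range(n_iter):
        new_y = {}
        for i in nodes:
            # Evaluate every arc leaving Node i against the previous trial values.
            candidates = [(alpha * y[j] + c, j) for (a, j), c in arcs.items() if a == i]
            new_y[i], policy[i] = min(candidates)
        y = new_y
    return y, policy


# Hypothetical two-node network: Node 1 may loop on itself or move to Node 2.
arcs = {(1, 1): 2.0, (1, 2): 1.0, (2, 2): 1.0}
y, policy = value_iteration(arcs, alpha=0.5, y0={1: 0.0, 2: 0.0}, n_iter=50)
print(y, policy)
```

In this small example the exact solution of the extremal equations is $y_1 = y_2 = 2$, and the trial values approach it from below without reaching it in a finite number of iterations.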

If all $c_{ij} \ge 0$ and all $y_i^0 = 0$, the $y_i^n$ increase monotonically. Convergence from 
above is obtained for any $c_{ij}$ if you initially select a trial strategy and calculate 
$y_i^0$ accordingly, using the formula (2) below. Then if you arbitrarily stop at iteration 
$m$ and use the immediate optimal policy from the value iteration (1) as a stationary 
strategy in the unbounded horizon model, the corresponding present values 
will be no larger than the $y_i^m$ in (1). 

► The proof that the $y_i^n$ converge to $y_i$ for arbitrary $y_i^0$ is analogous to the argument given 
in Sec. 11.6 for the infinite stage regeneration model. We present the proof for the more 
general network in which the discount factor for arc $(i, j)$ is $\alpha_{ij}$. 

For each Node $i$, let arc $(i, j')$ be optimal for the unbounded horizon case and arc 
$(i, j)$ be optimal at iteration $n$ of (1). Then 

(i) $y_i = \alpha_{ij'} y_{j'} + c_{ij'}$ for each $i$ 

(ii) $y_i^{n+1} \le \alpha_{ij'} y_{j'}^n + c_{ij'}$ for each $i$, 

so that 

(iii) $y_i^{n+1} - y_i \le \alpha_{ij'} (y_{j'}^n - y_{j'}) \le \alpha \Delta_n$, 

where $\Delta_n = \max_k |y_k^n - y_k|$, and every $\alpha_{ij} \le \alpha$ because each arc takes at least one period to traverse. 

Similarly, 

(iv) $y_i^{n+1} = \alpha_{ij} y_j^n + c_{ij}$ 

(v) $y_i \le \alpha_{ij} y_j + c_{ij}$, 

so that 

(vi) $y_i - y_i^{n+1} \le \alpha_{ij} (y_j - y_j^n) \le \alpha \Delta_n$. 

Therefore, 

(vii) $|y_i^{n+1} - y_i| \le \alpha \Delta_n$ for each $i$, and hence $\Delta_{n+1} \le \alpha \Delta_n \le \alpha^{n+1} \Delta_0$. 

Convergence follows since $0 \le \alpha < 1$. 

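Suppose all $c_{ij} \ge 0$ and all $y_i^0 = 0$. Then you can establish that the $y_i^n$ are monotonically 
increasing by the following inductive argument. For $n = 1$ and each $i$, 

(viii) $y_i^1 = \min_{(i,j)\ \text{in network}} (\alpha_{ij} \cdot 0 + c_{ij}) \ge 0 = y_i^0$, 

so $y_i^1 \ge y_i^0$. Assume $y_i^n \ge y_i^{n-1}$ for $n = 1, 2, \ldots, m$, and each $i$. Let arc $(i, j)$ be the arc 
giving the minimum in the calculation of $y_i^{m+1}$. Then 

(ix) $y_i^{m+1} = \alpha_{ij} y_j^m + c_{ij} \ge \alpha_{ij} y_j^{m-1} + c_{ij} \ge \min_{(i,j)\ \text{in network}} (\alpha_{ij} y_j^{m-1} + c_{ij}) = y_i^m$, 

and therefore $y_i^n \ge y_i^{n-1}$ for $n = m + 1$. 

Similarly, if the $y_i^0$ are calculated by an initial strategy, you can prove inductively 
that convergence is from above. For $n = 1$ and each $i$, 

(x) $y_i^1 = \min_{(i,j)\ \text{in network}} (\alpha_{ij} y_j^0 + c_{ij}) \le \alpha_{ik} y_k^0 + c_{ik} = y_i^0$, 

where $(i, k)$ is the arc in the initial strategy, and so $y_i^1 \le y_i^0$. Assume $y_i^n \le y_i^{n-1}$ for $n = 
1, 2, \ldots, m$ and each $i$. Let arc $(i, j)$ be the arc giving the minimum in the calculation of $y_i^m$. Then 

(xi) $y_i^{m+1} \le \alpha_{ij} y_j^m + c_{ij} \le \alpha_{ij} y_j^{m-1} + c_{ij} = y_i^m$, 

and therefore $y_i^n \le y_i^{n-1}$ for $n = m + 1$. 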

The following example, based on a network representation of a choice between 
Policies B and D in Fig. 11.1, p. 363, for $\alpha = .5$, shows how the $y_i^n$ converge without a 
corresponding convergence in policy. Assume the network consists of four nodes, but 
that there is a nontrivial decision only at Node 1: 

(I) Node 1: $c_{12} = 2\tfrac{2}{3}$, $c_{13} = 3$;  Node 2: $c_{22} = 2$;  Node 3: $c_{34} = 1$;  Node 4: $c_{43} = 3$, 

and let $\alpha = .5$. Draw the network corresponding to (I). Also verify that 

(II) $y_1 = 4\tfrac{2}{3}$, $y_2 = 4$, $y_3 = 3\tfrac{1}{3}$, $y_4 = 4\tfrac{2}{3}$ 

satisfy the functional equations and that arcs (1, 2) and (1, 3) are both optimal. 

Begin the function space algorithm with all $y_i^0 = 0$. Then the iterations yield, for 
$n = 1, 2, 3$, 

(III) $y_1^1 = \min\,[.5(0) + 2\tfrac{2}{3},\ .5(0) + 3] = 2\tfrac{2}{3}$ for arc (1, 2) 
      $y_2^1 = .5(0) + 2 = 2$,  $y_3^1 = .5(0) + 1 = 1$,  $y_4^1 = .5(0) + 3 = 3$ 

(IV) $y_1^2 = \min\,[.5(2) + 2\tfrac{2}{3},\ .5(1) + 3] = 3\tfrac{1}{2}$ for arc (1, 3) 
     $y_2^2 = .5(2) + 2 = 3$,  $y_3^2 = .5(3) + 1 = 2\tfrac{1}{2}$,  $y_4^2 = .5(1) + 3 = 3\tfrac{1}{2}$ 

(V) $y_1^3 = \min\,[.5(3) + 2\tfrac{2}{3},\ .5(2\tfrac{1}{2}) + 3] = 4\tfrac{1}{6}$ for arc (1, 2) 
    $y_2^3 = .5(3) + 2 = 3\tfrac{1}{2}$,  $y_3^3 = .5(3\tfrac{1}{2}) + 1 = 2\tfrac{3}{4}$,  $y_4^3 = .5(2\tfrac{1}{2}) + 3 = 4\tfrac{1}{4}$. 

As $n$ increases, the values of $y_i^n$ approach the values for $y_i$ in (II), but arc (1, 2) always 
gives the minimum for $n$ odd, and arc (1, 3) always gives the minimum for $n$ even. Hence 
there is never convergence in policy. ◄ 
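The oscillation in this four-node example can be checked with exact rational arithmetic. The sketch below is an illustration only: it takes the arc costs to be $c_{12} = 2\tfrac{2}{3}$, $c_{13} = 3$, $c_{22} = 2$, $c_{34} = 1$, $c_{43} = 3$ with $\alpha = .5$, and the variable names are choices made here.

```python
from fractions import Fraction as F

alpha = F(1, 2)
# Arc costs of the four-node example; only Node 1 has a choice of arcs.
arcs = {(1, 2): F(8, 3), (1, 3): F(3), (2, 2): F(2), (3, 4): F(1), (4, 3): F(3)}

y = {i: F(0) for i in (1, 2, 3, 4)}   # start with all trial values at zero
choices = []                          # arc head chosen at Node 1, per iteration
history = {0: dict(y)}                # trial values after each iteration

for n in range(1, 9):
    new_y = {}
    for i in y:
        candidates = [(alpha * y[j] + c, j) for (a, j), c in arcs.items() if a == i]
        new_y[i], best_j = min(candidates)
        if i == 1:
            choices.append(best_j)
    y = new_y
    history[n] = dict(y)

print(choices)        # heads of the arcs selected at Node 1 for n = 1, ..., 8
print(history[3])     # trial values after the third iteration
```

Using `Fraction` rather than floating point keeps the comparison of the two arcs at Node 1 exact, which matters here because the gap between them shrinks by half at every iteration.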

Policy iteration. You can solve the extremal equations (1) by the 
successive approximations in policy space algorithm: 

Step 1. Select an arbitrary initial policy, and let $n = 0$. 

Step 2. Given the trial policy, calculate the $y_i^n$ according to the value determination 
equations 

(2) $y_i^n = \alpha y_j^n + c_{ij}$ or $y_i^n - \alpha y_j^n = c_{ij}$ 
for each Node $i$ (value determination routine), 

where arc $(i, j)$ is the decision at Node $i$ indicated by the specific policy being 
evaluated. 

Step 3. Test for a policy improvement by calculating 

(3) $\min_{(i,j)\ \text{in network}} (\alpha y_j^n + c_{ij}) \equiv Y_i^n$ for each $i$ (test quantity). 

Step 4. Terminate the iterations if $Y_i^n = y_i^n$ for all $i$. Otherwise, revise the policy 
at each Node $k$, where $Y_k^n < y_k^n$, by using an arc yielding $Y_k^n$ in (3). Increase $n$ 
to $n + 1$, and return to Step 2 with the new trial policy. 
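The four steps can be sketched in Python as follows. This is an illustrative sketch, not a reference implementation: the helper `solve_linear` (a plain Gauss-Jordan elimination over exact fractions), the function name `policy_iteration`, and the three-node network are all assumptions introduced here.

```python
from fractions import Fraction as F


def solve_linear(A, b):
    """Solve A x = b by Gauss-Jordan elimination (A square and nonsingular)."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [vr - factor * vc for vr, vc in zip(M[r], M[col])]
    return [M[r][n] for r in range(n)]


def policy_iteration(arcs, alpha, policy):
    """policy maps each node i to the head j of its trial arc (Step 1)."""
    nodes = sorted(policy)
    idx = {node: k for k, node in enumerate(nodes)}
    while True:
        # Step 2: value determination, y_i - alpha * y_j = c_ij for the trial arcs.
        A = [[F(0)] * len(nodes) for _ in nodes]
        b = []
        for i in nodes:
            j = policy[i]
            A[idx[i]][idx[i]] += 1
            A[idx[i]][idx[j]] -= alpha
            b.append(arcs[(i, j)])
        sol = solve_linear(A, b)
        y = {i: sol[idx[i]] for i in nodes}
        # Steps 3 and 4: compute the test quantity and revise wherever it is smaller.
        improved = False
        for i in nodes:
            test, best_j = min((alpha * y[j] + c, j)
                               for (a, j), c in arcs.items() if a == i)
            if test < y[i]:
                policy[i] = best_j
                improved = True
        if not improved:
            return y, policy


# Hypothetical three-node network with alpha = 1/2.
arcs = {(1, 2): F(1), (1, 3): F(4), (2, 1): F(2), (2, 3): F(1), (3, 3): F(1)}
y, policy = policy_iteration(arcs, F(1, 2), {1: 3, 2: 1, 3: 3})
print(y, policy)
```

Starting from the trial policy that uses arcs (1, 3), (2, 1), and (3, 3), the first value determination gives $y_1 = 5$, $y_2 = 4\tfrac{1}{2}$, $y_3 = 2$; one revision then leads to arcs (1, 2), (2, 3), (3, 3), whose values all equal 2 and pass the test in Step 4.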

Mechanics and logic of policy iteration. Observe that the value 
determination routine (2) requires the solution of a set of simultaneous linear 
equations. Since there is one equation and one variable for each Node $i$ and 
$0 \le \alpha < 1$, (2) always has a unique solution. In applying (2) you will find it 
helpful to solve first for the $y_i^n$ included in a cycle and then for the remaining 
variables in terms of the previously found quantities. The method terminates when 
$Y_i^n = y_i^n$ for all $i$, because then the $y_i^n$ satisfy the extremal equations. 

For $i = k$ in (3), you can interpret the quantity $(\alpha y_j^n + c_{kj})$ as the present value 
of going from Node $k$ to Node $j$, and afterwards using the complete strategy being 
tested at iteration $n$, including the arc previously specified for Node $k$ by the trial 
strategy at iteration $n$. Now if for some $k$ you find $Y_k^n < y_k^n$ using arc $(k, j)$, then 
it is plausible that you can do better yet by always using that arc whenever you are 
at Node $k$. The new value for $y_k^{n+1}$ reflects such a revised policy. 

The important conclusion then, is that you can test whether a strategy that 
consistently uses arc $(k, j)$ gives an improvement by employing a criterion that 
assumes $(k, j)$ is used only for the current decision! (Reread the conclusion because 
this insightful result is not at all obvious.) 

The policy space approximation has the properties: 

(i) $y_i^{n+1} \le y_i^n$ for each Node $i$, and $y_k^{n+1} < y_k^n$ if $Y_k^n < y_k^n$. 

(ii) The algorithm terminates in a finite number of iterations. 

(iii) At termination, a strategy yielding $Y_i^n$ is optimal. 

As you will see more clearly in Sec. 12.5, the policy space approximation method 
has these properties because it is closely analogous to the simplex method. At 
each iteration the current policy is feasible; however, the corresponding values of 
the variables $y_i^n$ might not be feasible in that they might not satisfy the extremal 
equations. The stopping rule represents a feasibility check. A proof of finite convergence 
to an optimal solution by policy iteration closely parallels a proof that 
the simplex method stops in a finite number of iterations. The crux of the argument 
is that because of (i), you never return to a policy once tested. Since there is only 
a finite number of possible policies, eventually the iterations end. 

In comparing successive approximation in function space and in policy space, 
note that the same type of calculation appears on the right-hand side of (1) and 
the left-hand side of (3). In the value iteration method, the resultant values become 
the new approximations for $y_i$ at the next iteration. In contrast, the new approximations 
for $y_i$ in the policy iteration method are found by the additional calculations 
involved in solving the value determination equations (2) at the next 
iteration. This added computational burden is the price paid to ensure convergence 
in a finite number of iterations. 

The method of policy iteration will be illustrated by numerical examples in Sec. 
12.4. 

► In a network where the discount factor is $\alpha_{ij}$ for arc $(i, j)$, you need only substitute 
$\alpha_{ij}$ for $\alpha$ in formulas (1), (2), and (3). ◄ 

12.3 MINIMIZING AVERAGE RETURN PER PERIOD 

The extremal equations for $y_i$ in Sec. 12.1 cannot be expected to work when 
$\alpha = 1$, since then a summation of returns over an unbounded horizon may be 
infinite (for example, when all $c_{ij} > 0$). To derive the functional equation required, 
we re-express the formulas in terms of equivalent average return per period, and 
then let $\alpha = 1$. 

To begin, consider calculating mentally the equivalent average return per period 
for each possible cycle in the network, and let the symbol $c$ denote the smallest such 
value for $\alpha = 1$. Of course, in an actual problem you would not want to calculate 
all of these averages, and therefore, typically, you would not know the actual 
value of $c$ until you performed the optimization. But for the algebraic development 
to follow, merely view $c$ as a symbol representing this constant, which does in fact 
exist. 

To avoid treating a host of special cases, assume the network satisfies one addi¬ 
tional property: suppose there exists a path—more precisely, a directed chain—from every 
Node i to at least one of the nodes associated with a cycle yielding c. (For example, this 
condition is satisfied if there is a directed chain from every Node i to every Node j.) 
The approach explained below can be modified to treat networks in which this 
property does not hold. 

We will develop functional equations involving $c$. As you will see, the equations 
will also involve other quantities, $w_i$, which are required for the computations. 
We caution that our line of argument below really does not stand up to the 
scrutiny of rigorous mathematical analysis. We only intend for it to provide a 
means of motivating the end result, which by itself may look strange. So if you are 
impatient, you can skip the heuristic development and go directly to the extremal 
equations (5). 

It is convenient to relate the equivalent average return $(1 - \alpha) y_i$ at each Node $i$ 
to $c$ by the following definitive relation for $w_i$: 

(1) $(1 - \alpha) y_i = (1 - \alpha) w_i + c$ $(0 \le \alpha < 1)$. 

(Give a verbal interpretation of $w_i$.) Thus 

(2) $y_i = w_i + \dfrac{c}{1 - \alpha}$ $(0 \le \alpha < 1)$. 

You can interpret $w_i$ as the difference between $y_i$ and $c/(1 - \alpha)$, which is the 
present value of receiving $c$ every period over an unbounded horizon. [Already the 
notation is inexact because both $y_i$ and $w_i$ depend on $\alpha$; a precise notation is 
$y_i(\alpha)$ and $w_i(\alpha)$.] 

Then the extremal equations in Sec. 12.1 can be rewritten as: 

(3) $w_i + \dfrac{c}{1 - \alpha} = \min_{(i,j)\ \text{in network}} \left[ \alpha \left( w_j + \dfrac{c}{1 - \alpha} \right) + c_{ij} \right]$ for each Node $i$. 

Since $c$ is a constant, we can either take the expression involving $c$ on the left of 
(3) and move it to the right, or take the expression involving $c$ on the right of (3) 
and move it to the left. We choose the former approach because it generalizes 
to the case where each arc $(i, j)$ may have a different discount factor $\alpha_{ij}$. Thus, 
as you should verify, (3) can be stated as 

(4) $w_i = \min_{(i,j)\ \text{in network}} (\alpha w_j + c_{ij} - c)$. 
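The verification requested for the step from (3) to (4) can be written out as follows, using only the symbols already defined. The constant term inside the minimum does not depend on the arc chosen, so it can be moved across the equality freely.

```latex
\begin{align*}
w_i + \frac{c}{1-\alpha}
  &= \min_{(i,j)} \Bigl[\alpha w_j + c_{ij} + \frac{\alpha c}{1-\alpha}\Bigr] \\
w_i
  &= \min_{(i,j)} \Bigl[\alpha w_j + c_{ij} + \frac{\alpha c}{1-\alpha} - \frac{c}{1-\alpha}\Bigr] \\
  &= \min_{(i,j)} \bigl[\alpha w_j + c_{ij} - c\bigr],
  \qquad\text{since } \frac{\alpha c - c}{1-\alpha} = -c .
\end{align*}
```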

When $\alpha = 1$, (4) simplifies to 

(5) $w_i = \min_{(i,j)\ \text{in network}} (w_j + c_{ij} - c)$ for each Node $i$, 

or equivalently after moving the constant $c$ to the left-hand side 

(6) $w_i + c = \min_{(i,j)\ \text{in network}} (w_j + c_{ij})$ for each Node $i$. 

[Using precise notation, we let $w_i$ in (5) and (6) be the limiting value of $w_i(\alpha)$ 
as $\alpha$ approaches 1 from below.] 

So far we have assumed that it is meaningful to transform the previous extremal 
equations for $\alpha < 1$ into extremal equations (5) for $\alpha = 1$. This manipulation 
presupposes the existence of a stationary strategy that is optimal in terms of containing 
a cycle having minimum equivalent average return per period. The 
assumption is justified, because by rigorous argument the following theorem can 
be established: 

Stationary Policy Theorem. There always exist a unique finite $c$ and finite 
$w_i$ that satisfy the extremal equations (5), and a corresponding stationary policy 
contains a cycle with an equivalent return per period that is minimal among all 
possible policies. 

As in the case $\alpha < 1$, an arc where the minimum occurs in (5) or (6) indicates 
an action to take if you are at Node $i$. Such a strategy must contain at least one 
cycle. Consequently, the strategy specifies a path leaving the current Node $i$ and 
leading to a loop having minimum equivalent average return per period. (Of 
course, Node $i$ may be included in the cycle.) If an optimal stationary strategy 
contains more than one cycle, then the equivalent average return per period must 
be the same for all cycles. This follows from our earlier assumption that you can 
always find a path from any Node $i$ to a node associated with a cycle yielding $c$. 
Since $c$ is associated with a cycle and the $c_{ij}$ are stationary, the value $c$ is also the 
minimum average return per period among all possible cycles. That is, the “equivalent 
average return per period” approach yields a strategy that is optimal for the 
criterion “average return per period.” 

As we mentioned, the main interest in the $w_i$ is computational: you have to 
find values for the $w_i$ in order to calculate $c$ and an optimal policy. But there are 
several interpretations of the economic significance of $w_i$. Here is one. You can 
see from (2) that for any two $i$ and $j$, 

$y_i - y_j = w_i - w_j$ $(0 \le \alpha < 1)$. 

So as $\alpha$ approaches the value 1, the limiting value of the difference between the 
present values for Node $i$ and Node $j$ is given by $w_i - w_j$. Therefore, $w_i - w_j$ can 
be thought of as the incremental cost from being in Node $i$ rather than Node $j$ when 
$\alpha = 1$. 

Policy iteration. In the case $\alpha = 1$, you would use the technique of
successive approximations in policy space to solve (5) or (6). For 
expository convenience, assume that at every iteration the trial policy has a single cycle. 
The added computational details needed to encompass the possibility of having 
more than one cycle at an iteration are given in the special material at the end of 
this section. 

When you find values for $w_i$ that satisfy the extremal equations (5) or (6),
then adding any constant to these values yields new amounts for $w_i$ that also
satisfy the equations. Thus the $w_i$ are not unique, and in the algorithm below it is
convenient to set one of the values to zero. For the sake of definiteness, we make the
normalization $w_1 = 0$.

The procedure is: 

Step 1. Select an arbitrary initial policy, and let $n = 0$.

Step 2. Given the trial policy, calculate $w_i^n$ and $c^n$ according to the value
determination equations:

(7)   $w_i^n - w_j^n + c^n = c_{ij}$ for each Node $i$   (value determination routine)
      $w_1^n = 0$,

where arc $(i,j)$ is the decision indicated at Node $i$ by the specific policy being
evaluated.

Step 3. Test for a policy improvement by calculating

(8)   $\min_{(i,j)\ \text{in network}} (w_j^n + c_{ij} - c^n) \equiv W_i^n$ for each $i$   (test quantity).

Step 4. Terminate the iterations if $W_i^n = w_i^n$ for all $i$. Otherwise, revise the policy
at each Node $k$ where $W_k^n < w_k^n$, using an arc yielding $W_k^n$ in (8). Increase $n$ to
$n + 1$, and return to Step 2 with the new trial policy.

Mechanics. Observe that (7) requires the solution of a set of simultaneous
linear equations. Given that the trial policy has a single cycle, there is always a
unique solution to (7). Notice if you sum the equations (7) for the nodes contained
in the cycle, the $w_i^n$ cancel each other and yield as a result

(9)   $c^n = \sum_{(i,j)\ \text{in cycle}} c_{ij} \,/\, \text{number of arcs in cycle}$.

We exhibited the test quantity (8) that came from the extremal equations (5).
Note that the term $c^n$ in (8) has no influence on the minimization. Therefore
you do not have to subtract $c^n$ when finding the minimum, but only when calculating
the resultant value of $W_i^n$. We find it convenient to express the test as (8) because
this form generalizes to the case where there are different traversal times for the
arcs.
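To see the cancellation behind (9) concretely, here is a brief worked case. It assumes, purely for illustration, that the single cycle of the trial policy consists of the two arcs (1,4) and (4,1); adding the two corresponding equations from (7) gives

```latex
\begin{aligned}
(w_1^n - w_4^n + c^n) + (w_4^n - w_1^n + c^n) &= c_{14} + c_{41} \\
2\,c^n &= c_{14} + c_{41} \\
c^n &= \tfrac{1}{2}\,(c_{14} + c_{41}).
\end{aligned}
```

The same telescoping occurs for a cycle of any length, which is why only the arc costs and the number of arcs in the cycle appear in (9).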

With proper precautions to avoid cycling phenomena, such as can occur in
the simplex method, the algorithm has the properties:

(i) At every iteration $c^{n+1} \le c^n$.

(ii) The algorithm terminates in a finite number of iterations.

(iii) At termination, a strategy yielding $W_i^n$ is optimal.

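►Suppose $h_{ij}$ is the number of time periods to traverse arc $(i,j)$ and $\alpha_{ij} \equiv \alpha^{h_{ij}}$. Then
(3), (4), and (5) would be written as

(I)   $w_i + \dfrac{c}{1-\alpha} = \min_{(i,j)\ \text{in network}} \left[\alpha_{ij}\left(w_j + \dfrac{c}{1-\alpha}\right) + c_{ij}\right]$

(II)  $w_i = \min_{(i,j)\ \text{in network}} \left[\alpha_{ij} w_j + c_{ij} - (1 + \alpha + \cdots + \alpha^{h_{ij}-1})\,c\right]$

(III) $w_i = \min_{(i,j)\ \text{in network}} (w_j + c_{ij} - h_{ij}\,c)$.

Accordingly, the coefficient $h_{ij}$ modifies $c^n$ in (7) and (8). And the denominator in (9)
is the sum of the $h_{ij}$ for arcs $(i,j)$ in the cycle.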
Even though the initial strategy may contain a single cycle, a trial strategy at a later
iteration can have two or more cycles. In this event the algorithm must be modified
slightly. Since any trial strategy indicates that each Node $i$ is either in one cycle or on
a path to one cycle, all the nodes can be partitioned into disjoint groups, one group for
each cycle. Let $c_i^n$ denote the average return per period for the cycle to which Node $i$
is associated. We continue to use the notation for the general network in which $h_{ij}$ is
the number of periods to traverse arc $(i,j)$.
Then given a trial strategy, you solve

(i)   $w_i^n - w_j^n + h_{ij} c_i^n = c_{ij}$ for each Node $i$,

where you also let all the $c_i^n$ associated with the same cycle be equal, and you arbitrarily
select one $i$ in each cycle to let $w_i^n = 0$. [For example, if Nodes 1, 2, and 3 are all the
nodes associated with one of the cycles, you have in addition to (i) the equations
$c_1^n = c_2^n = c_3^n$ and, say, $w_1^n = 0$.]

After (i), calculate

(ii)  $\min_{(i,j)\ \text{in network}} (c_j^n) \equiv C_i^n$ for each $i$.

If $C_k^n < c_k^n$ for any $k$, then revise the strategy accordingly from (ii); if the new policy
still has more than one cycle, return to calculation (i); otherwise return to (7). If
$C_k^n = c_k^n$ for every $k$, then $c_k^n$ will be equal for all $k$; and letting $c^n$ be the common value,
apply (8) as before.

You can gain further insight into (5) by relating it to the shortest-route model for a
general network. Let

(iii) $\bar{c}_{ij} \equiv c_{ij} - c$.

Then (5) can be viewed as

(iv)  $w_i = \min_{(i,j)\ \text{in network}} (w_j + \bar{c}_{ij})$ for every $i$ and $w_1 = 0$.

There is one difference between (iv) and the usual shortest-route problem, such as that
in Sec. 11.9. Although $w_1 = 0$, Node 1 really is not a terminal. Now a strategy may well
include an arc that goes from Node 1 to Node $j$.

Suppose you have guessed a strategy for which $c$ is the associated minimal equivalent
average return per period along any cycle. Then the sum of the corresponding $\bar{c}_{ij}$ along
the same cycle is 0. If all other possible cycles in the network have a nonnegative sum of
$\bar{c}_{ij}$, the proposed strategy is optimal. In such an instance, a shortest-route technique of
successive approximations in function space leads to a solution of (iv). If, however, a
cycle exists whose average return is negative in terms of the current $\bar{c}_{ij}$, an improvement
exists, and the shortest-route algorithm in Secs. 7.6 and 11.9 breaks down. In performing
the steps, you eventually find that you are repeatedly revising those $w_i$ that are a part
of a cycle that represents an improvement.

Thus the following hybrid algorithm emerges. Select a stationary strategy and
calculate the resultant trial $c$. Use the value iteration algorithm (5) in Sec. 11.9, with $\bar{c}_{ij}$ from
(iii). Let $w_i = \infty$ for $i \ne 1$. If the calculations discover a loop such that the
corresponding sum of the $\bar{c}_{ij}$ is negative, revise the stationary strategy accordingly, recalculate $c$
for this improved cycle, revise the $\bar{c}_{ij}$, and repeat the process. When the shortest-route
algorithm terminates with values for the $w_i$, the corresponding $c$ is optimal.
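The optimality check at the heart of this hybrid idea can be sketched in a few lines. The sketch below is not the Sec. 11.9 routine itself: it swaps in a generic label-correcting (Bellman-Ford style) pass to ask whether any cycle has negative length under $\bar{c}_{ij} = c_{ij} - c$ for a trial $c$. The function name `has_negative_cycle` is hypothetical, and the arc costs are those that appear in the Example 3 computations of Sec. 12.4 (with $c_{23} = 3$).

```python
from fractions import Fraction

def has_negative_cycle(costs, trial_c):
    """Return True if some cycle has negative total length under c_ij - trial_c."""
    nodes = sorted({i for i, _ in costs} | {j for _, j in costs})
    reduced = {arc: Fraction(c) - trial_c for arc, c in costs.items()}
    # Start every label at 0, as if a free virtual source reached all nodes.
    label = {i: Fraction(0) for i in nodes}
    for _ in range(len(nodes)):
        changed = False
        for (i, j), length in reduced.items():
            # Relax arc (i, j): the label of i may not exceed label_j + cbar_ij.
            if label[j] + length < label[i]:
                label[i] = label[j] + length
                changed = True
        if not changed:
            return False      # labels settled, so no negative cycle exists
    return True               # still improving after |nodes| passes

# Arc costs as used in Example 3 of Sec. 12.4 (c_23 = 3).
costs = {(1, 2): 4, (1, 4): 3, (2, 1): 4, (2, 3): 3, (3, 2): 1, (4, 1): 1}

trial_is_improvable = has_negative_cycle(costs, 4)   # c from the cycle (1,2),(2,1)
optimal_is_improvable = has_negative_cycle(costs, 2) # c from the cycle (1,4),(4,1)
```

With the trial value $c = 4$ a negative cycle is found, signaling an improvement; with $c = 2$ none exists, which is the condition under which the shortest-route calculation settles.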
12.4 NUMERICAL EXAMPLES OF POLICY ITERATION

The models in this chapter are complex, and studying a few examples will help
you master the optimization problem. The illustrations below and in Sec. 12.5
are for this purpose.

Examples 1 and 2 exhibit the detailed calculations performed in the policy
iteration technique, where $0 \le \alpha < 1$. In Example 3 there is more than one
optimal strategy for $\alpha = 1$, and the iterations terminate without selecting one of
these policies that also is optimal for $0 \le \alpha < 1$. It is possible to add further
instructions to the policy iteration algorithm so that it always selects a policy that is
optimal not only for $\alpha = 1$ but for all $\alpha$ close to 1.

In Examples 4, 5, and 6, when $\alpha = 1$ the optimal strategy for any finite horizon
$n$ depends on whether $n$ is odd or even. Nevertheless, Examples 4 and 5 demonstrate
that the policy iteration algorithm can indicate a unique determination of an
optimal strategy. Example 6, like Example 3, shows that when there are several
optimal strategies, the technique may not distinguish between one that is correct
only for $\alpha = 1$ and one that is optimal for $0 \le \alpha < 1$. [In Example 7, at the
first iteration a policy with two cycles is indicated, even though the initial strategy
contained only one cycle; thus (i) and (ii) of the previous section are applied.]

From these illustrations you will see why you must understand the broad
economic implications of formulas and algorithms. Blindly applying them can lead
you to numerically correct but patently unsuitable answers.

You will find it helpful to copy on paper the formulas for value determination
(2) and the test quantity (3) from Sec. 12.2, where $\alpha < 1$, and the corresponding
formulas (7) and (8) from Sec. 12.3, where $\alpha = 1$.

Example 1. Consider the network in Fig. 12.1. The numbers on each arc
indicate the corresponding $c_{ij}$. To initiate the policy iteration procedure, you
must select a starting strategy consisting of an outward arc for each node. Let
this policy be

(1)   Node 1: arc (1,4)   Node 2: arc (2,1)
      Node 3: arc (3,2)   Node 4: arc (4,1).

Figure 12.1. Example 1.

Thus the strategy contains, as its only cycle, arcs (1,4) and (4,1).

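Using the value determination formula (2) of Sec. 12.2, solve the following set
of simultaneous linear equations for $y_i^0$:

(2)   $y_1^0 = \alpha y_4^0 + c_{14}$
      $y_2^0 = \alpha y_1^0 + c_{21}$
      $y_3^0 = \alpha y_2^0 + c_{32}$
      $y_4^0 = \alpha y_1^0 + c_{41}$,

which can be written as

(3)   $y_1^0 - \alpha y_4^0 = 3$
      $-\alpha y_1^0 + y_2^0 = 4$
      $-\alpha y_2^0 + y_3^0 = 1$
      $-\alpha y_1^0 + y_4^0 = 1$.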

In a specific application, the value for $\alpha$ would be given. But here it will be useful
to perform the calculations leaving $\alpha$ unspecified and showing how it affects the
answers. You can solve the first and fourth relations in (3) for $y_1^0$ and $y_4^0$, these being
the equations for the nodes in the cycle, and then the remaining relations can be
solved. Verify that the answers are

(4)   $y_1^0 = \dfrac{3 + \alpha}{1 - \alpha^2}$

      $y_2^0 = \alpha y_1^0 + 4 = \dfrac{4 + 3\alpha - 3\alpha^2}{1 - \alpha^2}$

      $y_3^0 = \alpha y_2^0 + 1 = \dfrac{1 + 4\alpha + 2\alpha^2 - 3\alpha^3}{1 - \alpha^2}$

      $y_4^0 = \dfrac{1 + 3\alpha}{1 - \alpha^2}$.

A glance at Fig. 12.1 shows that only the decisions for Nodes 1 and 2 are in
question, because there is only a single arc out of Nodes 3 and 4. The test
calculation (7) of Sec. 12.2 yields, for Node 1,

(5)   $\min(\alpha y_2^0 + c_{12},\ \alpha y_4^0 + c_{14}) = \min\left(\dfrac{4 + 4\alpha - \alpha^2 - 3\alpha^3}{1 - \alpha^2},\ \dfrac{3 + \alpha}{1 - \alpha^2}\right) \equiv Y_1^0$ for arc (1,4)

if $0 \le \alpha < 1$. Similarly, for Node 2,
(6)   $\min(\alpha y_1^0 + c_{21},\ \alpha y_3^0 + c_{23}) \equiv Y_2^0$ for arc (2,1)   if $\tfrac{2}{3} \le \alpha < 1$.

Since $Y_i^0 = y_i^0$ for each Node $i$, the initial policy is optimal provided $\tfrac{2}{3} \le \alpha < 1$.
If $\alpha < \tfrac{2}{3}$, then the minimum in (6) would be for arc (2,3), and this decision would
replace arc (2,1) at the next iteration. Explain why arc (2,3) is a better decision
for values of $\alpha$ below $\tfrac{2}{3}$.

Verify that the test quantity for arc (2,3) in (6) really represents the present
value of going from Node 2 to Node 3, back to Node 2, then to Node 1, and
thereafter around the loop between Nodes 1 and 4. However, if (2,3) is actually used
instead of (2,1), then the present value is calculated by going from Node 2 to
Node 3 and back again in an endless loop. Thus you can determine if a repeated
cycle between Nodes 2 and 3 is desirable by testing whether there is an
improvement in making the loop only once.

From Fig. 12.1 you can also see that the strategy (1) is uniquely optimal for
$\alpha = 1$. The minimum average cost cycle consists of arcs (1,4) and (4,1), so that
$c = (3 + 1)/2 = 2$.

Example 2. Consider the network and data in the previous illustration, but let
$c_{23} = 3$. Since the starting strategy in (1) does not involve this arc, the trial values
for $y_i^0$ are those in (4). Similarly, the test in (5) gives the same result. However,



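(7)   $\alpha y_3^0 + c_{23} = \dfrac{3 + \alpha + \alpha^2 + 2\alpha^3 - 3\alpha^4}{1 - \alpha^2} < \dfrac{4 + 3\alpha - 3\alpha^2}{1 - \alpha^2} = \alpha y_1^0 + c_{21}$,

and as a result,

(8)   $\alpha y_3^0 + c_{23} \equiv Y_2^0$, for arc (2,3),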


when $0 \le \alpha < 1$. Thus arc (2,3) replaces arc (2,1) in the strategy. This in turn
means that for $n = 1$ the second equation in (3) is replaced by

(9)   $-\alpha y_3^1 + y_2^1 = 3$,


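and the new solution is

(10)  $y_1^1 = y_2^1 = \dfrac{3 + \alpha}{1 - \alpha^2}$   $y_3^1 = y_4^1 = \dfrac{1 + 3\alpha}{1 - \alpha^2}$.

Now the test quantities for Nodes 1 and 2 are identical:

(11)  $\min\left(\dfrac{4 + 3\alpha - 3\alpha^2}{1 - \alpha^2},\ \dfrac{3 + \alpha}{1 - \alpha^2}\right) = \dfrac{3 + \alpha}{1 - \alpha^2} \equiv Y_i^1$   (Nodes 1 and 2)

for $0 \le \alpha < 1$. The calculations stop, since $Y_i^1 = y_i^1$ for all $i$.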
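The two iterations of Example 2 can be replayed mechanically. The sketch below is one possible way to code the discounted policy iteration, assuming $\alpha = \tfrac{1}{2}$ for concreteness; the helper names `solve` and `policy_iteration` are hypothetical, the arc costs are those of Example 2 ($c_{23} = 3$), and exact fractions are used so that the comparison between test quantity and trial value is free of rounding.

```python
from fractions import Fraction

def solve(A, b):
    """Solve A y = b exactly by Gauss-Jordan elimination on Fractions."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [a - factor * p for a, p in zip(M[r], M[col])]
    return [row[n] for row in M]

def policy_iteration(costs, policy, alpha):
    """Return (final policy, present values y_i, iteration index n)."""
    policy = dict(policy)
    nodes = sorted(policy)
    index = {i: k for k, i in enumerate(nodes)}
    n = 0
    while True:
        # Value determination: y_i - alpha * y_j = c_ij on the policy arcs.
        A = [[Fraction(0)] * len(nodes) for _ in nodes]
        b = []
        for i in nodes:
            A[index[i]][index[i]] += 1
            A[index[i]][index[policy[i]]] -= alpha
            b.append(Fraction(costs[i, policy[i]]))
        y = dict(zip(nodes, solve(A, b)))
        # Test quantity: switch arcs only on a strict improvement.
        improved = False
        for i in nodes:
            arcs = [j for (a, j) in costs if a == i]
            best = min(arcs, key=lambda j: alpha * y[j] + costs[i, j])
            if alpha * y[best] + costs[i, best] < y[i]:
                policy[i] = best
                improved = True
        if not improved:
            return policy, y, n
        n += 1

costs = {(1, 2): 4, (1, 4): 3, (2, 1): 4, (2, 3): 3, (3, 2): 1, (4, 1): 1}
start = {1: 4, 2: 1, 3: 2, 4: 1}                  # starting strategy (1)
policy, y, n = policy_iteration(costs, start, Fraction(1, 2))
```

Under these assumptions the run stops at $n = 1$ with arc (2,3) in place of arc (2,1), and the values for Nodes 1 and 2 coincide, as do those for Nodes 3 and 4.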
Example 3. Continue with the previous example, but let $\alpha = 1$. Looking
at Fig. 12.1, where again $c_{23} = 3$, you see that the cycles between Nodes 1 and 4
and between Nodes 2 and 3 are equally good and optimal. If you use the same
starting policy as (1), and apply the value determination routine (7) in Sec. 12.3,
you obtain the system of simultaneous equations:

(12)  $w_1^0 - w_4^0 = c_{14} - c^0$
      $w_2^0 - w_1^0 = c_{21} - c^0$
      $w_3^0 - w_2^0 = c_{32} - c^0$
      $w_4^0 - w_1^0 = c_{41} - c^0$.

Check that the solution of (12) is

(13)  $c^0 = 2$   $w_1^0 = 0$   $w_2^0 = 2$   $w_3^0 = 1$   $w_4^0 = -1$.

The test calculations in (8) of Sec. 12.3 give

(14)  $\min(w_2^0 + c_{12} - c^0,\ w_4^0 + c_{14} - c^0)$   (Node 1)
      $= \min(2 + 4 - 2,\ -1 + 3 - 2) = 0 \equiv W_1^0$ for arc (1,4)

and

(15)  $\min(w_1^0 + c_{21} - c^0,\ w_3^0 + c_{23} - c^0)$   (Node 2)
      $= \min(0 + 4 - 2,\ 1 + 3 - 2) = 2 \equiv W_2^0$ for both arcs.

The calculations terminate, since $W_i^0 = w_i^0$ for all $i$.

Notice that although the initial policy (1), which has a single cycle between
Nodes 1 and 4, tests optimally, you probably would prefer to use the alternative
optimal strategy, which includes the cycle between Nodes 2 and 3. The reason is
that arc (2,3) is optimal not only when $\alpha = 1$, but this arc is uniquely optimal
for $0 \le \alpha < 1$, as you found in (8).


►Letting

(I)   $\bar{c}_{ij} \equiv c_{ij} - c = c_{ij} - 2$,

Fig. 12.2 shows the network for

(II)  $w_i = \min_{(i,j)\ \text{in network}} (w_j + \bar{c}_{ij})$ and $w_1 = 0$.

Notice that the total length around an optimal cycle is 0, whereas the length around
any other cycle is strictly positive. ◄

Figure 12.2. Example 3 ($\bar{c}_{ij}$).

Figure 12.3. Example 4.

Example 4. Consider the network in Fig. 12.3. To begin, suppose $c_{12} = 1.5$.
Then if you examine the network, you quickly find that, for $\alpha = 1$, arc (2,3) is
a better decision than arc (2,1). Period by period, starting at Node 2, a cycle
between Nodes 2 and 3 accumulates less cost than between Nodes 2 and 1.

It is not so clear, however, whether for $\alpha = 1$ arc (1,3) is better than arc
(1,2). To see why, calculate period by period the cumulative costs for a path
starting at Node 1, going to Node 2, and looping between Nodes 2 and 3
thereafter. Compare those sums with the corresponding costs of going to Node 3 first.
You will find that arc (1,3) is better when the horizon is an odd number of periods
(1, 3, 5, ...) and arc (1,2) is better when it is an even number.
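The odd/even pattern is easy to tabulate. The short sketch below accumulates the period-by-period costs of the two paths for $\alpha = 1$. Since Fig. 12.3 is not reproduced here, the arc costs $c_{13} = 1$, $c_{23} = 2$, and $c_{32} = 4$ are assumptions read off the numerical calculations of Example 6, and the helper name `cumulative_costs` is hypothetical.

```python
from itertools import accumulate, cycle, islice

def cumulative_costs(first_arc_cost, loop_costs, horizon):
    """Cumulative cost after each period: one entry arc, then the 2-3 loop."""
    stream = [first_arc_cost] + list(islice(cycle(loop_costs), horizon - 1))
    return list(accumulate(stream))

horizon = 6
via_2 = cumulative_costs(1.5, [2, 4], horizon)   # 1 -> 2, then (2,3), (3,2), ...
via_3 = cumulative_costs(1.0, [4, 2], horizon)   # 1 -> 3, then (3,2), (2,3), ...

# For each horizon length 1, 2, ..., report which first arc is cheaper.
better = ["(1,3)" if b < a else "(1,2)" for a, b in zip(via_2, via_3)]
```

Under these assumed costs, the list `better` alternates between arc (1,3) for odd horizons and arc (1,2) for even ones.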

What happens when you use the policy space algorithm? Let the starting
strategy be

(16)  Node 1: arc (1,3)   Node 2: arc (2,3)   Node 3: arc (3,2).

Then the $y_i^0$ are found by solving

(17)  $y_1^0 = \alpha y_3^0 + c_{13}$
      $y_2^0 = \alpha y_3^0 + c_{23}$
      $y_3^0 = \alpha y_2^0 + c_{32}$.


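Verify that the results are

(18)  $y_1^0 = \dfrac{1 + 4\alpha + \alpha^2}{1 - \alpha^2}$   $y_2^0 = \dfrac{2 + 4\alpha}{1 - \alpha^2}$   $y_3^0 = \dfrac{4 + 2\alpha}{1 - \alpha^2}$.

The test computation for Node 1 yields

(19)  $\min(\alpha y_2^0 + c_{12},\ \alpha y_3^0 + c_{13})$   (Node 1)
      $= \min\left(\dfrac{1.5 + 2\alpha + 2.5\alpha^2}{1 - \alpha^2},\ \dfrac{1 + 4\alpha + \alpha^2}{1 - \alpha^2}\right) \equiv Y_1^0$ for arc (1,2)

if $\tfrac{1}{3} \le \alpha < 1$. Thus for large $\alpha$, arc (1,2) is better than arc (1,3), so the policy
in (16) must be altered. The revised optimal strategy gives

(20)  $y_1^1 = \dfrac{1.5 + 2\alpha + 2.5\alpha^2}{1 - \alpha^2}$   $y_2^1 = \dfrac{2 + 4\alpha}{1 - \alpha^2}$   $y_3^1 = \dfrac{4 + 2\alpha}{1 - \alpha^2}$.

Next perform the policy iteration algorithm for $\alpha = 1$, and check that the above
solution with arc (1,2) is uniquely optimal.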


Example 5. Continue with the previous illustration, but here let $c_{12} = 2$.
Once again, if the horizon is finite, arc (1,3) is strictly better for an odd number
of periods and strictly worse for an even number. If you choose (16) as the starting
strategy, (17) and (18) remain valid.

The test calculation for Node 1 is

(21)  $\min_{(1,2),\,(1,3)} \left(\dfrac{2 + 2\alpha + 2\alpha^2}{1 - \alpha^2},\ \dfrac{1 + 4\alpha + \alpha^2}{1 - \alpha^2}\right) \equiv Y_1^0$ for arc (1,3),

when $0 \le \alpha < 1$. Therefore arc (1,3) is now optimal for all values of $\alpha$ below 1.


Example 6. Continue with the previous illustration, but let $\alpha = 1$. As
you would suppose, arc (1,3) remains optimal. But if you test arc (1,2), it too
is optimal according to the algorithm's criterion. The equations to be solved are

(22)  $w_1^0 - w_2^0 = c_{12} - c^0$
      $w_2^0 - w_3^0 = c_{23} - c^0$
      $w_3^0 - w_2^0 = c_{32} - c^0$
      $w_1^0 = 0$,

with the results

(23)  $c^0 = 3$   $w_1^0 = 0$   $w_2^0 = 1$   $w_3^0 = 2$.

Verify that you obtain the same answers using arc (1,3) instead.

The computation for Node 1 is

(24)  $\min(w_2^0 + c_{12} - c^0,\ w_3^0 + c_{13} - c^0)$   (Node 1)
      $= \min(1 + 2 - 3,\ 2 + 1 - 3) = 0 \equiv W_1^0$ for arcs (1,2) and (1,3),

so the trial solution with (1,2) tests optimally.


►Example 7. Consider the network in Fig. 12.4, where $\alpha = 1$. Let the starting
strategy be



Figure 12.4. Example 7.

(iii) Node 1: arc (1,4)   Node 2: arc (2,4)
      Node 3: arc (3,2)   Node 4: arc (4,1).

Thus the only cycle is between Nodes 1 and 4. Verify that the corresponding solution is

wS = —|- = - y- 

for arc (1,4) 
for arc (2, 3) 

for arc (3, 2) 
for arc (4, 1). 

The policy in (v) therefore represents an improvement. Notice that the new strategy has 
two cycles, and so (i) and (ii) of Sec. 12.3 must be used at the next iteration. 
Accordingly, the value equations to be solved are 

w\ — w\ 4- c] = 2 c[ = c\ «;| = 0 

cl = 2 cj = cj wl = 0. 

w\ — wl-h cl = 0 

— !£;[ + c| = 1 

/rl __ /rl_L — cl _ 1 

Cl — C4 — 2 Cj — C3 — X, 

C} = 1 < c| for arc (1, 2) 

Cj = 1 = cJ for arc (2, 3) 

C’J = 1 = cJ for arc (3, 2) 

C] = 1 < cJ for arc (4, 3). 

The new strategy in (viii) has a single cycle. Perform the next iteration and check that 
the strategy in (viii) is optimal. 

Summary. This section applied the successive approximation in policy
space algorithms to several illustrations. For the case $0 \le \alpha < 1$, you saw how
the optimality test criterion $Y_i^n$ indicates whether a policy improvement is possible,
and the way in which this quantity can depend on the specific value of $\alpha$. For the
case $\alpha = 1$, you observed that the method determines at least one optimal cycle,
but the test criterion is not strong enough to distinguish between alternative optimal
strategies that yield distinctly different present values for $\alpha$ near 1. As we mentioned
above, it is possible to modify the algorithm to eliminate this annoying feature.
Because the necessary modifications go beyond the scope of this text, they are not
given here.


(vi) 

Verify that 

(vii) 
and 

(viii) 


(iv) ‘'’ = Y = ° 

Then check that 

\ 

iv; = 0 = w? 
wi= -2 <!4 

M'S = 

M'S = -y = “! 
12.5 ELEMENTARY INVENTORY MODEL

The preceding examples were simple networks that allowed you to focus on the
mechanics of successive approximation techniques. Here, the application is
oriented to a dynamic problem of practical significance, namely, the elementary
deterministic inventory model you first studied in Sec. 8.3, which was summarized
in Sec. 12.1. We will employ the data for the Dependable Manufacturing
Company illustration in Sec. 8.4:

(1)   demand $D = 3$   production $x \le 5$
      ending inventory $j \le 4$

(2)   production and inventory costs
      $C(x, j) = C(x) + h \cdot j$,

where $x$ and $j$ are nonnegative integers and

(3)   $C(0) = 0$   $C(1) = 15$   $C(2) = 17$
      $C(3) = 19$   $C(4) = 21$   $C(5) = 23$
      $h = 1$

for all periods in an unbounded horizon.

The state variable is the level of entering inventory, designated by the symbol $i$,
where $i = 0, 1, \ldots, 4$. Therefore, a stationary strategy consists of a rule for the
amount to produce given $i$. You already saw (Fig. 8.16) that the optimal strategy
for an unbounded horizon was given as

(4)   $x_\infty(0) = x_\infty(1) = x_\infty(2) = 5$
      $x_\infty(3) = x_\infty(4) = 0$.

You now will validate (4) using the technique of policy iteration.

The network appropriate to the example appears in Fig. 12.5. There is a node for
each value of entering inventory $i$ and an arc for each feasible production level $x$,
given $i$. Specifically, if entering inventory is $i$, production is $x$, and demand is
3, then inventory entering the next period is $j = i + x - 3 \ge 0$. Thus arc $(i,j)$
represents producing an amount $j - i + 3$. Because of the production and inventory
constraints in (1), certain combinations of $i$ and $j$ are not allowed. For
example, if $i = 0$, then $j \le 2$, since $x \le 5$. The arc costs, given in Fig. 12.6, are

(5)   $c_{ij} = C(j - i + 3) + h \cdot j$.

Figure 12.5. Dependable Manufacturing Company Inventory Model (Unbounded Horizon).
Note: node $i$ = inventory level $i$; arc $(i,j)$ = produce $j - i + 3$.
Figure 12.6. Dependable Manufacturing Company Inventory Model.
Arc Costs $c_{ij} = C(j - i + 3) + hj$

  i \ j |   0    |   1    |   2    |   3    |   4
  ------+--------+--------+--------+--------+--------
    0   | 19 + 0 | 21 + 1 | 23 + 2 |        |
    1   | 17 + 0 | 19 + 1 | 21 + 2 | 23 + 3 |
    2   | 15 + 0 | 17 + 1 | 19 + 2 | 21 + 3 | 23 + 4
    3   |  0 + 0 | 15 + 1 | 17 + 2 | 19 + 3 | 21 + 4
    4   |        |  0 + 1 | 15 + 2 | 17 + 3 | 19 + 4

Assume the optimality criterion is minimum average cost per period. Then
the functional equation to be solved is

(6)   $w_i = \min_{(i,j)\ \text{in network}} (w_j + c_{ij} - c)$ for each $i$.

Suppose the starting strategy is to produce just enough when $i \le 3$ so that
ending inventory $j$ is zero, and not to produce at all if $i = 4$ so that $j = 1$. In
network terms, the policy is

(7)   Node 0: (0,0), $x = 3$   Node 1: (1,0), $x = 2$   Node 2: (2,0), $x = 1$
      Node 3: (3,0), $x = 0$   Node 4: (4,1), $x = 0$.

For example, at Node 1 ($i = 1$) you produce 2, so that at the end of the period the
demand 3 reduces inventory to 0. Draw the network containing only these arcs,
and observe that the strategy eventually leads to producing 3 every period.
Employing the policy iteration algorithm, you solve for $w_i^0$ and $c^0$ in the value
determination equations,


(8)   $w_0^0 - w_0^0 + c^0 = 19$
      $w_1^0 - w_0^0 + c^0 = 17$
      $w_2^0 - w_0^0 + c^0 = 15$
      $w_3^0 - w_0^0 + c^0 = 0$
      $w_4^0 - w_1^0 + c^0 = 1$
      $w_0^0 = 0$,

which yield

(9)   $c^0 = 19$   $w_0^0 = 0$   $w_1^0 = -2$   $w_2^0 = -4$   $w_3^0 = -19$   $w_4^0 = -20$.
Verify that the step

(10)  $\min_{(i,j)\ \text{in network}} (w_j^n + c_{ij} - c^n) \equiv W_i^n$

for $n = 0$ gives

(11)  $W_0^0 = 0$ for arc (0,0)
      $W_1^0 = -12$ for arc (1,3)
      $W_2^0 = -14$ for arc (2,3)
      $W_3^0 = -19$ for arc (3,0)
      $W_4^0 = -20$ for arc (4,1).

Since $W_i^0 < w_i^0$ for $i = 1, 2$, the iterations are continued. Draw the network
corresponding to the revised strategy in (11).

The results of the successive iterations are exhibited in Fig. 12.7. You will find
it instructive to trace through the calculations yourself, using Fig. 12.7 as a check
on your calculations. You should also draw the network corresponding to each
trial policy. Note the following in Fig. 12.7:

(i) There is no improvement in $c^n$ between $n = 0$ and 1.

(ii) Although at $n = 2$ the average cost per period improves ($c^2 < c^1$), the
relative values increase: $w_i^2 > W_i^1$ for all $i$, and $w_i^2 > w_i^1$ for all $i \ne 0$.


(iii) The production strategy at each iteration is:

(12)  $x = 5$ for $i = 1, 2$   $x = 4$ for $i = 0, 3$   $x = 0$ for $i = 4$,
      production cycle (5, 4, 0) at $n = 2$;

(13)  $x = 4$ for $i = 1$   $x = 5$ for $i = 0, 2$   $x = 0$ for $i = 3, 4$,
      production cycle (4, 5, 0) at $n = 3$;

(14)  $x = 5$ for $i = 0, 1$   $x = 4$ for $i = 2$   $x = 0$ for $i = 3, 4$,
      production cycle (5, 4, 0) at $n = 4$;

(15)  $x = 4$ for $i = 0$   $x = 5$ for $i = 1, 2$   $x = 0$ for $i = 3, 4$,
      production cycle (4, 5, 0) at $n = 5$;

(16)  $x = 5$ for $i = 0, 1, 2$   $x = 0$ for $i = 3, 4$,
      production cycle (5, 5, 0, 5, 0) at $n = 6$.

Thus the iterations examine two cycles (5, 4, 0) and (4, 5, 0) at successively
lower levels of inventory, until convergence to the optimal policy is obtained in
(16), thereby verifying (4).
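The sequence of trial policies above can be replayed with a short program. The following is a minimal sketch of policy iteration for $\alpha = 1$, under the simplifying assumption made in Sec. 12.3 that every trial policy has a single cycle (the code raises an error otherwise). The function names `evaluate` and `policy_iteration` are hypothetical; the data are (1) through (3), the normalization is $w_0 = 0$, and exact fractions avoid rounding in the comparison of the test quantity with $w_i$.

```python
from fractions import Fraction

DEMAND, MAX_PROD, MAX_INV, HOLD = 3, 5, 4, 1
PROD_COST = [0, 15, 17, 19, 21, 23]                  # C(0), ..., C(5)

# Arc (i, j): enter with inventory i, produce x = j - i + 3, end with j.
costs = {(i, j): PROD_COST[j - i + DEMAND] + HOLD * j
         for i in range(MAX_INV + 1) for j in range(MAX_INV + 1)
         if 0 <= j - i + DEMAND <= MAX_PROD}

def evaluate(policy, ref=0):
    """Value determination for a single-cycle policy, normalized so w[ref] = 0."""
    node, seen = ref, []
    while node not in seen:                          # walk until a node repeats
        seen.append(node)
        node = policy[node]
    loop = seen[seen.index(node):]                   # nodes on the cycle
    c = Fraction(sum(costs[i, policy[i]] for i in loop), len(loop))
    w = {loop[0]: Fraction(0)}
    for start in policy:
        path, i = [], start
        while i not in w:
            if i in path:
                raise ValueError("trial policy has more than one cycle")
            path.append(i)
            i = policy[i]
        for i in reversed(path):                     # back-substitute along the path
            w[i] = w[policy[i]] + costs[i, policy[i]] - c
    return c, {i: v - w[ref] for i, v in w.items()}

def policy_iteration(policy):
    """Return (final policy, average cost c, iteration index n)."""
    policy, n = dict(policy), 0
    while True:
        c, w = evaluate(policy)
        improved = False
        for i in policy:
            arcs = [j for (a, j) in costs if a == i]
            best = min(arcs, key=lambda j: w[j] + costs[i, j])
            if w[best] + costs[i, best] - c < w[i]:  # test quantity below w_i
                policy[i] = best
                improved = True
        if not improved:
            return policy, c, n
        n += 1

start = {0: 0, 1: 0, 2: 0, 3: 0, 4: 1}               # starting strategy (7)
policy, c, n = policy_iteration(start)
production = {i: j - i + DEMAND for i, j in policy.items()}
```

Starting from (7), this sketch stops at $n = 6$ with production 5 for $i = 0, 1, 2$ and 0 for $i = 3, 4$, in agreement with (16); the average cost per period of that cycle works out to $79/5 = 15.8$.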


►To test your understanding, relax the assumption on production to permit $x = 6$ at
a cost $C(6) = 28.5$. Perform one iteration of the policy space algorithm to verify that an
optimal policy is

(i)   Node 0: (0,3)   Node 1: (1,3)   Node 2: (2,3)
      Node 3: (3,0)   Node 4: (4,1),

implying

(ii)  $x = 6$ for $i = 0$   $x = 5$ for $i = 1$
      $x = 4$ for $i = 2$   $x = 0$ for $i = 3, 4$,
      production cycle (6, 0),

and $c = 15.75$.


12.6 LINEAR PROGRAMMING APPROACH 

You will now see how to formulate a discrete dynamic programming problem
in an unbounded horizon as a linear programming model. This connection will
help you understand why the policy iteration algorithm converges in a finite
number of trials, since it is closely related to the simplex method. Furthermore, you
consequently will know how to convert such a dynamic programming problem to
a format that permits the application of standard linear programming computer
codes. The conversion has practical significance in actual situations, since highly
sophisticated linear programming algorithms are readily available for all
large-scale electronic computers. A corollary advantage is that you can then employ
the well-developed techniques for sensitivity analysis.

Minimum present value. Consider the case $0 \le \alpha < 1$ and the
appropriate functional equations

(1)   $y_i = \min_{(i,j)\ \text{in network}} (\alpha y_j + c_{ij})$ for each $i$.

These equations imply that every $y_i$ must satisfy the linear constraints

(2)   $y_i \le \alpha y_j + c_{ij}$   or   $y_i - \alpha y_j \le c_{ij}$
      for each arc $(i,j)$ in network   (dual arc constraints).

The $y_i$ are unrestricted in sign; however, the set of inequalities (2) does not fully
characterize extremal equations (1), because values of $y_i$ that meet (2) may not
also satisfy (1). For example, if all $c_{ij} > 0$, then $y_i = 0$ is consistent with (2) but
not (1). For each $i$, (2) must hold with equality for at least one arc $(i,j)$.
Surprisingly, the correct optimal values for all $y_i$ can be achieved by applying the
linear objective function

(3)   maximize $\sum_{k=1}^{p} r_k y_k$   (dual objective),

where each $r_k > 0$ but otherwise is arbitrary, and $p$ is the largest node number in
the network.

Denote the restrictions (2) and the objective function (3) as the dual problem,
so the primal model becomes
(4)   minimize $\sum\sum_{(i,j)\ \text{in network}} c_{ij} x_{ij}$   (primal objective)

subject to

(5)   $\sum_{(k,j)\ \text{in network}} x_{kj} - \alpha \sum_{(i,k)\ \text{in network}} x_{ik} = r_k$ for each Node $k$   (primal node constraints)

(6)   $x_{ij} \ge 0$ for each $(i,j)$ in network.

As usual, the double summation sign in (4) indicates the sum is over all arcs $(i,j)$
in the network. For each Node $k$ in the network, the single summation signs in
(5) indicate the sums are over arcs $(k,j)$ and $(i,k)$ with the particular value of
$k$ as given. The dual and primal linear programming technologies for the network
of Example 1 in Sec. 12.4 (illustrated in Fig. 12.1, p. 415) are shown in Figs. 12.8
and 12.9. (The primal technology is closely related to that of the generalized
network problem in Sec. 6.9.)
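A pair of candidate solutions can be checked against (2) through (6) without running a linear programming code: if the $y_i$ are dual feasible, the $x_{ij}$ are primal feasible, and the two objective values agree, then both are optimal by duality. The sketch below does this check under stated assumptions: $\alpha = \tfrac{1}{2}$, all $r_k = 1$, the arc costs of Example 2 in Sec. 12.4 ($c_{23} = 3$), and candidate values that correspond to the strategy using arcs (1,4), (2,3), (3,2), (4,1).

```python
from fractions import Fraction

alpha = Fraction(1, 2)                                # assumed discount factor
costs = {(1, 2): 4, (1, 4): 3, (2, 1): 4, (2, 3): 3, (3, 2): 1, (4, 1): 1}
r = {k: 1 for k in (1, 2, 3, 4)}                      # any positive weights will do

# Candidate dual values y_i and primal arc values x_ij (zero where omitted).
y = {1: Fraction(14, 3), 2: Fraction(14, 3), 3: Fraction(10, 3), 4: Fraction(10, 3)}
x = {(1, 4): 2, (4, 1): 2, (2, 3): 2, (3, 2): 2}

# Dual arc constraints (2): y_i - alpha * y_j <= c_ij on every arc.
dual_feasible = all(y[i] - alpha * y[j] <= c for (i, j), c in costs.items())

# Primal node constraints (5) and nonnegativity (6).
primal_feasible = all(v >= 0 for v in x.values()) and all(
    sum(v for (i, _), v in x.items() if i == k)
    - alpha * sum(v for (_, j), v in x.items() if j == k) == r[k]
    for k in r)

dual_objective = sum(r[k] * y[k] for k in r)          # objective (3)
primal_objective = sum(costs[arc] * v for arc, v in x.items())  # objective (4)
```

Both feasibility flags come out true and the two objective values coincide, so under these assumptions the candidate strategy is optimal for the chosen $\alpha$.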


  Arcs   |  y_1   y_2   y_3   y_4  |
  -------+-------------------------+----------
  (1,2)  |   1    -α               | ≤ c_12
  (1,4)  |   1                -α   | ≤ c_14
  (2,1)  |  -α     1               | ≤ c_21
  (2,3)  |         1    -α         | ≤ c_23
  (3,2)  |        -α     1         | ≤ c_32
  (4,1)  |  -α                 1   | ≤ c_41
  -------+-------------------------+----------
         |  r_1   r_2   r_3   r_4  | Maximize

  ($y_i$ unrestricted in sign)

Figure 12.8. Dual Linear Programming Model for Example 1 in Fig. 12.1.

Figure 12.9. Primal Linear Programming Model for Example 1 in Fig. 12.1.


Simplex method and policy iteration. Suppose you apply the
simplex method to the primal problem (4) through (6). Since $r_k > 0$ for each
Node $k$, at least one $x_{kj} > 0$. A trial solution at each iteration of the simplex method
consists of a basic solution with $p$ variables, where $p$ is the number of nodes. Such
a solution must therefore contain exactly one $x_{kj}$ for each Node $k$. In other words,
a basic solution in the primal model corresponds to a trial strategy in the dual
model.

When a new $x_{kj}$ is introduced in the basis, the old $x_{kj'}$ must leave to preserve
feasibility. Thus at each iteration of the simplex method, a new arc enters the
solution and an old one leaves for a particular Node $k$. After the change, the new
solution is tested for optimality by a calculation tantamount to checking whether
all the inequalities in (2) are satisfied for the current set of dual variables. The
latter values are the same as the $y_i$ for the current policy, so the usual optimality
criterion in the simplex method agrees with the policy improvement rule in the
policy space algorithm.

You can now discern the only real difference between the simplex approach
applied to the primal problem and the policy iteration technique applied to the
dual problem. At every iteration, the latter algorithm looks at all nodes and makes
a change for every potential improvement. Consequently, the policy space algorithm
may alter the trial solution basis by introducing several new variables (arcs) at
one iteration. In contrast, the simplex method brings into the basis only one new
variable at a time, and retests for optimality. Whether such multiple basis changes
between optimality checks are computationally more efficient has not been
determined on theoretical grounds. The relative merit of the two approaches largely
depends on the technicalities involved in designing a computer program for a
particular electronic calculator.

The connection between the two approaches also provides a means of proving
that the policy space successive approximation algorithm converges to an optimal
solution in a finite number of iterations. Only slight modifications of the arguments
showing optimality and finiteness of the simplex algorithm are required.

►The following line of reasoning can be made rigorous to demonstrate that you can
arbitrarily select positive $r_k$ in the dual objective function (3) and the primal constraints
(5). Assume you have solved the dual model for a particular set of $r_k$, and let $x_{ij}$ have
values corresponding to this strategy. According to the Dual Theorem in Chap. 5, the
$x_{ij}$ exist and are finite. Specifically, let $x_B$ be the associated optimal basis vector. The
values of $x_B$ can be found from the primal node constraints (5):

(i)   $(I - A)\,x_B = r$,

where $I$ is a $p \times p$ identity matrix, $A$ is the matrix of appropriate coefficients $\alpha$, and $r$
is the column vector of $r_k$. You can rewrite (i) as

(ii)  $x_B = r + A x_B$,

which by repeated substitution of $x_B$ on the right yields

(iii) $x_B = r + A(r + A x_B) = (I + A + A^2 + \cdots)\,r$.

Since $x_B$ is finite and $r > 0$, the sum $(I + A + A^2 + \cdots)$ must converge to a matrix
that you may denote as $(I - A)^{-1}$. The individual elements of each matrix in the sum
are nonnegative, and consequently $(I - A)^{-1}$ is a nonnegative matrix, with diagonal
elements having a value of at least 1.

Therefore, if $r^* > 0$ is used instead of $r$, then

(iv)  $x_B^* = (I + A + A^2 + \cdots)\,r^* \ge r^* > 0$,

so the previous basis remains feasible (and nondegenerate). This substitution of a
different right-hand side leaves the optimality indication (values for the dual variables)
of the basis $B$ unaltered, giving the desired conclusion that the optimality of the strategy
$x_B$ is preserved. ◄

Minimum equivalent average return per period. When $\alpha = 1$,
the dynamic programming functional equation is

(7)   $w_i = \min_{(i,j)\ \text{in network}} (w_j + c_{ij} - c)$ for each $i$,

where, as before, we assume that there exists a path from every Node $i$ to at least
one of the nodes associated with a cycle yielding $c$. By appropriately modifying the
extremal equations, this assumption can be dropped.

As you saw in the numerical examples of Sec. 12.4, the $w_i$ are not unique. What
is more, although the policy iteration algorithm finds a strategy containing a
cycle with a minimal equivalent average return per period, the method does not
distinguish among alternative optimal policies so as to select one that is also
optimal for $\alpha$ close to 1. The corresponding linear programming approach has
this same limitation. There are practical procedures, however, for finding a policy
that is also optimal for $\alpha$ near 1.

You can find the optimal value ofc, and an associated policy, by solving the dual 
model 

(8) maximize c (dual objective) 

subject to 

(9) — Wj + c < Cl) for each arc (i,j) in network (dual arc constraints), 
where Wi and c are unrestricted in sign. The corresponding primal model is 

(10) minimize (primal objective) 

network 

subject to 

(11) 2 = 0 for each Node k (primal node constraints) 

(k,irin a.iSIn 

network network 

(12) 2 2 ~ ^ (time normalization constraint) 

(i./) m 
network 

(13) Vi)>0 for each (i,j) in network. 

The constraint (12) is imposed as a normalizing equation to rule out the
solution $v_{ij} = 0$. As a result, the $v_{ij}$ ordinarily will not be integer-valued, and the
positive $v_{ij}$ in an optimal solution will be associated with a cycle having minimum
equivalent average return. The amount $v_{ij}$ represents the fraction of periods the
system spends in traversing the arc (i, j) when there is a single cycle in an optimal
policy.

When you add (11) for all nodes, the terms on the left-hand side cancel each
other, so there is a redundancy in the set of relations, and any one equation can be
dropped in the computational process. This phenomenon occurred in the network
models of Chaps. 6 and 7, and corresponds to the fact that the $w_i$ are not unique
in (9): if the $w_i$ satisfy (9), so do ($w_i$ + constant).
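The relations (8) through (13) can be checked numerically on a small case. The sketch below is illustrative only: the three-node network and its arc costs are hypothetical (they are not the network of Fig. 12.1), and the helper names are assumptions introduced here. It tests a candidate ($w$, $c$) against the dual arc constraints (9), shows that adding a constant to every $w_i$ leaves feasibility unchanged, and confirms that a primal solution placing weight 1/2 on each arc of a two-arc cycle satisfies (11) through (13) with objective value equal to $c$.

```python
from fractions import Fraction

# Hypothetical arc costs c_ij for a three-node network (not from the text).
COSTS = {(1, 2): 1, (2, 1): 3, (2, 3): 2, (3, 1): 6, (3, 2): 4}
NODES = (1, 2, 3)


def dual_feasible(w, c, costs=COSTS):
    """Check the dual arc constraints (9): w_i - w_j + c <= c_ij."""
    return all(w[i] - w[j] + c <= cost for (i, j), cost in costs.items())


def primal_feasible(v, costs=COSTS, nodes=NODES):
    """Check (11) flow balance, (12) normalization, and (13) nonnegativity."""
    balanced = all(
        sum(v.get(arc, 0) for arc in costs if arc[0] == k)
        == sum(v.get(arc, 0) for arc in costs if arc[1] == k)
        for k in nodes
    )
    normalized = sum(v.values()) == 1
    nonnegative = all(x >= 0 for x in v.values())
    return balanced and normalized and nonnegative


def primal_objective(v, costs=COSTS):
    """Evaluate the primal objective (10)."""
    return sum(costs[arc] * x for arc, x in v.items())


# Candidate dual solution and a shifted copy (every w_i plus the same constant).
w = {1: 0, 2: 1, 3: 1}
c = 2
w_shifted = {node: value + 10 for node, value in w.items()}

# Candidate primal solution: weight 1/2 on each arc of the cycle (1,2), (2,1).
half = Fraction(1, 2)
v = {(1, 2): half, (2, 1): half}

print(dual_feasible(w, c), dual_feasible(w_shifted, c))
print(primal_feasible(v), primal_objective(v))
```

Because the dual value and the primal objective coincide for this hypothetical data, both candidates are optimal for it, which mirrors the role of (14) through (16) in the example that follows.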

Example. The dual and primal models are illustrated in Figs. 12.10 and
12.11 for the network given in Fig. 12.1. Compare the technologies with those
in Figs. 12.8 and 12.9, where $0 \le \alpha < 1$. Verify for Example 3 on p. 418 that

(14) $w_1 = 0$, $w_2 = 2$, $w_3 = 1$, $w_4 = -1$, $c = 2$

is feasible in Fig. 12.10, and that as a result the feasible solutions

(15) $v_{14} = v_{41} = \tfrac{1}{2}$, all other $v_{ij} = 0$

and

(16) $v_{23} = v_{32} = \tfrac{1}{2}$, all other $v_{ij} = 0$

are both optimal in Fig. 12.11. [If the equation for Node 1 is eliminated as
redundant, then $v_{21} = v_{32} = 0$ are also in the basis for the solution (15).]

Figure 12.10. Dual Linear Programming Model for Example 1 in Fig. 12.1.

Figure 12.11. Primal Linear Programming Model for Example 1 in Fig. 12.1.

► Only straightforward modifications in the above formulations are needed to encompass
the situation in which $h_{ij}$ is the number of periods to traverse arc (i, j) and $\alpha_{ij} = \alpha^{h_{ij}}$.

In the minimum present-value case, you merely substitute $\alpha_{ij}$ for $\alpha$ wherever it appears
in the formulas. In the minimum equivalent average return per period case, you need
only place the coefficient $h_{ij}$ in front of c in the extremal equations (7) and the associated
dual arc constraints (9). Likewise, place the coefficient $h_{ij}$ in front of $v_{ij}$ in the time
normalization constraint (12). In this version, $h_{ij} v_{ij}$ is the fraction of periods the system
spends in traversing arc (i, j) when there is a single cycle in an optimal policy. ▲
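As a rough check on these modifications, consider a single cycle. Summing the modified extremal equations $w_i = w_j + c_{ij} - h_{ij} c$ around the cycle makes the $w_i$ cancel, leaving $c = \sum c_{ij} / \sum h_{ij}$; flow balance forces one common value $v$ on every arc of the cycle, and the modified normalization $\sum h_{ij} v_{ij} = 1$ fixes it at $1 / \sum h_{ij}$. The sketch below evaluates these quantities for a hypothetical three-arc cycle. The costs, traversal times, and function name are made up for illustration.

```python
from fractions import Fraction

# Hypothetical cycle: arc -> (cost c_ij, periods h_ij). Illustrative data only.
CYCLE = {(1, 2): (4, 1), (2, 3): (5, 2), (3, 1): (3, 1)}


def cycle_summary(cycle):
    """Return (c, v, time_share) for a policy consisting of one cycle.

    c          : equivalent average return per period, sum(c_ij) / sum(h_ij)
    v          : common arc weight implied by sum(h_ij * v_ij) = 1
    time_share : fraction of periods spent on each arc, h_ij * v_ij
    """
    total_cost = sum(cost for cost, _ in cycle.values())
    total_time = sum(periods for _, periods in cycle.values())
    c = Fraction(total_cost, total_time)
    v = Fraction(1, total_time)
    time_share = {arc: periods * v for arc, (_, periods) in cycle.items()}
    return c, v, time_share


c, v, share = cycle_summary(CYCLE)
print(c, v, share)
```

With every $h_{ij} = 1$ the same function reduces to the ordinary cycle average used in the unmodified model.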


12.7 CONCLUDING REMARKS 

In this chapter, the successive approximation methods have been applied to 
extremal equations for dynamic programming models with unbounded horizons 
that satisfy the assumptions 
(i) The decision outcomes are deterministic.

(ii) The state of the system is examined at discrete points in time. 

(iii) Both the decision and state variables are discrete and have a finite number 
of possible values. 

(iv) The system parameters are stationary. 

In Chaps. 16, 17, and 18 you will study models where assumption (i) is weakened 
to include probabilistic outcomes. Random events introduce a new type of behavior
to be encompassed by the optimization process. However, these models are
not solved much differently from those in Chaps. 8 through 12. 

As you will see in the waiting line models of Chap. 20 and Appendix III, it
is helpful to view certain real situations as if decisions can be made at any moment,
not only at the discrete points in time given in assumption (ii). For example, during
“rush hours” a supermarket manager may decide to open an additional customer 
checkout stand whenever he sees the waiting lines getting too long. Obviously, 
he is not restricted to making this decision only at periodic intervals, such as every 
five minutes. In these situations the various successive approximation approaches 
you have studied can be adapted to provide workable optimization techniques. 

Assumption (iii), relating to the variables being discrete and finite-valued, is 
often imposed for either analytic or computational convenience. Frequently, 
however, a real system can be modeled just as well by letting the decision and 
state variables be continuous, and even unbounded. To illustrate, consider the 
elementary inventory model summarized in Sec. 12.1. There we assumed that the 
production quantity x must be integer-valued. Since the demand quantity is 
integer-valued, the production assumption implied that inventory would be 
integer-valued. Suppose, instead, that we think of the production level as a rate 
(quantity per period), and drop the postulate that it must be integer-valued. 
Then the recursion for an unbounded horizon is

(1) $f(i) = \min_{x} [C(x) + h(i + x - d) + \alpha f(i + x - d)]$,

where now $-\infty < i < +\infty$ and x is any nonnegative real number at least as
large as $i - d$.

Analyzing such a model involves several technical details you have not yet
encountered. For example, assuming $f(i)$ exists and is unique, you would want to
discover when $f(i)$ also is a continuous function. Similarly, you would want to
determine whether the function $x^*(i)$ giving an optimal policy is single-valued and
continuous.

When continuous variables are used in a representation like (1), you can sometimes,
by applying the calculus, derive a simple explicit formula for the solution
$f(i)$ and the decision function $x^*(i)$. The advantages to such results are significant.
Not only is the computation of specific answers simplified, but the structure of an 
optimal policy and its dependency on the economic parameters are rendered 
more apparent. The solution techniques for these continuous problems are of 
principal interest to the operations research specialist, and consequently will not 
be explored further here. 
*Weaker stationarity assumption. Notice how assumption (iv) has
been utilized in the network model of Sec. 12.1. The hypothesis is that the set of
available arcs (i, j) and the associated costs remain unchanged period after
period. By auspiciously defining the state variables, you actually can encompass
a wider variety of models than might appear at first sight. The following example
suggests how the case of cyclic variation can be handled.

Consider the elementary inventory model studied in Sec. 12.5. There you
assumed that $D_t = 3$ at every period. Suppose, instead, that demand alternates
between 3 and 2. Then you no longer have only five nodes for the state variable
i = 0, 1, ..., 4 representing the level of entering inventory. Instead you denote
the level of entering inventory as i' = 0, 1, ..., 4 when the period's demand is
3, and i" = 0, 1, ..., 4 when demand is 2. Thus the new network consists of
two sets of five nodes. A feasible decision to produce x in a $D_t = 3$ period corresponds
to an arc from Node i' to Node i" = i' + x − 3. Likewise, a decision x
when $D_t = 2$ corresponds to an arc from Node i" to Node i' = i" + x − 2.
With this device you can also let production and inventory holding costs alternate
between periods.
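The two-layer construction can be sketched in a few lines. The code below is only an illustration under stated assumptions: entering inventory is limited to 0 through 4 as in the text, production is assumed to be limited to 5 units per period (the limit quoted in the exercises for the Sec. 12.5 example), and arc costs are omitted because the cost function is not restated here. The function name and the (level, phase) node labeling are hypothetical; phase 0 stands for the primed nodes (demand 3) and phase 1 for the double-primed nodes (demand 2).

```python
MAX_INVENTORY = 4   # entering inventory levels 0..4, as in the text
MAX_PRODUCTION = 5  # assumed production limit per period
DEMANDS = (3, 2)    # demand alternates between 3 and 2


def build_alternating_network(demands=DEMANDS,
                              max_inv=MAX_INVENTORY,
                              max_prod=MAX_PRODUCTION):
    """Return {((i, phase), (j, next_phase)): x} for the layered network.

    Each node is a pair (entering inventory, phase). Producing x in a period
    with demand d moves the system from (i, phase) to (i + x - d, next phase).
    """
    arcs = {}
    n_phases = len(demands)
    for phase, demand in enumerate(demands):
        next_phase = (phase + 1) % n_phases
        for i in range(max_inv + 1):
            for x in range(max_prod + 1):
                j = i + x - demand
                if 0 <= j <= max_inv:  # ending inventory must stay in range
                    arcs[((i, phase), (j, next_phase))] = x
    return arcs


arcs = build_alternating_network()
nodes = {node for arc in arcs for node in arc}
print(len(nodes), len(arcs))
```

Attaching a cost to each arc (possibly a different cost function for each phase) turns this arc list into the input for the value iteration or policy iteration methods of the chapter.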

*Exploiting structure. The regeneration model in Sec. 11.4 demonstrated
how the numerical solution of a functional equation can be simplified
if you have information about the form of an optimal policy. You will see other
examples in Chap. 18 and Appendix II. As a preview, consider the value iteration
operation

(2) $\min_{(i,j)\ \text{in network}} (\alpha y_j^n + c_{ij})$.

In some problems, information about the economic parameters and decision
possibilities may show that if arc (i, j*) is optimal when i = 0, then (i, j*) also is
optimal for all i < j*. Knowledge of the i = 0 solution may thus eliminate your
having to solve for many other values of i.

► For another illustration, let

(i) $g_i(j) = \alpha y_j + c_{ij}$ for each i,

and suppose j = 1, 2, ..., $N_i$. Then if the model allows you to establish that $g_i(j)$ is concave
in j, the solution in (2) is simply j = 1 or j = $N_i$. Thus only two values of j need to be
tested in (2) for each i. If $g_i(j)$ is convex, a different simplification is possible. Although
usually more than two values of j must be tested, not all $N_i$ need be examined.

An efficient procedure for searching the minimum of a convex function is based on
the so-called Fibonacci numbers. Actually, the method works for any function
$g(j)$ such that $g(j)$ is strictly decreasing for $1 \le j \le j^*$, and strictly increasing for $j > j^*$.
These functions are called ditonic. [If $g(j)$ is ditonic, $-g(j)$ is called unimodal.]
The sequence of Fibonacci numbers is

(ii) 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, ...,

in which each successive number in the sequence is the sum of the previous two numbers.
Let $F_n$ be the nth number in (ii). Then, by a so-called Fibonacci search, the minimum of
$g(j)$, for j = 1, 2, ..., ($F_n - 1$), can be determined in (n − 1) tests.

For example, suppose $F_n - 1 = 20$, that is, n = 7. Then only n − 1 = 6 tries are
required. Start with the Fibonacci number $F_{n-1}$ and the next smaller number $F_{n-2}$. In
the example, $F_6 = 13$ and $F_5 = 8$. Calculate and compare $g(13)$ and $g(8)$. On the basis
of the comparison, you will make the subsequent test employing the next smaller
Fibonacci number, 5. Specifically, if $g(8) < g(13)$, you have reduced the interval of
search for the minimum $g(j)$ to j = 1, 2, ..., 12. As a result, test $g(j)$ with j = 5 and
j = 8. If, instead, $g(8) > g(13)$, you have reduced the interval of search to
j = 9, 10, ..., 20. As a result, test with j − 8 = 5 (j = 13) and j − 8 = 8 (j = 16).
In either case, for the subsequent test you need to calculate $g(j)$ for only one new value
of j. The procedure continues in this fashion and terminates with the minimum $g(j)$
after six values of j are tested. ▲
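The search just described can be sketched as follows. This is a minimal illustration, not a definitive implementation: the function name, the caching of evaluated points, and the treatment of ties are assumptions. When the number of points plus one is not a Fibonacci number, the sketch pads the range with fictitious points of infinite value, which is one possible convention and is not taken from the text. The Fibonacci sequence is indexed as in (ii), beginning 1, 2, 3, 5, ....

```python
import math


def fibonacci_search(g, n_points):
    """Locate the minimizer of a ditonic g over j = 1, ..., n_points.

    Returns (j_min, evaluations), where evaluations counts distinct calls to g.
    """
    fib = [1, 2]
    while fib[-1] < n_points + 1:
        fib.append(fib[-1] + fib[-2])
    k = len(fib) - 1          # fib[k] >= n_points + 1
    lo, hi = 0, fib[k]        # candidates are lo + 1, ..., hi - 1
    cache = {}

    def value(j):
        if j > n_points:      # padded point beyond the real range
            return math.inf
        if j not in cache:
            cache[j] = g(j)
        return cache[j]

    # Invariant: hi - lo == fib[k]; one of the two test points is always reused.
    while k >= 2:
        a = lo + fib[k - 2]
        b = lo + fib[k - 1]
        if value(a) <= value(b):
            hi = b            # minimum lies among lo + 1, ..., b - 1
        else:
            lo = a            # minimum lies among a + 1, ..., hi - 1
        k -= 1
    return lo + 1, len(cache)


# The text's case: F_n - 1 = 20 points, so at most n - 1 = 6 evaluations.
j_min, evaluations = fibonacci_search(lambda j: (j - 7) ** 2, 20)
print(j_min, evaluations)
```

For the 20-point case the first two test points are j = 8 and j = 13, and each later step evaluates only one new point, in line with the description above.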


REVIEW EXERCISES 

1 Consider the inventory model in Sec. 12.1. Suppose the only possible levels for entering 
inventory i are i = 0, 1, ..., 7. Assume that demand d = 4.

(a) How many different values for $f(i)$ in (2) must be determined? Write the extremal
equation for each of these values, and indicate the values of x to be considered 
in the minimization. 

(b) Suppose the optimal production policy is produce 11 — i for i ^ 5, and produce 0 
otherwise. Explain how you would determine the values for $f(i)$ (write the mathematical
equations).

(c) Draw a network representation of the entire optimization problem, and indicate 
the path implied by the optimal policy in part (b). 

2 Consider the network characterization in Sec. 12.1. 

(a) Explain why the corresponding value $y_i$, denoting the present value of an optimal
unending path starting at Node i, is finite.

(b) Explain why all $y_i$ necessarily satisfy the extremal equations (3).

3 Consider the network characterization in Sec. 12.1. Suppose that there are six nodes, 
and there are arcs in each direction between every pair of nodes. 

(a) How many extremal equations in (3) must be satisfied? Write each of these, 
showing all terms that must be considered in the minimization. 

(b) Suppose an optimal set of arcs is (1, 2), (2, 3), (3, 4), (4, 5), (5, 6), and (6, 1). 
Indicate how you can solve a set of simultaneous linear equations to find the
associated values for $y_i$.

4 In the value iteration algorithm, in Sec. 12.2, there need not be only one policy that
is optimal at the current decision for all finite $n \ge N$, no matter how large the value
for N. The text states that there is an n* such that, for each n > n*, any optimal strategy
for the current decision in the n-period finite horizon model is also an optimal stationary
strategy for the unbounded horizon model. Are these two assertions contradictory?
Explain why.

5 Consider the value iteration algorithm in Sec. 12.2, and suppose you calculate $y_i^0$
using an initially selected trial strategy. Suppose further that you stop the process at
iteration m. Describe a strategy that yields the associated values $y_i^m$. Is this strategy
necessarily stationary?

*6 Consider the numerical illustration (I) in the advanced material following the description
of the value iteration algorithm in Sec. 12.2.

(a) Draw the network corresponding to (I).

(b) Verify that the values in (II) satisfy the functional equations for the network, and
that arcs (1, 2) and (1, 3) are both optimal.

(c) Given the values for $y_i^n$ in (V), calculate $y_i^4$ and $y_i^5$, and check that arc (1, 3) is
strictly better at n = 4 and arc (1, 2) is strictly better at n = 5.

7 In each part below, first find the values for $y_i$ given the analysis in Sec. 12.4. Then
apply the value iteration algorithm in Sec. 12.2, starting with $y_i^0 = 0$. Letting n'
denote the first iteration at which the entire policy agrees with the optimal policy, 
terminate the calculations at n' + 3. Plot the values for^{* and^f. 

(a) Example 1 in Fig. 12.1, with $\alpha = .8$.

(b) Example 1 in Fig. 12.1, with cc = I, 

(c) Example 4 in Fig. 12.3 (ri 2 = 1.5), with oi = i, 

(d) Example 4 in Fig. 12.3 (ri 2 = 1.5), with o: = i. 

*(e) In each part, verify that for every i,

$|y_i^n - y_i| \le \alpha d_{n-1}$ where $d_{n-1} = \max_k |y_k^{n-1} - y_k|$.

8 In each part, apply the value iteration algorithm in Sec. 12.2, computing from the 
initial set of arcs indicated below. Letting n' denote the first iteration at which the 
entire policy agrees with the optimal policy, terminate the calculations at n' + 3. 
Plot the values for^J* and 

(a) Example 1 in Fig. 12.1, with $\alpha = .8$, and starting with arcs (1, 4), (4, 1), (2, 3),
and (3, 2). 

(b) Example 1 in Fig. 12.1, with cc = and starting with arcs (1, 4), (4, 1), (2, 1), 
and (3, 2). 

(c) Example 4 in Fig. 12.3 (ci 2 = 1.5), with = j, and starting with arcs (1,3), 
(3,2), and (2, 3). 

(d) Example 4 in Fig. 12.3 {ci 2 = 1.5), with d = i, and starting with arcs (1,2), 
(2, 3), and (3,2). 

*(e) In each part, verify that for every i,

$|y_i^n - y_i| \le \alpha d_{n-1}$ where $d_{n-1} = \max_k |y_k^{n-1} - y_k|$.

9 Consider a network with Nodes 1 and 2, and arcs corresponding to Cn = <^22 = 

Cjj = C 21 = 10. Let d = i. Inspect the diagram and compute and ^ 2 * Apply the 
value iteration algorithm in Sec. 12.2, with $y_1^0 = 0$ and $y_2^0 = 40$, up to n = 5. Plot
$y_1^n$ and $y_2^n$. Are the $y_i^n$ monotonic?

10 Consider the network characterization in Sec. 12.1. Suppose for each Node i you select
an arc (i, j).

(a) Explain why your selection of arcs includes at least one cycle. If there are two
cycles, indicate why the nodes in each cycle are disjoint (that is, why a node is not
included in more than one cycle).

(b) Consider the value determination routine (2) in Sec. 12.2, and suppose that your
selection of arcs includes only one cycle, namely, the cycle formed by arcs (1, 2),
(2, 3), and (3, 1). Write the associated equations in (2). Show that the solution
to these simultaneous linear equations yields the values

$y_1 = c_{12} + \alpha c_{23} + \alpha^2 c_{31} + \alpha^3 c_{12} + \cdots$

$y_2 = c_{23} + \alpha c_{31} + \alpha^2 c_{12} + \alpha^3 c_{23} + \cdots$

$y_3 = c_{31} + \alpha c_{12} + \alpha^2 c_{23} + \alpha^3 c_{31} + \alpha^4 c_{12} + \cdots$

*(c) Suppose you apply policy iteration, where thej{ are as given in part (b). Further, 
suppose that you find at Steps 3 and 4 that = cLj^ -f and F? <>?, and 
yj for all other i. Show that <>{, and, as a consequence, that 
for every Node i. 

11 In Sec. 12.3, the text assumes for simplicity that there exists a path from every Node i
to at least one of the nodes associated with a cycle yielding the minimum equivalent
average return per period ($\alpha = 1$).

(a) What complication arises if this assumption is dropped? {Hint: consider the 

network associated with the arc costs Cxz = c^x = ^i 3 = = ^ 4 s == 2.) 

(b) Explain why there is no need to make such an assumption in the case $0 \le \alpha < 1$.

12 (a) Give a verbal interpretation of $w_i$ in the definitions (1) and (2) of Sec. 12.3.

(b) Show the intermediate algebraic steps that allow (3) to be written as (4).

(c) Take the expression on the right of (3) and move it to the left; then show that for
$\alpha = 1$, the result is (6).

(d) Suppose you interpret $w_i - w_j$ as the incremental cost associated with being in
Node i rather than Node j, given an optimal policy. Present a verbal description
of the extremal equations (6) using this interpretation.

(e) Suppose you are given values for $w_i$ that satisfy the extremal equations (5) or
(6). Explain why adding a constant to these values yields new amounts for the
$w_i$ that also satisfy the equations. (Illustrate your argument with a network having
three nodes and arcs in each direction between every pair of nodes.)

13 Consider the policy iteration algorithm for $\alpha = 1$ described in Sec. 12.3.

(a) Suppose the network contains Nodes 1, 2, ..., p. How many equations and how
many unknowns are in the linear system (7)?

(b) Suppose the trial policy has the cycle (1, 2), (2, 3), and (3, 1). Write the associated
equations (7), and show that $c = (c_{12} + c_{23} + c_{31})/3$, as claimed in (9).

(c) Suppose the arc (i, j) is in an optimal policy. What is the value of $w_i - w_j$?
Suppose the arc (j, k) is also in this same optimal policy. What is the value of
$w_i - w_k$?

(d) Rewrite the steps of the policy iteration algorithm appropriate to the extremal
equations as expressed by (6).

(e) Rewrite the steps of the policy iteration algorithm if the objective is to "maximize"
average return per period.
*14 Consider a network model in which $h_{ij}$ time periods are required to traverse arc
(i, j) and not all $h_{ij}$ are equal to 1. This case is discussed in the advanced material at the
end of Sec. 12.3.

(a) Verify that (3), (4), and (5) become (I), (II), and (III).

(b) Verify that $w_i - w_j$ can still be interpreted as the incremental cost associated
with being in Node i rather than Node j for $\alpha = 1$. Present a verbal description
of the extremal equations (III) using this interpretation.

15 (a) In Sec. 12.4 verify that the solution to (3) is (4).

(b) Verify the algebraic steps in (5) and (6), that arc (1, 4) gives the minimum in (5)
for $0 \le \alpha < 1$, and that arc (2, 1) gives the minimum in (6) for § $\le \alpha < 1$.
Explain why arc (2, 3) is a better decision for values of $\alpha$ below |.

(c) Show the changes in (2) through (6) if arc (2, 3) is used initially instead of arc
(2, 1). Do you arrive at the same conclusions as in part (b)?

(d) Verify that the test quantity for (2, 3) in (6) represents the present value of going
from Node 2 to Node 3, back to Node 2, then to Node 1, and thereafter around
the loop between Nodes 1 and 4. Compare this test value with $y_2$ in part (c).

16 (a) In Sec. 12.4, verify the algebraic steps in (7) through (11), and that the minimum
is as indicated in (11) for $0 \le \alpha < 1$.

(b) Verify that, for $\alpha = 1$, the value determination routine yields equations (12),
and that the solution is (13). Also verify the computations in (14) and (15).
Discuss the assertion in the text that you probably would prefer to use the cycle
between Nodes 2 and 3.

(c) Assume that you employ the policy in (1), giving the present values in (4) for
$0 \le \alpha < 1$. Consider the quantities $y_2 - y_1$, $y_3 - y_1$, $y_4 - y_1$, and set $\alpha = 1$.
Show that the resulting amounts are the same as $w_2 - w_1$, $w_3 - w_1$, $w_4 - w_1$
in (13).

17 Consider the illustration in Fig. 12.3 with $c_{12} = 1.5$ (Example 4).

(a) Demonstrate that when you use arcs (2, 3) and (3, 2), selecting arc (1, 3) is better
than choosing arc (1, 2) over a finite horizon if the number of periods is odd, but
worse if the number of periods is even.

(b) Verify in Sec. 12.4 that (18) is the solution to the equations (17). Check the
algebraic steps in (19), and verify that arc (1, 2) gives a minimum for ^ $\le \alpha < 1$.

(c) Show the changes in (17) through (19) when arc (1, 2) is used instead of arc
(1, 3). Verify the statement in (20).

(d) Perform the policy iteration algorithm for $\alpha = 1$. Check that the solution with arc
(1, 2) is uniquely optimal.

(e) Consider the policy associated with arcs (1, 2), (2, 3), and (3, 2). Assume that
$0 \le \alpha < 1$, and find the associated values for $y_1$, $y_2$, and $y_3$ [expressed analogously
to the formulas in (18)]. Consider the quantities $y_2 - y_1$, $y_3 - y_1$, and set $\alpha = 1$.
Show that the resulting amounts are the same as $w_2 - w_1$, $w_3 - w_1$ in (23).

18 Consider the illustration in Fig. 12.3 with $c_{12} = 2$ (Example 5). Verify the expressions
in (21), and that arc (1, 3) is optimal when $0 \le \alpha < 1$.

19 Consider the illustration in Fig. 12.3 with $c_{12} = 2$ and $\alpha = 1$ (Example 6).

(a) Verify that the value determination equations associated with using arc (1, 2)
are those in (22) and have the solution in (23). 

(b) Write and solve the corresponding value determination equations associated with 
using arc (1,3). 

(c) Verify that both arcs (1, 2) and (1, 3) test optimally in the policy iteration tech¬ 
nique. 

*20 Consider the illustration in Fig. 12.4 for $\alpha = 1$ (Example 7), discussed in the advanced
material at the end of Sec. 12.4.

(a) Write the value determination equations for the policy in (iii) and verify that the 
solution is as indicated in (iv). Also check the test quantities in (v). 

(b) Indicate how the policy in (v) contains two cycles. Verify that equations (vi) 
are solved as shown in (vii). Also check the test quantities in (viii). 

(c) Complete the policy iteration algorithm, and determine that the strategy in (viii) 
is optimal. 

Exercises 21 through 26 refer to the Dependable Manufacturing
Company case described in Sec. 12.5. All equation numbers refer to Sec. 12.5.

21 (a) Assume $0 \le \alpha < 1$. Employing the notation corresponding to (2) in Sec. 12.1,
write in full detail all the functional equations to be satisfied.

(b) Assume $\alpha = 1$. Let $g(i) = w_i$ and $g = c$ in functional equations (6) of Sec. 12.5.
Rewrite the functional equations in this notation and thereby display extremal
equations analogous to those in (2) of Sec. 12.1.

22 In each part below, draw the network containing only the arcs implied by the policy
in

(a) (7) and (11).

(b) (12) through (16).

23 (a) Verify that (8) contains the value equations appropriate to the policy in (7), and
that (9) contains the solution to (8). Show the detailed computations in (10), and
verify that the results are those contained in (11).

(b) Work out in detail the rest of the computations of the policy iteration algorithm, 
and verify the results in Fig. 12.7. Specifically, write and solve the value equations 
and show the comparisons in the test quantity. 

(c) Given the optimal policy in (4), assume that $\alpha < 1$, and find the corresponding
values $y_0, y_1, \ldots, y_4$. Consider the quantities $y_1 - y_0, \ldots, y_4 - y_0$,
and set $\alpha = 1$. Show that the resulting amounts are the same as $w_1 - w_0, \ldots, w_4 - w_0$
for the optimal policy.

24 Apply the policy iteration algorithm ($\alpha = 1$), starting initially with arcs (0, 1), (1, 0),
(2, 1), (3, 2), and (4, 3).

25 Apply the policy iteration algorithm, letting $y_i^0$ be determined according to the policy
in (7). At each iteration, indicate the production cycle implied by the trial policy.
Assume that

(a) $\alpha = .9$.
(b) Of = 4 .

(c) $\alpha = 0$.

(d) Describe the effect $\alpha$ has on an optimal policy.

*(e) What is the smallest value of $\alpha$, and the corresponding interest rate, such that the
policy in (4) remains optimal?

26 Apply the value iteration algorithm, first by letting $y_i^0 = 0$, and second by letting $y_i^0$
be determined according to the policy in (7). Letting n' denote the first iteration at
which the entire policy agrees with the optimal policy (exercise 25), terminate the
calculations at n' + 3. Assume that

(a) $\alpha = .9$.

(b) Of = 4 . 

(c) $\alpha = 0$.

*27 Consider the numerical illustration described in the special material at the end of
Sec. 12.5.

(a) Assume $\alpha = 1$. Verify that the policy in (i) is optimal if production x = 6 is
permitted, with C(6) = 28.5. Also verify that the policy in (i) implies the production
schedule shown in (ii).

♦(b) Determine an optimal policy for of = 4 j ^^d compare it with that in (i). 

28 Consider the numerical example in Sec. 12.5. In each part below, make the indicated
modification, redraw Fig. 12.5, and find an optimal policy using the policy iteration
algorithm for $\alpha = 1$.

(a) Demand in each period is 2, instead of 3. 

(b) Demand in each period is 4, instead of 3. 

(c) Ending inventory cannot exceed 3, instead of 4. 

(d) Production in each period cannot exceed 4, instead of 5. 

(e) Permissible production values are only x = 0, 2, 4.

(f) Production must be at least one unit ($x \ge 1$) in every period.

(g) The setup cost component in (3) is 10, instead of 13. 

(h) The holding cost h is 5, instead of 1. 

*(i) The holding cost h is applied to average inventory within a period, instead of
ending inventory. 

*29 Consider the illustration of the inventory model in Fig. 12.5. Suppose demand can be
left unfilled in a period and is then backlogged, but that the backlog level is never 
permitted to exceed 3. 

(a) Redraw Fig. 12.5 accordingly. 

(b) Suppose the penalty per unit of backlogged demand per period is 10. Show the 
appropriate arc costs in your diagram of part (a). 

30 In each part below, write the dual and primal linear programming technologies
described in Sec. 12.6. First assume $0 \le \alpha < 1$, so that the tables are analogous to
those in Figs. 12.8 and 12.9. Then assume $\alpha = 1$, so that the tables are analogous to
those in Figs. 12.10 and 12.11.

(a) Figure 12.3 with $c_{12} = 1.5$ (Example 4 in Sec. 12.4). Indicate optimal solutions
to the primal and dual problems for $\alpha = 1$.

*(b) Figure 12.4 (Example 7 in Sec. 12.4). 

(c) Inventory model in Sec. 12.5 (illustrated in Fig. 12.5). 

*(d) Indicate optimal solutions to the primal and dual problems in part (c) for $\alpha = .9$.
For $\alpha =$ For $\alpha = 0$. (See exercise 25.)

*31 The special material on "weaker stationarity" in Sec. 12.7 described a network
characterizing an inventory model with alternating demands.

(a) Draw this network. Assuming the cost functions in Sec. 12.5, guess an optimal
strategy, calculate the corresponding average cost ($\alpha = 1$), and check the test
quantities in the policy iteration algorithm to determine whether your suggested 
policy is truly optimal. (Hint: think carefully about the alternating demand 
structure and cost functions in picking your policy.) 

(b) Assume, instead, that demand alternates between 3 and 1 (instead of 3 and 2). 
Answer the questions in part (a). 

*(c) If your guesses in parts (a) and (b) were not optimal, use the policy iteration 
algorithm to determine optimal policies. 


*32 Fibonacci Search. An efficient method for finding the minimum of a ditonic function was
described in the special material at the end of Sec. 12.7. Apply the method to find the
minimum of the functions below.

(a) $g(j) = (j - 7)^2$ for j = 0, 1, ..., 50.

(b) $g(j) = j + (j - 7)^2$ for j = 0, 1, ..., 50.

(c) $g(j) = -2j + (j - 7)^2$ for j = 0, 1, ..., 50.

100 for j = 0, 1,...,10 
90- 3J for 11,..., 20 
30-1-5j for j = 21,...,30 
for j = 31,...,40. 


W gU) = 


33 Explain your understanding of the following terms: 


functional, or extremal, equations 
stationary policy (strategy) 
cycle (or circulation) 

*discrete dynamic programming
problem

successive approximations 
value iteration (successive 
approximations in function 
space) 


policy iteration (successive approximations 
in policy space) 

monotonically increasing (decreasing)
convergence from above (below) 
value determination routine 
normalizing equation 
* Fibonacci numbers 
*ditonic function 
*unimodal function. 
FORMULATION AND COMPUTATIONAL EXERCISES

34 Consider the network in Fig. 12.12, where arrowheads at both ends of an arc signify 
that it is permissible to travel in either direction at the cost indicated. 

Figure 12.12

(a) Let $\alpha = 1$. Solve the associated extremal equations (5) in Sec. 12.3 using policy
iteration. 

(b) Apply policy iteration to find $y_i$ and the associated strategy for $\alpha = .9$. For
$\alpha =$ For $\alpha = 0$. Use the optimal solution in part (a) as the initial policy.

(c) Let $\alpha = .9$. Apply the value iteration algorithm, starting with $y_i = 0$. Letting
n' denote the first iteration at which the entire policy agrees with the optimal
policy, terminate the calculations at n' + 3.

35 Consider the network in Fig. 12.13 where arrowheads at both ends of an arc signify
that it is permissible to travel in either direction at the cost indicated. Answer the
questions in exercise 34.

Figure 12.13

Exercises 36 through 40 require modifying the formulations of problems in the 
exercises of earlier chapters. As before, be sure to define all the symbols you use. 

36 In each part below, assume that costs and returns are to be discounted to the current
period, and $\alpha$ is the one-period discount factor, where $0 \le \alpha < 1$. Indicate an appropriate
finite horizon dynamic programming recursion, analogous to that in (1) of
Sec. 12.1.

(a) The capacity expansion problem in exercise 33 of Chap. 10. 

(b) The ARKA Mutual Fund Problem in exercise 34 of Chap. 10. 

(c) The GAW Company Employment Smoothing Problem in exercise 36 of Chap. 10. 

(d) The Expando Manufacturing Company Problem in exercise 36 of Chap. 2 
and 38 of Chap. 10. 

(e) The production scheduling problem in exercise 39 of Chap. 10, assuming that 
the amount Xj is produced in Period j. Also show the optimal value for Xt, given 
values for X|, ^ 2 ,..., Xj-i. 

37 In each part below, assume that all the data are stationary, the horizon is unbounded,
costs and returns are discounted to the current period, and $\alpha$ is the one-period discount
factor, where $0 \le \alpha < 1$. Indicate an appropriate set of extremal equations, analogous
to those in (2) of Sec. 12.1.

(a) Production smoothing model in exercise 45 of Chap. 8. 

* (b) Production smoothing model in Sec. 9.8. 

(c) Production scheduling model in exercise 53 of Chap. 9. 

(d) Cattle raising problem in exercise 32 of Chap. 10. 

38 Consider the production smoothing model in exercise 45 of Chap. 8. Use the data for 
the illustration in Sec. 12.5; let the target production level x = 3 and the smoothing 
parameter i; = 1. 

(a) Apply the policy iteration algorithm to find an optimal strategy for $\alpha = 1$.

(b) Apply the policy iteration algorithm to find an optimal strategy for $\alpha = .9$.
For q: = ^. For a: = 0. Let the initial trial policy be the optimal policy in part (a). 
Describe the effect $\alpha$ has on an optimal policy.

(c) Let $\alpha = .9$. Apply the value iteration algorithm, starting with $y_i^0 = 0$. Letting
n' denote the first iteration at which the entire policy agrees with the optimal
policy, terminate the calculations at n' + 3.

39 Consider an inventory model analogous to that in Sec. 12.5. Continue to assume that
demand D = 3, production $x \le 5$, and ending inventory $j \le 4$. Let production and
inventory costs be $C(x) + hj$, where

$C(x) = 5x$ for x = 0, 1, 2, 3
$C(x) = 12 + x$ for $x \ge 4$,

and h = 1.

(a) Apply policy iteration to find an optimal strategy for $\alpha = 1$.

(b) Apply policy iteration to find an optimal strategy for $\alpha = .9$. For $\alpha =$ For
$\alpha = 0$. Use the optimal solution in part (a) as the initial policy. Describe the
effect $\alpha$ has on an optimal policy.

(c) Let $\alpha = .9$. Apply the value iteration algorithm, starting with $y_i^0 = 0$. Letting
n' denote the first iteration at which the entire policy agrees with the optimal
policy, terminate the calculations at n' + 3.

(d) Repeat parts (a), (b), and (c) for h = 2.

(e) Repeat parts (a), (b), and (c) for demand D = 2. (Note: finite horizon versions
of these questions are contained in exercises 13 and 15 of Chap. 9.)

40 Answer the questions in exercise 39, assuming that the production cost function is 

CM-f 

l3 + 2x forx^l. 

(Note: a finite horizon version of this question is contained in exercise 14 of Chap. 9.)


MIND-EXPANDING EXERCISES 

41 Consider a network such as that characterized in Sec. 12.1, and assume that the
discount factor $\alpha$ satisfies $0 \le \alpha < 1$.

*(a) Assume you are given a trial strategy (not necessarily stationary) and the associated
present values $y_j$. Suppose that there is at least one Node k (and perhaps several)
for which

$y_k > \min_{(k,j)\ \text{in network}} (\alpha y_j + c_{kj})$.

Consider a stationary strategy which uses arcs giving minimum $(\alpha y_j + c_{ij})$ for each
Node i. Show that this new trial strategy yields $y_j'$ such that $y_j' \le y_j$ for every
Node j, with strict inequality holding for Node k.

(b) Given that the proposition in part (a) is true, show that there exists a stationary 
policy that is optimal among all possible policies. 

42 Consider a network such as that characterized in Sec. 12.1, and assume that the
discount factor $\alpha$ satisfies $0 \le \alpha < 1$. Assume you have available a lower estimate $L_i$
and an upper estimate $U_i$ on each unknown $y_i$, that is, $L_i \le y_i \le U_i$. Show that it is
never optimal to use arc (i, k) when

$$\alpha L_k + c_{ik} > \min_{(i,j)\ \text{in network}} (\alpha U_j + c_{ij}).$$

Suggest in detail how you might use this inequality to speed convergence in
successive approximation algorithms.

*43 Consider the proof of convergence of the value iteration algorithm for $0 \le \alpha < 1$,
as given in the advanced material of Sec. 12.2. Modify the argument to show that
there is at least geometric convergence (vii) for the policy iteration technique.

44 Consider the dual linear programming model for $\alpha = 1$, namely, (8) and (9) in Sec.
12.6, and the corresponding primal model (10) through (13).

(a) Show that there is no loss in assuming that each ^ 0.

(b) Show that there is no loss in assuming that the primal model constraints are (^)
inequalities. 

(c) Show that if all Cfj ^ 0, then there is no loss in assuming that c ^ 0 and that the 
normalization constraint (12) is a (^) inequality. 

45 Suppose you drop the assumption in Sec. 12.3 that there is a path from every Node i
to at least one of the nodes associated with a cycle yielding c. Let $c_i$ be the minimum
average return per period attainable starting at Node i. Then analogous to the extremal
equations (5), the following extremal equations must be satisfied:

$$w_i = \min_{(i,j)\ \text{in network}} (w_j + c_{ij} - c_i) \quad \text{for each Node } i$$

$$c_i = \min_{(i,j)\ \text{in network}} (c_j) \quad \text{for each Node } i.$$

*(a) Devise (heuristic) arguments establishing these extremal equations. 

*(b) Indicate how these equations lead to the algorithmic steps in (i) and (ii) of the 
special material at the end of Sec. 12.3. 

(c) Write the associated dual and primal linear programming models, analogous to 
(8) through (13) in Sec. 12.3. The appropriate dual objective is to maximize 
S?=i ^k^k> where each Tjt > 0 but is otherwise arbitrary. 

(d) Explain why these extremal equations simplify to (5) when the special assumption
in Sec. 12.3 is made.

46 Review the Kleen City Police Department case in exercise 30 of Chap. 2 and the 
general formulation in exercise 27 of Chap. 5. Also review the results in exercise 54 
of Chap. 6, which state that if the number of periods T is odd, then the model is not 
equivalent to a network problem (hence, applying linear programming may not 
yield an integer-valued solution). Explore how to characterize the problem for any 
value of T in terms of a set of extremal equations, analogous to those in (5) or (6) of 
Sec. 12.3 for the network model. Specifically, 

(a) Indicate how the underlying network is drawn. Be sure you label the nodes, and 
state how they characterize the state of the system at each period. Also label the 
arcs to make clear the associated decision, and show how the problem’s constraints 
are taken into account. 

(b) Write the extremal equations associated with your answer in part (a), and indicate 
why their solution gives an optimal strategy for the original model. 

(c) Modify your answers in parts (a) and (b) for the situation in which a person
starting work in Period t remains in the system in Periods t + 1 and t + 2; thus,
the constraints are of the form ^

47 Consider the following composite successive approximations algorithm. Apply value 
iteration for N successive iterations. Use the trial policy at the Nth iteration and 
apply Step 2 of policy iteration. Stop if the extremal equations are satisfied. Otherwise 
repeat the procedure, letting be the values found in the value determination 
routine of policy iteration. Explain why this hybrid approach converges in a finite 
number of iterations. 




CONTENTS

13.1 Quest for a Philosopher’s Stone
13.2 Integer Programming Formulations
13.3 Overview of Integer Programming Algorithms
13.4 Cutting-Plane Algorithm (Method of Integer Forms)
13.5 Branch-and-Bound Algorithm
13.6 Traveling Salesman Problem
13.7 Partial (Implicit) Enumeration Algorithm
Review Exercises
Computational Exercises
Formulation Exercises
Mind-Expanding Exercises













CHAPTER 13


Integer Programming and 
Combinatorial Models 


13.1 QUEST FOR A PHILOSOPHER'S STONE 

This chapter, as well as the next two, investigates programming models in
which the two assumptions called divisibility and additivity (Sec. 2.2) are weakened.
Recall that these postulates in combination implied linearity in both the objective
function and the constraints, and also allowed the variables to take on fractional
values, such as 2.5. Here we drop the divisibility assumption and
treat problems in which some, or all, of the variables are permitted to take on
only integer values (or whole numbers). Later, in Chaps. 14 and 15, we remove
the additivity assumption.

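Consider the model

$$\text{optimize } \sum_{j=1}^{n} c_j x_j \tag{1}$$

subject to

$$\sum_{j=1}^{n} a_{ij} x_j \le b_i \quad \text{for } i = 1, 2, \ldots, m \tag{2}$$

$$x_j \ge 0 \quad \text{for } j = 1, 2, \ldots, n \tag{3}$$

$$x_j \text{ integer-valued} \quad \text{for } j = 1, 2, \ldots, p \ (\le n). \tag{4}$$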


This type of optimization model is referred to as an integer (or diophantine,
or discrete) programming problem. When $p = n$, so that every
variable must be integer-valued, the model is called a pure integer programming
problem; otherwise, it is called a mixed integer programming problem.

Depending on the particular application, the sense of optimization in the objective
function (1) may be either maximization or minimization. (We will indicate
the appropriate sense for each illustration and algorithm below.) Further, an
integer programming problem may include ($\ge$) inequalities and equalities.
[You can view the linear constraints (2) as a canonical form, however, by applying
the procedures given in Chap. 3 whenever the constraints in their original version
do not appear as in (2).]

The discreteness stipulations (4) are what distinguish an integer from a linear
programming problem. In case the linear constraints (2) happen to represent a
network model, there exists an optimal solution to (1), (2), and (3) that also satisfies
the integer-value constraints (4), as you already know from the Integrality
Theorem for networks (p. 192). But, in general, imposing (4) is restrictive, so
that the maximum value of an objective function for an integer programming
problem is never larger, and is typically smaller, than for the corresponding linear
programming problem.

Importance of integer programming problems. We indicated
earlier that most industrial applications of large-scale programming models are
oriented toward planning decisions in the face of complex situations. There are
several frequently occurring circumstances that lead to planning models containing
integer-valued variables.

1. Equipment Utilization. You may define a variable $x_j$ to be the number of pieces of
equipment that are to operate during the model’s planning horizon. If each piece of
equipment provides a large capacity and is expensive (for example, an automatic
screw machine, an oceangoing oil tanker, or a 150-inch double-knife paper
corrugator machine), then a fractional value for $x_j$ may be meaningless
(nonrealizable) in the context of the actual decision problem. In this event, you
would have to restrict $x_j$ to be integer-valued.

2. Setup Costs. You may want to consider an activity that incurs a so-called
fixed cost (or setup cost) $C_j$ whenever the corresponding level $x_j > 0$, where $C_j$
is independent of the actual level of $x_j$. For example, if $x_j$ represents the hourly
utilization of a blast furnace in a steel plant, then $C_j$ represents the cost of starting
up the furnace and heating it to the required temperature. You will see in Sec.
13.2 how to encompass setup costs in a programming model by introducing
integer-valued variables.

3. Batch Sizes. In some production planning situations, you may want to restrict
the level of $x_j$ to be either $x_j = 0$ or $x_j \ge L_j$. For example, $x_j$ may be the amount
of a special product to be manufactured during Period t, and $L_j$ may represent
the minimum possible production batch size for the item. This stipulation is an
example of an “either-or” restriction and can be formulated by introducing integer
variables, as you will learn in Sec. 13.2.

4. Decisions. You may wish to specify other types of “either-or”
situations. To do this you can restrict the levels of $x_j$ to either $x_j = 1$ or $x_j = 0$,
representing the decisions “go” or “no go,” and “yes” or “no.” To illustrate, you
may let $x_j = 1$ correspond to building a new factory, or to opening up a sales
territory, or to acquiring another business, or to selling a currently owned asset.
Frequently, these sorts of alternatives are categorized as capital budgeting decisions,
because they require large expenditures of capital and resources. This is the main
reason why integer programming is so important for managerial decisions. An
optimal solution to a capital budgeting problem may yield considerably more
profit to a firm than will an approximate or guessed-at solution. For example,
a cement manufacturing firm with 25 plants may be able to substantially increase
profits by cutting back to 20 plants or fewer, provided this reduction is planned
optimally. The decreased overhead with fewer plants can easily outweigh any
consequent increase in transportation costs, if the new plant configuration is optimal.

Many other decision problems can necessitate integer programming models.
One group of problems deals with sequencing, scheduling, and routing decisions.
An example is the traveling salesman problem. It aims at finding a least-distance
route for a salesman who must visit each of n cities, starting and ending his journey
at City 1. Another example is the machine scheduling problem. A simple illustration
is when n items to be manufactured must be sequenced through each of k
machines. Suppose an item cannot proceed to Machine j until it finishes being
processed on Machine j − 1. Assume the processing time of each item on each
machine is determinate. An optimal sequence of items is then defined as a schedule
that minimizes total elapsed time to complete all jobs on all machines. Further
examples of scheduling, sequencing, and routing problems include line balancing,
critical path scheduling (see Sec. 6.6) with resource constraints, preventive maintenance
scheduling with constraints on labor availability, and truck dispatching.

Despite the considerable attention given in operations research journals to
such integer programming models for sequencing, scheduling, and routing problems,
so far these models have been of only limited practical importance. One
reason is that the integer programming models frequently ignore some of the
critical considerations that arise in actual scheduling environments. Another reason
is that the computational burden of solving these models has been so great that
the cost of obtaining a solution exceeds any improvements resulting from the
approach.

Sequencing, scheduling, and routing problems are special cases of what are
called combinatorial models. A combinatorial optimization problem
consists of finding, from among a finite set of alternatives, one that optimizes the
value of an objective function. For example, the finite set for the traveling salesman
problem with n cities consists of $(n - 1)!$ different possible tours starting and
ending in City 1. In the simple illustration of the machine scheduling problem,
the finite set consists of $(n!)^k$ possible sequences of the n items on the k machines.
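
To get a feel for how quickly these finite sets grow, the short Python sketch below evaluates the two counts just quoted. The function names and the sample sizes are illustrative choices, not part of the text.

```python
from math import factorial


def tour_count(n: int) -> int:
    """Number of tours through n cities that start and end in City 1: (n - 1)!."""
    return factorial(n - 1)


def sequence_count(n: int, k: int) -> int:
    """Number of ways to order n items separately on each of k machines: (n!)^k."""
    return factorial(n) ** k


# Illustrative sizes only; k = 3 machines is an arbitrary choice.
for n in (5, 10, 20):
    print(n, tour_count(n), sequence_count(n, 3))
```

Even at n = 20 the number of tours already has 18 digits, which is why the text treats the counts as a warning rather than as a solution method.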

With sufficient ingenuity, you can always devise a nontrivial integer programming
representation of a combinatorial optimization problem; some of the mathematical
devices for doing so are explained in Sec. 13.2. Frequently, such integer
programming problems are very large (that is, there are an enormous number of
constraints and variables), and therefore the formulation is of negligible
computational interest. But, as you will discover, several of the algorithmic techniques
for integer programming problems can be applied directly to combinatorial models
without first having to transform them into integer programming models.

The search of a lifetime. Would your guess be that integer programming
problems are harder or easier to solve than linear programming models
of the same size? Most students would answer that integer programming problems
are probably easier. To support this opinion, they observe that a dichotomy,
say, such as $x_j = 0$ or $x_j = 1$ contains considerably fewer alternatives than does the
continuum $0 \le x_j \le 1$. And from this comparison they reason that the search for
an optimum ought to be simpler. Regrettably, just the opposite is true: in a
programming model, computation is easier with a continuum than with discrete
alternatives. The following illustration shows why.

Consider the model

$$\text{maximize } 21x_1 + 11x_2 \tag{5}$$

subject to

$$7x_1 + 4x_2 \le 13 \tag{6}$$

$$x_1 \text{ and } x_2 \text{ nonnegative integers.} \tag{7}$$

After a few moments’ inspection, you will find that the unique optimal solution is
$x_1 = 0$ and $x_2 = 3$. Of course, since this problem is so small, you can think of
many ways to discover and verify the solution (including the dynamic programming
technique given for the Distribution of Effort Problem in Sec. 10.2). But
since our goal here is to understand the relative difficulty of solving medium-
to large-scale integer programming problems, let us rule out any solution schemes
that are ad hoc to a particular problem as well as those systematic algorithms that
are practical only for small-scale problems.

There is one approach for finding an integer-valued solution that is frequently
suggested by beginners. The procedure is, first, ignore the integer stipulations,
and solve for an optimal linear programming solution. If this answer satisfies the
integer restrictions, then you have in fact found an optimal solution for the original
problem. (State why.) Otherwise, obtain an integer solution by rounding the
linear programming answer to whole numbers.

Now suppose you try this approach on the example in (5), (6), and (7). The
optimal fractional solution is $x_1 = 13/7$ and $x_2 = 0$. The obvious rounded solution
is $x_1 = 2$ and $x_2 = 0$, which is infeasible. A “rounded-down” solution $x_1 = 1$
and $x_2 = 0$ is feasible, but far from optimal (its objective value is 21, against 33
at the optimum). It is hard to think of any general,
systematic procedure that is practical for rounding a nonintegral solution for a
medium- to large-scale model, and that yields the optimal integer solution when
applied to this toy problem.
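
The arithmetic behind this rounding experiment can be checked with a few lines of Python. The sketch below is only a verification aid for the toy model (5), (6), and (7); it relies on complete enumeration, which the text has already ruled out as a general method, and the function and variable names are illustrative.

```python
from fractions import Fraction


def objective(x1, x2):
    """Objective function (5)."""
    return 21 * x1 + 11 * x2


def is_feasible(x1, x2):
    """Constraint (6) together with nonnegativity."""
    return x1 >= 0 and x2 >= 0 and 7 * x1 + 4 * x2 <= 13


# With a single constraint, the linear programming optimum puts everything into
# the variable with the best return per unit of resource: 21/7 > 11/4, so x1 = 13/7.
lp_solution = (Fraction(13, 7), Fraction(0))
lp_value = objective(*lp_solution)

rounded_up = (2, 0)      # the "obvious" rounding of 13/7
rounded_down = (1, 0)    # the feasible rounding

# Enumerate every lattice point that could possibly satisfy (6).
lattice = [(x1, x2)
           for x1 in range(13 // 7 + 1)
           for x2 in range(13 // 4 + 1)
           if is_feasible(x1, x2)]
best = max(lattice, key=lambda point: objective(*point))

print("LP optimum:", lp_solution, "value", lp_value)
print("rounded up feasible?", is_feasible(*rounded_up))
print("rounded down value:", objective(*rounded_down))
print("integer optimum:", best, "value", objective(*best))
```

Note that the integer optimum lies nowhere near the fractional optimum: the two solutions do not even use the same variable.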

If some of the coefficients $a_{ij}$ in the linear constraints (2) are negative for a
particular model, the problem of rounding a linear programming answer to a
feasible integer-valued solution can be a difficult task in itself. Hence, although
rounding may succeed for some applications, you cannot expect such an approach
to succeed in general.

An even more serious difficulty inherent in integer programming problems
is that there is no easy way to verify whether a given feasible solution is actually
optimal. This represents an important distinguishing difference between integer
and linear programming problems. To illustrate, suppose in the above problem
(5), (6), and (7) you want to test whether $x_1 = x_2 = 1$ is optimal. To do this,
you might examine whether the solution represents a local optimum, in that the
objective function does not improve at any neighboring feasible integer (or
lattice) point $x_1 = 1 + d$ and $x_2 = 1 + e$, where $d, e = -1, 0, 1$. Verify that
the feasible neighboring points in this case are ($x_1 = x_2 = 0$; $x_1 = 0$ and $x_2 = 1$;
$x_1 = 0$ and $x_2 = 2$; $x_1 = 1$ and $x_2 = 0$). Check that the solution $x_1 = x_2 = 1$
is indeed better than all of these, and yet is not optimal. Thus a point can be
locally optimal among neighboring lattice points and still not be globally optimal.
(Incidentally, this enumerative test for a local optimum is really not practical
for large-scale problems, since there can be a vast number of neighboring points.
In fact, if the model is a pure integer programming problem and each $x_j$ is restricted
to be either 0 or 1, then searching all the neighboring points of a feasible solution
is equivalent to enumerating all the solutions to the original problem.)
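
The neighborhood test described above is easy to mechanize for the toy model (5), (6), and (7). The following Python sketch, with illustrative function names, lists the feasible lattice neighbors of a point and reports whether the point is a local optimum in the sense just defined.

```python
from itertools import product


def objective(x1, x2):
    """Objective function (5)."""
    return 21 * x1 + 11 * x2


def is_feasible(x1, x2):
    """Constraint (6) together with nonnegativity."""
    return x1 >= 0 and x2 >= 0 and 7 * x1 + 4 * x2 <= 13


def feasible_neighbors(x1, x2):
    """Feasible lattice points (x1 + d, x2 + e) with d, e in {-1, 0, 1}, excluding the point itself."""
    return [(x1 + d, x2 + e)
            for d, e in product((-1, 0, 1), repeat=2)
            if (d, e) != (0, 0) and is_feasible(x1 + d, x2 + e)]


def is_local_optimum(x1, x2):
    """True when no feasible neighbor has a larger objective value."""
    here = objective(x1, x2)
    return all(objective(*point) <= here for point in feasible_neighbors(x1, x2))


print(feasible_neighbors(1, 1))
print(is_local_optimum(1, 1), objective(1, 1))
print(is_local_optimum(0, 3), objective(0, 3))
```

Both (1, 1) and (0, 3) pass the local test, yet only the second is the global optimum, which is exactly the difficulty the paragraph points out.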

By now you may be sufficiently exasperated to wonder whether the best general
approach is merely to enumerate all the feasible solutions, and then pick the best.
Actually, if there are only a few possible solutions, such an exhaustive procedure
may well be easier to implement than any of the algorithms explained later in the
chapter. In some real applications, this has been the very approach used (frequently,
a number of possible solutions are ruled out immediately as “obviously”
nonoptimal). Most often, however, an exhaustive approach proves unworkable,
and the reason, simply, is that the number of feasible solutions is not always
finite, and even when it is, the magnitude is usually stupendous. For example,
consider finding an optimal solution to an integer programming problem consisting
of 100 variables, each restricted to equal either 0 or 1. Then the time to
enumerate all possibilities on the fastest computer in existence far exceeds a
lifetime.
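
A rough calculation makes the last claim concrete. In the Python sketch below, the rate of one billion solutions examined per second is purely an assumed figure chosen for illustration; the text names no particular machine or speed.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600


def years_to_enumerate(num_binary_vars, solutions_per_second):
    """Years needed to examine all 2^n settings of n zero-one variables."""
    return 2 ** num_binary_vars / solutions_per_second / SECONDS_PER_YEAR


# Assumed rate: 10^9 solutions per second (hypothetical).
print(2 ** 100)
print(years_to_enumerate(100, 1e9))
```

Under that assumption the enumeration would take on the order of ten trillion years, so a faster machine by any realistic factor does not change the conclusion.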

Practical algorithms. It is apparent from the foregoing discussion 
that in order for an algorithm to be of general use in solving integer programming 
problems, it must avoid explicitly enumerating all possibilities. What we want are 
techniques that partially enumerate a manageable number of possibilities and 
implicitly enumerate all the rest. Recall that the simplex method is just such a 
technique for solving ordinary linear programming problems—it systematically 
examines only a small number of all the possible basic solutions. (Incidentally, 
mathematicians still do not have a good theory for why the simplex method, in 
practice, does examine so few bases.) By the same token, dynamic programming 
recursions exploit the principle of optimality to circumvent enumerating all feasible
solutions. The success of these partially enumerative optimization methods
motivates the quest for finding similar approaches to solve integer programming 
problems. 

Devising such integer programming algorithms is currently a lively area of 
research among technical specialists. It is too early to single out a few algorithms 
and claim they are as effective for integer problems as the simplex method is for 
linear programming problems. In fact, given any of these algorithms, it is easy to 
devise specific integer programming examples that require computational effort 
equivalent to complete enumeration. Hence, the usefulness of any one of the techniques
must ultimately be established by empirical findings indicating the types of
problems it is likely to solve well. 

Several general concepts are emerging as clearly fundamental to the various 
approaches that now look promising. You will study these ideas starting in Sec. 
13.3. Importantly, most of the algorithms employ linear programming models 
and dynamic programming notions as elemental building blocks. Encouraging 
evidence of progress in solving real integer programming problems is rapidly 
accumulating. When such algorithms become perfected, operations research 
analysts, equipped with high-speed computers, should indeed be credited with 
having created a philosopher’s stone. 

13.2 INTEGER PROGRAMMING FORMULATIONS 

If you review the various planning models in Chap. 2, you can easily envisage 
situations in which certain of the variables must be integer-valued. There are a 
myriad of other examples, primarily dealing with the allocation of resources over 
a short horizon (a day, week, or month), that also can be idealized by an integer 
programming formulation. To illustrate, these include assigning transportation 
vehicles of various sizes to satisfy certain pickup and delivery requirements; 
scheduling several assortment mixes for a paper-making or steel-rolling machine 
to meet a variety of customer demand commitments; and allocating a number of 
large orders to a company’s several manufacturing plants that differ in capacities, 
efficiencies, and costs. Being familiar with the techniques of constructing linear 
programming models, you will find it a straightforward matter to formulate these 
resource allocation problems as integer programming models; therefore we will 
provide no further elaboration in this chapter. It would be misleading, however, 
to give you the impression that these situations constitute the main focus of interest 
in managerial decision-oriented applications. 

By far, the most significant applications deal with setup costs, dichotomous
choices, and complex capital budgeting decisions. Constructing integer programming
formulations to handle such considerations is partly an art; hence, we explain
below some of the standard devices that are useful. To further enhance your
understanding, we also provide an illustration of how to formulate a combinatorial
problem as an integer programming model. The example we have selected is the
traveling salesman problem. The important point of this illustration is your seeing
that there is a connection between combinatorial and integer programming problems.
Although such integer formulations themselves are frequently of little immediate
practical consequence (even a small combinatorial problem usually becomes an
enormous integer model), the connection does provide the bridge spanning several
fundamental algorithmic concepts.

Plant location—an example with setup costs. In Chap. 6 you 
studied the classical transportation problem: 

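$$\text{minimize } \sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij} \tag{1}$$

subject to

$$\sum_{j=1}^{n} x_{ij} \le S_i \quad \text{for } i = 1, 2, \ldots, m \quad \text{(supply)} \tag{2}$$

$$\sum_{i=1}^{m} x_{ij} \ge D_j \quad \text{for } j = 1, 2, \ldots, n \quad \text{(demand)} \tag{3}$$

$$x_{ij} \ge 0 \quad \text{for all } i \text{ and } j. \tag{4}$$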



Recall that one interpretation of the problem is that there are m supply points
with items available to be shipped to the n demand points. Supply Point i can
ship at most $S_i$ items, and Demand Point j requires at least $D_j$ items. The cost of
shipping a unit from Supply Point i to Demand Point j is $c_{ij}$. The objective is to
select a routing plan that minimizes total transportation costs.

In some transportation planning applications, part of the problem is to establish
which of m possible supply points should actually be in operation. The supply
points usually represent plants or warehouses, and incur overhead expenses that
must be combined with the transportation costs in determining company profit.

The typical cost tradeoff is between reduced overhead cost from fewer plants and
warehouses, and increased transportation costs.

For example, consider the Goforth Manufacturing Company, which is going to
expand its business by serving customers in states west of the Rocky Mountains.
To accommodate this extension of its marketing area, Goforth plans to open
several new assembly plants. Suppose that Plant Site i represents a potential
location. Assume the plant will have capacity $S_i$ and incur a fixed overhead cost
$F_i \ge 0$, independent of the amount it produces. There are m possible sites for these
new plants, but the overhead expenses are so significant that it is too costly to open
plants on all the sites. Let $c_{ij} \ge 0$ be manufacturing and transportation expenses
for shipping an item from Plant Site i to Marketing Area j. Assume, in addition,
that there is a setup cost $F_{ij} \ge 0$ associated with maintaining a shipping route from
Plant Site i to Marketing Area j; this amount $F_{ij}$ is independent of the magnitude
of $x_{ij} > 0$, but is not incurred when $x_{ij} = 0$.

Accordingly, to formulate the problem as a mathematical model introduce the
integer-valued decision variables

$$y_i = \begin{cases} 1 & \text{if Plant Site } i \text{ is selected} \\ 0 & \text{otherwise} \end{cases} \quad \text{for } i = 1, 2, \ldots, m,$$

and

$$z_{ij} = \begin{cases} 1 & \text{if route from Plant Site } i \text{ to Area } j \text{ is used} \\ 0 & \text{otherwise} \end{cases} \quad \text{for all } i \text{ and } j.$$

Then, instead of the usual objective function (1), the objective function becomes

$$\text{minimize } \sum_{i=1}^{m} F_i y_i + \sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} x_{ij} + \sum_{i=1}^{m} \sum_{j=1}^{n} F_{ij} z_{ij}. \tag{5}$$

Instead of imposing the supply constraints (2), you state the capacity restrictions
as

$$\sum_{j=1}^{n} x_{ij} \le S_i y_i \quad \text{for } i = 1, 2, \ldots, m \quad \text{(supply)}. \tag{6}$$


If Plant Site i is not opened, so that $y_i = 0$, then (6) ensures that $x_{ij} > 0$ is not
allowed. The demand constraints (3) and nonnegativity constraints (4) remain
unchanged. For certain integer programming algorithms, it is sufficient to state
in addition that the $y_i$ and $z_{ij}$ must be either 0 or 1. But for other algorithms, these
same restrictions must be expressed in the expanded form

$$y_i \le 1 \text{ and } z_{ij} \le 1 \quad \text{for all } i \text{ and } j \tag{7}$$

$$y_i \text{ and } z_{ij} \text{ nonnegative integers} \quad \text{for all } i \text{ and } j. \tag{8}$$

[State why (7) and (8) together ensure that each $y_i$ and $z_{ij}$ is either 0 or 1.]
The model formulation is not yet complete, because we have not related
the value of $z_{ij}$ to the value of $x_{ij}$. We want to ensure that $x_{ij} > 0$ only if
$z_{ij} = 1$; this can be achieved by including the linear restrictions

$$x_{ij} - U_{ij} z_{ij} \le 0 \quad \text{for all } i \text{ and } j, \tag{9}$$

where $U_{ij}$ is a sufficiently large coefficient; for example, let $U_{ij} = \text{minimum}\,(S_i, D_j)$.
Verify that if $z_{ij} = 0$, then the nonnegativity condition (4) and (9)
imply that $x_{ij}$ must be equal to zero; and if $z_{ij} = 1$, then (9) is superfluous,
given the new supply constraint (6) and the demand restriction (3). (Also check
that it is never essential for optimality to have $z_{ij} = 1$ and $x_{ij} = 0$.)

It is easy to construct examples in which an ordinary linear programming problem
given by (5), (6), (7), and (9) has an optimal solution with fractional values
for the variables. Hence the restriction (8), requiring that the $y_i$ and $z_{ij}$ be
integer-valued, is essential. As in the ordinary transportation problem, it is only necessary
to stipulate that each $x_{ij}$ must be nonnegative, since the integrality property of the
usual transportation model continues to hold provided that $y_i$ and $z_{ij}$ are
integer-valued.

The formulation in (9) generalizes to other situations. Whenever there is a
setup cost $C_j > 0$ associated with an ordinary variable $x_j$ in a programming model,
then a valid integer programming formulation is to introduce a zero-one variable
($z_j = 0$ or 1) and a constraint ($x_j - U z_j \le 0$), where U is chosen to be
appropriately large. Notice that this device complicates the model in that it adds both
an integer-valued variable and a linear constraint for each variable incurring a
setup cost.
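
To see the setup-cost device at work, the Python sketch below solves a tiny plant location model by complete enumeration. Everything numerical here is made up for illustration (two sites, two marketing areas, the cost figures), the name `G` stands in for the route setup costs $F_{ij}$, and shipments are restricted to whole units between 0 and $U_{ij}$ only so that they can be enumerated. It is a check on the formulation (5), (6), (3), and (9), not a practical algorithm.

```python
from itertools import product

# Hypothetical data, chosen only for illustration.
S = [4, 4]                 # capacities S_i
D = [2, 2]                 # requirements D_j
F = [10, 12]               # plant overhead costs F_i
c = [[1, 3], [3, 1]]       # unit shipping costs c_ij
G = [[1, 1], [1, 1]]       # route setup costs F_ij

m, n = len(S), len(D)
cells = [(i, j) for i in range(m) for j in range(n)]
U = {(i, j): min(S[i], D[j]) for i, j in cells}   # the bound suggested with (9)


def is_feasible(x, y, z):
    supply_ok = all(sum(x[i, j] for j in range(n)) <= S[i] * y[i] for i in range(m))   # (6)
    demand_ok = all(sum(x[i, j] for i in range(m)) >= D[j] for j in range(n))          # (3)
    linking_ok = all(x[i, j] - U[i, j] * z[i, j] <= 0 for i, j in cells)               # (9)
    return supply_ok and demand_ok and linking_ok


def total_cost(x, y, z):
    """Objective function (5)."""
    return (sum(F[i] * y[i] for i in range(m))
            + sum(c[i][j] * x[i, j] for i, j in cells)
            + sum(G[i][j] * z[i, j] for i, j in cells))


def solve_by_enumeration():
    best = None
    for y in product((0, 1), repeat=m):
        for z_values in product((0, 1), repeat=len(cells)):
            z = dict(zip(cells, z_values))
            # Assumption: whole-unit shipments between 0 and U_ij.
            for x_values in product(*(range(U[cell] + 1) for cell in cells)):
                x = dict(zip(cells, x_values))
                if is_feasible(x, y, z):
                    cost = total_cost(x, y, z)
                    if best is None or cost < best[0]:
                        best = (cost, y, x, z)
    return best


best_cost, best_y, best_x, best_z = solve_by_enumeration()
print(best_cost, best_y, best_x, best_z)
```

With these figures the overhead saved by closing the second site outweighs the extra shipping cost, and the optimal $z_{ij}$ equal 1 exactly on the routes that carry a positive shipment.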


Batch sizes—an example with alternative constraints. 
Suppose $x_j$ is the amount of an item to be manufactured, but that either $x_j \ge L_j$,
where $L_j$ represents the smallest feasible lot size, or that $x_j = 0$ during the horizon.
Assume that you can specify a large enough number $U_j$ such that the constraint
($x_j \le U_j$) is certainly satisfied by an optimal solution. Then the dichotomy
($x_j = 0$ or $x_j \ge L_j$) can be formulated by adding a zero-one variable ($y_j = 0$ or
1) and two linear constraints:

$$x_j - U_j y_j \le 0 \tag{10}$$

$$x_j - L_j y_j \ge 0. \tag{11}$$

For $y_j = 0$, the constraints (10) and (11) imply that there is no production ($x_j = 0$).
For $y_j = 1$, the constraint (10) becomes ineffective, and the constraint (11)
imposes the required lot-size restriction.
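
A quick way to convince yourself that (10) and (11) capture the dichotomy is to list the production levels they admit. In the Python sketch below, the values L = 3 and U = 6 are arbitrary illustrative choices, and production is sampled at whole numbers only for the sake of the listing.

```python
def lot_size_feasible(x, y, L, U):
    """Constraints (10) and (11) for one item, with y restricted to 0 or 1."""
    return y in (0, 1) and x >= 0 and x - U * y <= 0 and x - L * y >= 0


L, U = 3, 6   # hypothetical minimum lot size and upper bound

# Production levels admitted by some choice of the zero-one variable y.
allowed = sorted({x for x in range(0, U + 3)
                  for y in (0, 1)
                  if lot_size_feasible(x, y, L, U)})
print(allowed)
```

The levels 1 and 2 are excluded because neither setting of the zero-one variable satisfies both constraints, while 0 and every level from L through U survive.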

The preceding idea can be generalized to handle situations in which a solution
must satisfy at least k out of the p constraints

$$\sum_{j=1}^{n} a_{ij} x_j \le b_i \quad \text{for } i = 1, 2, \ldots, p. \tag{12}$$

Introduce the p zero-one variables ($y_i = 0$ or 1), for $i = 1, 2, \ldots, p$, and impose
the $1 + p$ linear constraints

$$\sum_{i=1}^{p} y_i \ge k \tag{13}$$

$$\sum_{j=1}^{n} a_{ij} x_j \le b_i y_i + U_i (1 - y_i) \quad \text{or} \quad \sum_{j=1}^{n} a_{ij} x_j - (b_i - U_i) y_i \le U_i \quad \text{for } i = 1, 2, \ldots, p, \tag{14}$$

where $U_i$ is chosen so large that ($\sum_{j=1}^{n} a_{ij} x_j \le U_i$) is certainly satisfied by an
optimal solution. Explain how (13), (14), and the stipulation that each $y_i$ be
either 0 or 1 constitute a valid formulation of the problem. [Actually, (13) can just as well
be stated as an equality. Why?]


Capital budgeting: an example with interdependent alternatives.
“Go-no-go” alternatives are probably the most important managerial
decisions that lead to integer programming problems. These choices frequently
arise in the context of a multiperiod strategic planning model. Typically, the
variable $x_j$ indicates whether or not a particular project (or an activity, or an
investment alternative, etc.) is to be undertaken, where

$$x_j = \begin{cases} 1 & \text{if Project } j \text{ is accepted} \\ 0 & \text{otherwise.} \end{cases}$$

An associated coefficient $a_{ij}$ in a constraint often represents the amount of a scarce
resource, such as cash, that is required for the project at Period i, where the total
amount of the resource utilized in each period is limited. These types of constraints
are identical to those you would find in an ordinary linear programming model
designed to aid in long-term strategic planning. Hence we go no further in developing
specific examples.

But the fact that $x_j$ is a zero-one variable can be exploited to represent
combinatorial kinds of restrictions that are often present in capital budgeting problems.
For example, suppose that you want to restrict the solution to accept no more than
k of the first p projects. (Perhaps each project requires the full-time supervision
of a senior engineer, and only k engineers are available for new assignments.)
Then you can achieve this limitation by imposing the linear constraint

$$\sum_{j=1}^{p} x_j \le k. \tag{15}$$

If you write (15) as an equality, then exactly k alternatives will be accepted. The
special case of p mutually exclusive projects can be treated by letting $k = 1$ in
(15). This case can occur, for example, if the first p projects actually refer to the
same alternative, and $x_j$ indicates whether or not this activity is to commence at
Period j.

If, say, alternative $x_2$ can be undertaken only if alternative $x_1$ is accepted, then
the linear constraint

$$-x_1 + x_2 \le 0 \tag{16}$$

on the two zero-one variables achieves this stipulation, as you should verify. Suppose,
instead, that alternatives $x_1$ and $x_2$ are mutually exclusive, and $x_3$ can be
undertaken only if either $x_1$ or $x_2$ is accepted; check that this dependency can
be achieved by imposing the linear constraints

$$x_1 + x_2 \le 1 \tag{17}$$

$$-x_1 - x_2 + x_3 \le 0. \tag{18}$$

By now you can see that, with sufficient ingenuity, a host of combinatorial
interdependences among the projects can be represented by means of linear
stipulations on zero-one variables.
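
The verifications requested for (16), (17), and (18) amount to listing which zero-one combinations survive each set of constraints. A minimal Python sketch, with illustrative function names, follows.

```python
from itertools import product


def satisfies_16(x1, x2):
    """Contingency constraint (16): x2 may be accepted only if x1 is."""
    return -x1 + x2 <= 0


def satisfies_17_18(x1, x2, x3):
    """Constraints (17) and (18) taken together."""
    return x1 + x2 <= 1 and -x1 - x2 + x3 <= 0


contingent = [x for x in product((0, 1), repeat=2) if satisfies_16(*x)]
dependent = [x for x in product((0, 1), repeat=3) if satisfies_17_18(*x)]

print(contingent)
print(dependent)
```

In the first list the pair (0, 1) is missing, and in the second every surviving triple with $x_3 = 1$ has exactly one of $x_1$ and $x_2$ equal to 1, as the stipulations require.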

Traveling salesman problem. A traveling salesman wants to visit
each of n cities, starting and ending at City 1. He visits no other city twice. Let
$c_{ij} \ge 0$ designate the distance between City i and City j, where we use the symbol
$c_{ij} = \infty$ when there is no direct connection from City i to City j; in some
applications, $c_{ij}$ does not equal $c_{ji}$. The optimization problem is to find a shortest
tour. (A production scheduling application of this model was outlined in an
earlier discussion in Sec. 6.5.)

There are several valid mathematical formulations of the problem. The one to
follow uses relatively few variables. Define the zero-one variables:

$$x_{ij} = \begin{cases} 1 & \text{if a tour includes traveling from City } i \text{ to City } j \\ 0 & \text{otherwise} \end{cases}$$

for all i and j. The objective function is

$$\text{minimize } \sum_{i=1}^{n} \sum_{j=1}^{n} c_{ij} x_{ij}, \quad \text{where } c_{ii} = \infty \text{ for } i = 1, 2, \ldots, n. \tag{19}$$

The variables must satisfy the constraints

$$\sum_{j=1}^{n} x_{ij} = 1 \quad \text{for } i = 1, 2, \ldots, n \quad \text{(departure)} \tag{20}$$

$$\sum_{i=1}^{n} x_{ij} = 1 \quad \text{for } j = 1, 2, \ldots, n \quad \text{(arrival)} \tag{21}$$

$$x_{ij} \text{ nonnegative integers} \quad \text{for all } i \text{ and } j. \tag{22}$$

(The convention $c_{ii} = \infty$ is adopted to rule out the meaningless possibility that
$x_{ii} = 1$ occurs in an optimal solution. Alternatively, you can simply omit the
variables $x_{ii}$ from the problem specification.) Restrictions (20), (21), and (22)
in concert ensure that each $x_{ij}$ is either zero or one. Relations (20) require that a
tour includes one departure from each city, and, similarly, relations (21) guarantee
one arrival at each city. The model would be an assignment problem, treated in
Sec. 6.4, if (19) through (22) were the complete characterization.

Unfortunately, although the $x_{ij}$ for any tour must satisfy the above constraints,
a feasible solution to these constraints need not be a tour. In particular, a feasible
solution to (20), (21), and (22) may include two or more disconnected cycles or
subtours, such as $x_{12} = x_{23} = x_{31} = 1$ and $x_{45} = x_{56} = \cdots = x_{n4} = 1$.
(State why such a feasible solution is not a tour for the traveling salesman.) Therefore,
further restrictions on the $x_{ij}$ have to be added in order to guarantee that the
result is a tour.
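To see the difficulty concretely, you can split a solution of (20), (21), and (22) into its cycles. The following sketch assumes the solution is stored as a dictionary named successor, where successor[i] = j stands for $x_{ij} = 1$; the six-city data are hypothetical and simply instantiate the subtour pattern mentioned above.

```python
def cycles(successor):
    """Split an assignment solution into its cycles.

    successor[i] = j means x_ij = 1, that is, City j is visited right after City i.
    """
    remaining = set(successor)
    found = []
    while remaining:
        start = min(remaining)
        cycle, city = [], start
        # Follow the arcs until the walk returns to a city already used.
        while city in remaining:
            remaining.remove(city)
            cycle.append(city)
            city = successor[city]
        found.append(cycle)
    return found

# Hypothetical six-city data: x12 = x23 = x31 = 1 and x45 = x56 = x64 = 1.
subtours = {1: 2, 2: 3, 3: 1, 4: 5, 5: 6, 6: 4}
# A genuine tour through the same six cities.
tour = {1: 3, 3: 5, 5: 2, 2: 6, 6: 4, 4: 1}

print(cycles(subtours))
print(cycles(tour))
```

Both dictionaries give every city exactly one departure and one arrival, so both are feasible in (20) and (21). Only the second yields a single cycle through all the cities.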

There are various ways to formulate linear constraints on the integer variables
in order to rule out cycles. For example, the restriction $x_{23} + x_{32} \le 1$ eliminates
the subtour between Cities 2 and 3, and similarly, the restriction $x_{23} + x_{34} + x_{42} \le 2$
rules out one of the subtours between Cities 2, 3, and 4. But the following is
an elegant specification of linear restrictions that exclude all subtours.

Introduce the variables $u_i$, for i = 2, 3, ..., n, and constrain them by the
$(n - 1)^2 - (n - 1)$ relations

(23)    $u_i - u_j + n x_{ij} \le n - 1$   for i = 2, 3, ..., n and j = 2, 3, ..., n, with $i \ne j$.

The $u_i$ need not be further constrained to rule out subtours, but no harm is done
if the $u_i$ are also restricted to be nonnegative integers. The complete formulation
(20) through (23) consists of $n^2 - n + 2$ linear constraints and $n^2 + n - 1$
integer-valued variables, n of these being the $x_{ii}$ that must equal 0. Certainly,
it is not obvious why (23) works; we provide the justification in the advanced
material below.

► To validate the constraints (23), we must establish two propositions: (a) all subtours
are indeed eliminated, and (b) no tour is excluded. These two statements are argued
in turn.

First, the constraints (20) through (23) rule out all subtours among Cities 2, 3, ..., n.
To see why, consider any cycle among k of these cities, which would be represented by
$x_{ij} = 1$ for k variables. Add the k constraints in (23) associated with the variables in
the cycle. Such a composite constraint must also be satisfied by any solution that is
feasible in (23). As you should verify, both $u_i$ and $(-u_i)$ for each of the k cities appear
in the resultant sum; thus, no $u_i$ appears in the composite constraint. But it is not feasible
for all $x_{ij} = 1$ in this constraint, for then the sum on the left of the inequality is nk,
which is larger than the sum on the right of the inequality, (n - 1)k. Hence (23)
excludes such a subtour.

To illustrate the foregoing argument, consider the subtour City 2 to City 3 to City 4
to City 2, which would be represented by $x_{23} = x_{34} = x_{42} = 1$. The k = 3 associated
constraints in (23) are

        $u_2 - u_3 + n x_{23} \le n - 1$
(i)     $u_3 - u_4 + n x_{34} \le n - 1$
        $u_4 - u_2 + n x_{42} \le n - 1$;

adding these inequalities, you obtain the composite constraint

(ii)    $(u_2 - u_3 + u_3 - u_4 + u_4 - u_2) + n(x_{23} + x_{34} + x_{42}) \le (n - 1)3$.

Check that (ii) rules out the possibility $x_{23} = x_{34} = x_{42} = 1$.

Second, the constraints (20) through (23) do not exclude any tour. To show this,
we must demonstrate that there are values for $u_i$ that satisfy (23) for any given tour.
Consider such a tour, which, by definition, visits each of the Cities 2, 3, ..., n, once
and only once. Let $t_i$ be the position in that tour at which City i is visited, where we
designate $t_1 = 1$ for City 1. For example, in the tour City 1 to City 3 to City 5 ...,
the values of $t_i$ are $t_1 = 1$, $t_3 = 2$, $t_5 = 3$, .... Then $u_i = t_i$, for i = 2, 3, ..., n, is a
feasible set of values. To see why, suppose $x_{ij} = 1$ in the given tour, so that $t_j = t_i + 1$.
Then the constraint for this $x_{ij}$ in (23) is satisfied, since

(iii)   $t_i - (t_i + 1) + n(1) \le n - 1$.

Suppose, instead, that $x_{ij} = 0$. Then the inequality in (23) is ($t_i - t_j \le n - 1$), which
must be satisfied because $u_i = t_i \le n$ and $u_j = t_j > 1$. ◄
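The two propositions can also be checked numerically on a small case. The sketch below, under the assumption n = 4, lists every solution of (20) through (22) with all $x_{ii} = 0$, and for each one searches the integer values 2, ..., n for $u_i$ satisfying (23). The finite search is only a check, not a proof: it relies on the argument above that a tour admits the position values $t_i$, which lie in that range, and that a subtour admits no values at all. All function and variable names are hypothetical.

```python
from itertools import permutations, product

def is_tour(successor):
    """True if following the arcs from City 1 visits every city before returning."""
    n = len(successor)
    city, steps = successor[1], 1
    while city != 1:
        city = successor[city]
        steps += 1
    return steps == n

def u_values_exist(successor):
    """Search integer u_i in 2..n (i = 2..n) that satisfy every relation in (23)."""
    n = len(successor)
    inner = range(2, n + 1)
    for values in product(range(2, n + 1), repeat=n - 1):
        u = dict(zip(inner, values))
        if all(u[i] - u[j] + n * (successor[i] == j) <= n - 1
               for i in inner for j in inner if i != j):
            return True
    return False

n = 4
cities = range(1, n + 1)
# Every assignment solution with x_ii = 0: one departure and one arrival per city.
assignments = [dict(zip(cities, p)) for p in permutations(cities)
               if all(i != j for i, j in zip(cities, p))]

agreement = all(is_tour(s) == u_values_exist(s) for s in assignments)
print(len(assignments), sum(is_tour(s) for s in assignments), agreement)
```

For n = 4 there are nine such assignment solutions, of which six are tours; the search finds feasible $u_i$ for exactly those six.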

* Other formulation devices. Suppose you want to permit a variable
x to equal only certain values, say, $X_1, X_2, \ldots, X_p$. You can impose this
stipulation by removing x and appropriately introducing p zero-one variables $w_k$.
Specifically, replace x everywhere in the model by the expression

(24)    $X_1 w_1 + X_2 w_2 + \cdots + X_p w_p$

and then restrict the $w_k$ by the constraints

(25)    $w_1 + w_2 + \cdots + w_p = 1$

(26)    all $w_k$ integer-valued.

Further, if a nonlinear function c(x) also appears in the model, then using (24),
(25), and (26), you can replace c(x) by the expression

(27)    $c(X_1) w_1 + c(X_2) w_2 + \cdots + c(X_p) w_p$,

which is linear in the $w_k$.
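A short enumeration illustrates why the replacement is exact. In the sketch below the permitted values 0, 2, 5, 9 and the cost function $c(x) = x^2 + 1$ are hypothetical choices made only for illustration; for each zero-one vector satisfying (25), expression (24) recovers one permitted value and expression (27) reproduces its cost.

```python
from itertools import product

def substitute(allowed_values, cost):
    """List (w, x, linear cost) for every zero-one w satisfying (25) and (26)."""
    rows = []
    for w in product((0, 1), repeat=len(allowed_values)):
        if sum(w) != 1:                      # constraint (25)
            continue
        x = sum(X * wk for X, wk in zip(allowed_values, w))             # expression (24)
        linear = sum(cost(X) * wk for X, wk in zip(allowed_values, w))  # expression (27)
        rows.append((w, x, linear))
    return rows

def square_plus_one(x):
    """A hypothetical nonlinear cost c(x), chosen only for illustration."""
    return x * x + 1

rows = substitute([0, 2, 5, 9], square_plus_one)
for w, x, linear in rows:
    print(w, x, linear, linear == square_plus_one(x))
```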

The preceding approach can be generalized to accommodate a programming
model that contains a continuous piecewise-linear function c(x) of a nonnegative
variable x. This type of model is discussed at some length in Sec. 14.7, and you
may postpone reading the following explanation until you have studied the
material in the next chapter.

Assume, for simplicity, that x lies in the interval $0 \le x \le X$ and that the values
$0 = X_1 < X_2 < \cdots < X_p = X$ are the end points of the intervals for the linear
pieces; that is, c(x) is linear for $X_k \le x \le X_{k+1}$, where k = 1, 2, ..., p - 1.
To begin, introduce p nonnegative weights $w_k$, replace x everywhere in the model
by the expression (24), impose (25), where all $w_k \ge 0$ instead of (26), and replace
c(x) by (27).

As you should verify, these substitutions will be valid provided that a solution
contains only one variable $w_k = 1$, or at most two adjacent variables $w_k$ and
$w_{k+1}$, where $w_k + w_{k+1} = 1$. To guarantee this so-called property of adjacent weights,
introduce (p - 1) zero-one variables $z_k$ and impose the (p + 1) constraints

        $w_1 - z_1 \le 0$
        $w_2 - z_1 - z_2 \le 0$
(28)    $w_3 - z_2 - z_3 \le 0$
        ...
        $w_{p-1} - z_{p-2} - z_{p-1} \le 0$
        $w_p - z_{p-1} \le 0$

(29)    $z_1 + z_2 + z_3 + \cdots + z_{p-2} + z_{p-1} = 1$,

along with the restrictions

(30)    all $z_k$ integer-valued.

Check that the restrictions (29) and (30) ensure that exactly one $z_k$ equals 1,
thereby guaranteeing in (28) that no more than two of the weights, $w_k$ and $w_{k+1}$,
can be positive, and that their sum must be 1 according to (25).

Variations on the above procedures permit you to treat functions that are
piecewise linear but possess jump discontinuities at the $X_k$ values.
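The effect of (28) and (29) can be traced on a small case. The sketch below assumes hypothetical breakpoints 0, 2, 5, 9 with costs 0, 4, 7, 8 at those points. The function weight_limits (a name introduced here) computes the upper limits that (28) places on each weight for a given choice of the $z_k$, and interpolate evaluates (24) and (27) after checking (25), (28), and (29).

```python
def weight_limits(z):
    """Upper limits that constraints (28) place on w_1, ..., w_p.

    z is the list (z_1, ..., z_{p-1}); w_k <= z_{k-1} + z_k, with the
    missing end terms z_0 and z_p treated as 0.
    """
    padded = [0] + list(z) + [0]
    return [padded[k] + padded[k + 1] for k in range(len(z) + 1)]

def interpolate(breakpoints, costs, z, w):
    """Return (x, c(x)) from weights w after checking (25), (28), and (29)."""
    limits = weight_limits(z)
    assert sum(z) == 1                                          # (29)
    assert abs(sum(w) - 1) < 1e-12                              # (25)
    assert all(0 <= wk <= lim for wk, lim in zip(w, limits))    # (28)
    x = sum(X * wk for X, wk in zip(breakpoints, w))            # (24)
    c = sum(C * wk for C, wk in zip(costs, w))                  # (27)
    return x, c

# Hypothetical data: end points X_1..X_4 and the values c(X_1)..c(X_4).
breakpoints = [0, 2, 5, 9]
costs = [0, 4, 7, 8]

# z_2 = 1 selects the middle piece, so only w_2 and w_3 may be positive.
print(weight_limits([0, 1, 0]))
print(interpolate(breakpoints, costs, [0, 1, 0], [0, 0.25, 0.75, 0]))
```

With $z_2 = 1$ the limits are 0, 1, 1, 0, and the weights .25 and .75 place x at 4.25 on the piece between 2 and 5, where the interpolated cost is 6.25.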


13.3 OVERVIEW OF INTEGER PROGRAMMING 
ALGORITHMS 

Currently, there are two principal approaches for finding a truly optimal solu¬ 
tion to an integer programming problem. 

1. Cutting-Plane Algorithms. Several variants of this approach are available.
The one given in the next section is for the pure integer programming model.
It starts with an optimal linear programming solution. At each iteration, it adds
a linear constraint that is satisfied by any integer solution to the original problem,
but that rules out the current nonintegral solution. The method terminates as soon
as an integer-valued solution is obtained. Convergence is guaranteed in a finite,
albeit sometimes discouragingly large, number of iterations.

2. Backtrack Algorithms. Here, too, various versions are available. The first one
that we present in Sec. 13.5 is for the mixed integer programming model, and is
called the Branch-and-Bound Algorithm. Like the Cutting-Plane Algorithm,
it starts with an optimal linear programming solution; it then generates a family
of related but separate linear programming problems. We go on in Sec. 13.6
to explain several modifications of the approach, using the traveling salesman
problem for illustration. Finally, we present in Sec. 13.7 the Partial (Implicit)
Enumeration Algorithm, which we will apply only to problems containing zero-one
integer variables. A consequence of such special structure is that the arithmetic
simplifies considerably. The term “backtrack” refers to the specific way that
the sequence of problems is generated and solved. The notion will become clear
when you read Sec. 13.5.

There are other approaches that, although not guaranteed to find an optimal
solution, often do so in particular instances, and frequently provide a near-optimal
solution. One of these methods is to take a random sample of feasible solutions,
perhaps making simple-to-find improvements on each sample solution
whenever possible. For example, if you applied the approach to a traveling
salesman problem with n cities, you would draw a random sample of permutations
of the Cities 2, 3, ..., n, selected from the total (n - 1)! possibilities, and then
compare the associated tour distances. Another approach, known generically as
heuristic programming, employs both simple and sophisticated rules of thumb, as
well as partially corrective trial-and-error procedures, to produce a feasible, and
hopefully near-optimal, solution. In the traveling salesman problem, a simple
rule would be to travel from City j to the closest city not already visited; a partially
corrective technique would be to test whether interchanging pairs of cities in the
trial tour reduces the total distance.
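The three ideas just mentioned (random sampling of permutations, the closest-unvisited-city rule, and pairwise interchange) can be sketched in a few lines. The code below is one possible reading of those rules, not a prescribed procedure; the five-city distance table is hypothetical, cities are numbered from 0 so that index 0 plays the role of City 1, and none of the routines is guaranteed to find a shortest tour.

```python
import random

def tour_length(tour, dist):
    """Total distance of the closed tour, including the return to the start."""
    return sum(dist[a][b] for a, b in zip(tour, tour[1:] + tour[:1]))

def nearest_neighbor(dist, start=0):
    """Rule of thumb: always travel to the closest city not already visited."""
    tour, unvisited = [start], set(range(len(dist))) - {start}
    while unvisited:
        here = tour[-1]
        nxt = min(unvisited, key=lambda city: dist[here][city])
        tour.append(nxt)
        unvisited.remove(nxt)
    return tour

def improve_by_interchange(tour, dist):
    """Partially corrective step: swap pairs of cities while the tour shortens."""
    best, improved = tour[:], True
    while improved:
        improved = False
        for i in range(1, len(best)):
            for j in range(i + 1, len(best)):
                trial = best[:]
                trial[i], trial[j] = trial[j], trial[i]
                if tour_length(trial, dist) < tour_length(best, dist):
                    best, improved = trial, True
    return best

def random_sample(dist, samples, seed=0):
    """Draw random permutations of the other cities and keep the shortest tour."""
    rng = random.Random(seed)
    best = None
    for _ in range(samples):
        rest = list(range(1, len(dist)))
        rng.shuffle(rest)
        tour = [0] + rest
        if best is None or tour_length(tour, dist) < tour_length(best, dist):
            best = tour
    return best

# Hypothetical symmetric distances among five cities.
dist = [[0, 3, 4, 2, 7],
        [3, 0, 4, 6, 3],
        [4, 4, 0, 5, 8],
        [2, 6, 5, 0, 6],
        [7, 3, 8, 6, 0]]

greedy = nearest_neighbor(dist)
better = improve_by_interchange(greedy, dist)
sampled = random_sample(dist, samples=20)
print(greedy, tour_length(greedy, dist))
print(better, tour_length(better, dist))
print(sampled, tour_length(sampled, dist))
```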

These other methods have worked well in a number of practical applications. 
But since there is a negligible amount of codified knowledge about the techniques 
—oftentimes, they are of an ad hoc nature—we do not consider them further in 
this text. 

The nitty-gritty. In keeping with our overall goal of emphasizing essen¬ 
tial ideas, we present each of the methods in simplified terms—we have ignored 
technicalities that are relevant to the design of efficient electronic computer 
programs. Such details are critical in constructing successful computer codes, but 
are of interest primarily to specialists and consequently are omitted in this chapter. 
One aspect of these computational details does warrant your attention, however.
As you know, an algorithmic technique must be described unambiguously in
order to be generally useful. This is because in practice, an electronic computer 
will do the calculations and has to be instructed in no uncertain terms on how to 
proceed. Naturally, this requirement applies to all the other algorithms you have 
studied. But we emphasize the point at this juncture because the descriptions of 
most integer programming algorithms are replete with possibilities for arbitrary 
choices to be made during the course of the iterations. 

Curiously, when an individual applies any of these algorithms to a small-scale 
problem, he often is able to exploit the special properties of the model, and to make 
these choices auspiciously by exercising his intuition about the optimal solution. 
As a result, most of these algorithms look effective when tried on such toy problems. 
When the methods are programmed for an electronic calculator, however, the 
choices are made according to predetermined rules, without any human inter¬ 
vention to provide intuitive guidance. Experience has shown that the computa¬ 
tional burden of these methods is highly sensitive to the particular way the arbi¬ 
trary choices are resolved. This is in sharp contrast to experience with the simplex 
method. In that technique, many different versions of Criterion I (selection of 
variables to enter a nonoptimal basic solution) result in roughly the same magni¬ 
tude of computational effort. As you study each algorithm below, take note of 
where the arbitrary choices arise. 

13.4 CUTTING-PLANE ALGORITHM
(method of integer forms) 

The following simple example will motivate the guiding idea of the algorithm. 


Consider

(1)    maximize $5x_1 + 1x_2$

subject to

(2)    $2x_1 + 1x_2 = 3$

(3)    $x_1$ and $x_2$ nonnegative integers.

As you can quickly verify, $x_1 = \tfrac{3}{2}$ is the optimal linear programming solution to
(1) and (2), when (3) is weakened to $x_1 \ge 0$ and $x_2 \ge 0$. The linear programming
solution is not feasible in (3), since it is fractional. Note that (2) and (3) together,
however, imply the linear inequality constraint

(4)    $x_1 \le 1$.

Explain why. Verify that the optimal linear programming solution to (1), (2),
(4), and $x_1 \ge 0$ and $x_2 \ge 0$, is $x_1 = 1$ and $x_2 = 1$, which is integer-valued.
Consequently, this linear programming solution is optimal for the model (1)
through (3), because it is feasible in (3).
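The small example can be checked directly. The sketch below lists the integer solutions of (2), picks the best one under (1), and compares it with the fractional linear programming point $x_1 = \tfrac{3}{2}$, $x_2 = 0$. The variable names are hypothetical, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

def objective(x1, x2):
    """Objective function (1)."""
    return 5 * x1 + 1 * x2

# Integer solutions of (2): 2*x1 + 1*x2 = 3 with x1 and x2 nonnegative.
integer_points = [(x1, 3 - 2 * x1) for x1 in range(4) if 3 - 2 * x1 >= 0]
best_integer = max(integer_points, key=lambda point: objective(*point))

# Along (2) the objective equals 3 + 3*x1, so the relaxation pushes x1 up
# until x2 reaches 0.
lp_point = (Fraction(3, 2), Fraction(0))

print(integer_points)
print(best_integer, objective(*best_integer))
print(lp_point, objective(*lp_point))
```

Every integer point has $x_1 \le 1$, which is constraint (4), whereas the linear programming point violates it.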

What we have done, in effect, is to find an optimal solution to an integer pro¬ 
gramming problem by constructing and solving an enlarged linear programming 
model containing linear constraints implied by the stipulations of the original
integer problem. It was simple to discover a constraint (4) that did the trick for
the small example. The algorithm below provides a systematic way of deriving
such constraints for any integer programming problem. The central notion of the
algorithm, then, is to replace an integer problem by a suitably enlarged linear
programming problem.

Notice in the example that the optimal integer solution contains both variables 
at positive levels, even though there is only a single linear restriction (2). This 
illustrates a characteristic distinguishing linear and integer programming models. 
In a linear programming problem containing m constraints and n variables, and 
having a finite optimal objective function value, there is always an optimal solu¬ 
tion containing no more than m variables at strictly positive levels. In an integer 
programming model of the same size, an optimal solution may require every 
integer-valued variable to be at a positive level. This is why it is necessary in the 
Cutting-Plane Algorithm to enlarge the size of the original model by adding fur¬ 
ther constraints. 

There are several versions of the Cutting-Plane Algorithm. We present one that
solves the pure integer programming problem, in which all variables are integer- 
valued; the technique can be extended without difficulty to the mixed integer 
case. The particular version we have chosen has the merit of being easily explained. 
Other versions, more intricate but also more efficient, are mainly of importance to 
a technical specialist—all that they contain is further refinements of the essential 
concepts that you will learn in the method below. 

► The idea of the Cutting-Plane Algorithm can be easily motivated by reference to
solution space geometry (Sec. 3.4). Define the convex hull of the set of feasible
integer points (solutions) as the smallest convex set containing these points (or,
equivalently, as the set of points in Euclidean n-space composed of all convex
combinations of the form $[w x_1 + (1 - w) y_1, \; w x_2 + (1 - w) y_2, \; \ldots, \; w x_n + (1 - w) y_n]$, where
$0 \le w \le 1$, and $(x_1, x_2, \ldots, x_n)$ and $(y_1, y_2, \ldots, y_n)$ are feasible integer solutions).
Assume that the optimal value of the linear objective function is finite. Then it is easily
shown that the optimal value of the objective function occurs at a vertex of this convex
hull. Such an extreme point is one of the feasible integer-valued solutions.

Significantly, the convex hull itself can be represented by a finite collection of linear 
constraints. The Cutting-Plane Algorithm begins with the convex set defined by the 
linear and nonnegativity restrictions in the original model and obtains an optimal 
extreme point of this set. If such a solution is not integer-valued, it adds a restriction 
that reduces the convex set so as to thereby cut off the current extreme point. But the 
new constraint does not cut off any extreme point of the convex hull of feasible integer 
solutions. Eventually, the method imposes enough constraints so that an optimal extreme 
point of the reduced convex set coincides with an optimal extreme point of the convex 
hull. This geometric interpretation will be illustrated by an example later in the section. 

Detailed approach. Suppose that the pure integer programming problem
is stated in the form

(5)    maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(6)    $\sum_{j=1}^{n} a_{ij} x_j = b_i$   for i = 1, 2, ..., m

(7)    all $x_j$ nonnegative integers.

Assume that there exists a feasible solution to (6) and (7) and that the objective- 
function value is finite in an optimal solution. Before giving the specific steps of 
the algorithm, we show in general terms how to construct additional linear restric¬ 
tions that must be satisfied by any feasible solution to (6) and (7). 

Consider any linear equation that can be derived by algebraic operations on 
the linear constraints (6)—such as, by adding two or more of the equations, or 
by multiplying an equation by a nonzero constant. Then if a solution satisfies 
(6), it also must satisfy an equation so derived. Let 

(8)    $\sum_{j=1}^{n} a_j x_j = b$

denote such a constraint; possibly one or more of the $a_j$ and b are fractional-valued,
like 2.5 or $\tfrac{10}{3}$. Let the symbol [d] designate the integer part of d,
that is, the largest integer that is less than or equal to the real number d; for
example,

(9)    [2.5] = 2       [10/3] = 3       [4] = 4
       [-2.5] = -3     [-10/3] = -4     [-4] = -4.

Because the $x_j$ are also restricted in (7) to be nonnegative integers, any values of
the $x_j$ that satisfy (6), and consequently (8), must actually satisfy the weaker
inequality

(10)    $\sum_{j=1}^{n} [a_j] x_j \le b$.

State why. Further, since the summation on the left-hand side of (10) must be
integer-valued, verify that (10) can be validly strengthened to

(11)    $\sum_{j=1}^{n} [a_j] x_j \le [b]$.

Finally, (11) can be transformed to an equality by adding a slack variable

(12)    $\sum_{j=1}^{n} [a_j] x_j + x = [b]$     (integer parts),

where x is restricted to be a nonnegative integer. [Explain why (11) implies that
x must be integer-valued.] Therefore, if (6) is augmented by the linear constraint
(12), the problem still remains a pure integer programming model. To summarize
so far, we have derived a constraint (12), with the values of $a_j$ and b obtained
by algebraic operations on the constraints (6), that is satisfied by any feasible
solution to (6) and (7).

To simplify the verbal explanation of the algorithm, we perform an elementary
manipulation on (8) and (12). Define the values $f_j$ and f by the identities

(13)    $[a_j] + f_j = a_j$   and   $[b] + f = b$,

so that $0 \le f_j < 1$ and $0 \le f < 1$; these quantities are called the fractional
parts of $a_j$ and b. (To illustrate, the fractional part of 2.5 and -2.5 is .5; of
$\tfrac{10}{3}$ is $\tfrac{1}{3}$, and of $-\tfrac{10}{3}$ is $\tfrac{2}{3}$.) By subtracting (8) from (12), you obtain a constraint
composed of only fractional parts

(14)    $\sum_{j=1}^{n} (-f_j) x_j + x = -f$     (fractional parts).

We will employ (14) instead of (12) in the algorithm below.

The following example shows how the new constraints (12) and (14) work.
In the example above, multiply both sides of (2) by $\tfrac{1}{2}$, yielding

(15)    $x_1 + \tfrac{1}{2} x_2 = \tfrac{3}{2}$.

Then verify that applying the formula in (12) gives

(16)    $1x_1 + 0x_2 + x = 1$,

which is the same as constraint (4), and that applying (14) gives

(17)    $-\tfrac{1}{2} x_2 + x = -\tfrac{1}{2}$,

which implies that $x_2 \ge 1$.
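The formulas (12) and (14) are mechanical enough to compute. The sketch below, with hypothetical function names, forms both constraints from the coefficients and right-hand side of an equation such as (8), using exact fractions, and applies them to (15). The slack variable x is understood to have coefficient 1 and is not listed.

```python
from fractions import Fraction
from math import floor

def integer_part_cut(coeffs, rhs):
    """Constraint (12): sum of [a_j] x_j, plus a slack x, equals [b]."""
    return [floor(a) for a in coeffs], floor(rhs)

def fractional_cut(coeffs, rhs):
    """Constraint (14): sum of (-f_j) x_j, plus a slack x, equals -f."""
    return [-(a - floor(a)) for a in coeffs], -(rhs - floor(rhs))

# Integer parts of the decimal examples in (9); floor rounds toward minus infinity.
print(floor(2.5), floor(4), floor(-2.5), floor(-4))

# Equation (15): x1 + (1/2) x2 = 3/2.
row, b = [Fraction(1), Fraction(1, 2)], Fraction(3, 2)
print(integer_part_cut(row, b))   # coefficients and right-hand side of (16)
print(fractional_cut(row, b))     # coefficients and right-hand side of (17)
```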

The Cutting-Plane Algorithm proceeds in this way: 


Step 1. Find an optimal linear programming solution to the objective function
(5) and linear constraints (6), ignoring the integer stipulations in (7) and only
requiring that all $x_j \ge 0$.

Step 2. Terminate the iterations if the current linear programming solution is
integer-valued. Otherwise, select a fractional-valued basic variable. Form
constraint (14) from the equation that contains this basic variable in the current
optimal linear programming solution.

Step 3. Augment the linear programming problem with the new constraint,
find a new optimal linear programming solution, and return to Step 2.


We elaborate on these steps before illustrating the algorithm by an example.
Suppose that at any iteration the current optimal linear programming solution is
obtained by using the simplex method, as described in Sec. 4.4. Assume that $x_k$
is the basic variable in Row i at the final iteration:

(18)    $x_k + \sum_{j\ \text{nonbasic}} a_j x_j = b$     Row i,

where b is fractional-valued; hence, $x_k = b$ does not satisfy the integer stipulation.
Then at Step 2, constraint (14), formed from (18), is simply

(19)    $\sum_{j\ \text{nonbasic}} (-f_j) x_j + x = -f$     (cut),

where $f_j$ is the fractional part of $a_j$ and f the fractional part of b. Note that after
the first iteration, the summation of nonbasic variables in (19) may include the
slack variables, symbolically denoted by x in (14), that were added at Step 2
in previous iterations. This inclusion is warranted because each of these slack
variables is restricted to be integer-valued.

Observe that the current linear programming solution does not satisfy the new 
constraint (19), in that x = —f is strictly negative, since the summation on the 
left-hand side, containing only nonbasic variables, currently equals 0. When the 
so-called cut (19) is appended at Step 3 along with the stipulation that $x \ge 0$,
the current linear programming solution is thereby ruled out, or, in geometric
terms, is cut off. This means that the value of the objective function cannot
increase, and may decrease, after the optimization in Step 3. In practice, you would
perform the optimization by applying the dual simplex algorithm (Sec. 5.8) to 
the model enlarged by (19), after adding x to the current set of basic variables. 

How you select a fractional-valued basic variable in Step 2 is somewhat arbi¬ 
trary. You cannot assume that the choice is completely arbitrary, however, in 
devising a rigorous proof that the method always converges in a finite number of 
iterations. But for practical purposes, it frequently speeds convergence to pick 
a variable having the largest fractional part (that is, one leading to the largest 
value for f). You could also form several such constraints for different fractional- 
valued basic variables and augment them in Step 3, but there is no empirical
evidence to suggest that doing so is computationally advantageous.

If you add an amendment to Step 3, you can keep the augmented linear programs 
from growing without bound. The procedure is that whenever the current optimal 
basic solution includes a slack variable x introduced at a previous iteration, you 
drop that equation containing x in the current solution prior to performing the 
optimization. This amendment implies that the size of the augmented problem 
never exceeds n linear equations. As soon as the problem has n equations, adding 
any new constraint must cause a basic solution to contain one or more of the slack 
variables introduced by (14). An important corollary of these observations is that 
the computational difficulty of an integer programming problem depends signi¬ 
ficantly on the number of variables in the problem. 

The algorithm is guaranteed, in theory, to converge to an optimum in a finite
number of iterations; we discuss actual computational experience at the end of
the section. But note that the method does not provide a feasible integer solution
until the terminal iteration; in this sense, the algorithm is a dual technique. As
a consequence, if you stop the iterations prematurely, you do not have a usable
solution. (There are Cutting-Plane Algorithms which are primal, in that the
solution at each iteration is feasible and integer-valued, but these are not discussed in
the text. The ideas are illustrated in exercise 69.)

Example. Consider the problem

(20)    maximize $21x_1 + 11x_2$

subject to

(21)    $7x_1 + 4x_2 + x_3 = 13$

(22)    $x_1$, $x_2$, $x_3$ nonnegative integers.

(You can interpret $x_3$ as a slack variable, since it does not appear in the objective
function.) As usual, express the objective function by a definitional linear equation:

(23)    $x_0 - 21x_1 - 11x_2 = 0$;

observe that you can restrict $x_0$ to be integer-valued in this example. Before
continuing, find the optimal solution by inspection.

As you should verify, performing Step 1 leads to the linear programming solution

(24)    $x_0 + x_2 + 3x_3 = 39$                                         Row 0
        $x_1 + \tfrac{4}{7} x_2 + \tfrac{1}{7} x_3 = 1\tfrac{6}{7}$     Row 1.

According to Step 2, since $x_1$ is fractional-valued, you must add a restriction (14)
formed from Row 1. Check that the constraint is

(25)    $-\tfrac{4}{7} x_2 - \tfrac{1}{7} x_3 + x_4 = -\tfrac{6}{7}$,

where $x_4$ is the associated integer-valued slack variable. Continuing to Step 3,
the linear programming model consists of (24) augmented by (25). Check that the
new optimal solution is

        $x_0 + 2\tfrac{3}{4} x_3 + 1\tfrac{3}{4} x_4 = 37\tfrac{1}{2}$     Row 0
(26)    $x_1 + x_4 = 1$                                                   Row 1
        $x_2 + \tfrac{1}{4} x_3 - 1\tfrac{3}{4} x_4 = 1\tfrac{1}{2}$      Row 2.

Returning to Step 2, verify that the new constraint formed from Row 2 is

(27)    $-\tfrac{1}{4} x_3 - \tfrac{1}{4} x_4 + x_5 = -\tfrac{1}{2}$     Row 3

and that in Step 3 the optimal linear programming solution to (26) and (27) is

        $x_0 + x_1 + 11x_5 = 33$      Row 0
(28)    $-x_1 + x_3 - 4x_5 = 1$       Row 1
        $2x_1 + x_2 + x_5 = 3$        Row 2
        $x_1 + x_4 = 1$               Row 3.

Returning again to Step 2, the iterations terminate with the optimal integer-valued
solution $x_1 = 0$, $x_2 = 3$, and $x_3 = 1$, yielding $x_0 = 33$. [If the solution in
(28) were not integer-valued, then you would drop the relation in Row 3 before
performing the next optimization at Step 3, because $x_4$ has re-entered the basis.]
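The two cuts in this example can be reproduced by taking fractional parts of the nonbasic coefficients in the chosen row. The sketch below does only that step; it does not perform the simplex iterations, and the rows are typed in by hand from the tableaus of Step 1 and of the first pass through Step 3. The function name and the dictionary layout are hypothetical.

```python
from fractions import Fraction as F
from math import floor

def fractional_cut(row, rhs):
    """Coefficients and right-hand side of cut (19) for one tableau row.

    row maps each nonbasic variable to its coefficient; the new slack
    variable enters with coefficient 1 and is not listed.
    """
    cut = {name: -(a - floor(a)) for name, a in row.items()}
    return cut, -(rhs - floor(rhs))

# Row 1 after Step 1: x1 + (4/7) x2 + (1/7) x3 = 13/7.
first_cut = fractional_cut({"x2": F(4, 7), "x3": F(1, 7)}, F(13, 7))

# Row 2 after the first cut: x2 + (1/4) x3 - (7/4) x4 = 3/2.
second_cut = fractional_cut({"x3": F(1, 4), "x4": F(-7, 4)}, F(3, 2))

print(first_cut)
print(second_cut)
```

Note that the coefficient -7/4 has integer part -2, so its fractional part is 1/4, which is why $x_4$ enters the second cut with coefficient -1/4.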


► You can gain further understanding of the method by seeing how the additional
constraints (25) and (27) can be expressed as inequalities in terms of the original variables
$x_1$ and $x_2$. You then can easily graph these inequalities, using the solution space representation
in Sec. 3.4 to see how they cut off the associated noninteger-valued linear programming
solution.

Constraint (21) can be written as an inequality,

(i)     $7x_1 + 4x_2 \le 13$,

by suppressing the slack variable $x_3$. Eliminating $x_3$ from (25) by means of (21), and
suppressing the slack variable $x_4$ in the result, gives the inequality

(ii)    $x_1 \le 1$.

Eliminating $x_3$ from (27) by means of (21), $x_4$ from the result by means of (25), and
suppressing the slack variable $x_5$ yields the inequality

(iii)   $2x_1 + x_2 \le 3$.

Verify that the optimal nonnegative solution to the linear programming problem
consisting of (20), (i), (ii), and (iii), agrees with that in (28).

In drawing the solution space representation, begin by indicating the set of feasible
integer-valued solutions implied by (i) as well as the region of feasible linear programming
solutions; show that $x_1 = 1\tfrac{6}{7}$ is the optimal linear programming solution. Next
add (ii), and notice how this constraint does not eliminate any feasible integer-valued
solution, but does rule out the current linear programming solution. Show that $x_1 = 1$
and $x_2 = 1\tfrac{1}{2}$ is optimal when (ii) is added. Finally, append (iii) and observe again how
the new constraint does not eliminate any feasible integer-valued solution, cuts off the
current linear programming solution, and yields a new optimal solution that is
integer-valued. ◄
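The claims in this passage can be confirmed by enumeration instead of a graph. The sketch below assumes the cuts in the original variables are $x_1 \le 1$ and $2x_1 + x_2 \le 3$, as derived by eliminating the slack variables from (25) and (27). It checks that neither cut removes an integer point feasible in $7x_1 + 4x_2 \le 13$, that each cut excludes the linear programming optimum in force when it was added, and that the best integer point has objective value 33. The function names are hypothetical.

```python
from fractions import Fraction as F

def original(x1, x2):
    """Constraint (21) with its slack suppressed."""
    return 7 * x1 + 4 * x2 <= 13

def first_cut(x1, x2):
    """Constraint (25) expressed in x1 and x2."""
    return x1 <= 1

def second_cut(x1, x2):
    """Constraint (27) expressed in x1 and x2."""
    return 2 * x1 + x2 <= 3

# All nonnegative integer points that satisfy the original constraint.
integer_points = [(x1, x2) for x1 in range(14) for x2 in range(14)
                  if original(x1, x2)]
best = max(integer_points, key=lambda p: 21 * p[0] + 11 * p[1])

# Linear programming optima before each cut was added.
lp_before_first = (F(13, 7), F(0))
lp_before_second = (F(1), F(3, 2))

print(integer_points)
print(all(first_cut(*p) and second_cut(*p) for p in integer_points))
print(first_cut(*lp_before_first), second_cut(*lp_before_second))
print(best, 21 * best[0] + 11 * best[1])
```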

Concluding remarks. Computational experience with sophisticated 
versions of the cutting-plane algorithm has by no means been uniformly success¬ 
ful. In particular, the approach has not worked well on inherently combinatorial 
problems that are only “tricked” into an integer programming formulation. The 
method also has exhibited difficulty in solving problems containing large values 
of the aij and bi. But the algorithm has succeeded in problems in which the optimal 
linear programming solution already yields integer values for many of the variables. 

The computational problem manifests itself in the optimization process of
Step 3. After a few iterations, there may be numerous alternative optimal solutions
to the linear programming problem in Step 3. This causes the value of the objective
function to remain constant (usually at a nonoptimal value) for many iterations
until the alternatives are eventually ruled out by the constraints formed in Step 2.
(Such stalling is sometimes referred to as massive degeneracy.) Unfortunately,
this phenomenon arises frequently in medium- and large-scale problems.
Indeed, an abundance of examples of small-scale problems (10 or fewer variables
and equations) have required thousands of iterations to achieve convergence.

The number of iterations required to obtain a solution also seems to depend
considerably on the specific way the problem is formulated. As an illustration of
this so-called form sensitivity, adding seemingly redundant constraints (for example,
$x_1 \le 1$ and $x_2 \le 1$ to a model containing the constraint $x_1 + x_2 \le 1$) can
substantially improve the speed of convergence.

► We have skipped over one point that warrants mentioning. Perhaps you found it 
surprising that we derived constraint (12) so simply. Were you to think about discover¬ 
ing other similarly motivated cutting-plane constraints, you would find that numerous 
possibilities are amenable to systematic derivation. But what distinguishes (12) from 
many other possibilities is that the resultant algorithm can be rigorously proved to 
converge to an optimal solution in a finite number of iterations. (Some of the other 
approaches can be shown incapable of converging when applied to certain classes of
problems.) Because of the sporadic computational success of the Cutting-Plane Algorithm,
the theoretical property of finite convergence is an admittedly weak defense for using
(12); but the other approaches have not proved to be any better even when they do
converge. ◄

13.5 BRANCH-AND-BOUND ALGORITHM

This method can be applied to mixed, as well as pure, integer programming 
problems. For definiteness, suppose the model is stated as

(1)    maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(2)    $\sum_{j=1}^{n} a_{ij} x_j \le b_i$   for i = 1, 2, ..., m

(3)    $x_j$ integer-valued   for j = 1, 2, ..., p ($\le n$)

(4)    $x_j \ge 0$   for j = p + 1, ..., n.

In addition, assume that, for each integer-valued variable, you can provide
lower and upper bounds that surely include the optimal values

(5)    $L_j \le x_j \le U_j$   for j = 1, 2, ..., p.

Usually $L_j = 0$, but it need not be. (As you know from Sec. 3.3, no essential generality
is lost by assuming that all $L_j = 0$. The algorithm is more simply described,
however, by employing the general symbol $L_j$.)

The idea of the Branch-and-Bound Algorithm stems from the following
elementary observation. Consider any variable $x_j$, and let I be some integer value,
where $L_j \le I \le U_j - 1$. Then an optimal solution to (1) through (5) will also
satisfy either the linear constraint

(6)    $x_j \ge I + 1$

or the linear constraint

(7)    $x_j \le I$.

To illustrate how this dichotomy can be used, suppose you ignore the integer
restriction (3) and find that an optimal linear programming solution to (1),
(2), (4), and (5) indicates that $x_1 = 1\tfrac{2}{3}$. Then formulate and solve two more linear
programs. Each of these still contains (1), (2), and (4). But (5) for j = 1 is modified
in one problem to be $2 \le x_1 \le U_1$, and in the other to be $L_1 \le x_1 \le 1$. Suppose
further that each of these two problems has an optimal solution that satisfies the
integer restrictions (3). Then the solution that has the larger value for the objective
function is indeed optimal for the original integer programming problem. State
why. Usually, one (or both) of these problems has no optimal solution that satisfies
(3); hence, additional computations may be required. The algorithm below
specifies how to apply the dichotomy (6) and (7) in a systematic manner to
eventually obtain an optimal solution.
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Detailed approach. We present the steps of the method next, and dis¬ 
cuss its computational merits at the end of the section. The Branch-and-Bound 
Algorithm solves a set of linear programming problems. As you saw in the preced¬ 
ing section, an optimal solution to an integer programming problem need not be 
a basic solution to the m linear constraints (2); that is, an optimal solution may 
contain more than m variables at strictly positive levels. The bounds (5) on each 
X) serve to enlarge the linear programming models sufficiently so that all variables, 
if necessary, can be present in an optimal solution. The enlargement is further 
indication that the computational burden depends critically on the number of 
integer-valued variables in the problem. 

The method. At any iteration $t$, you have available a lower bound, say,
$x_0^t$, for the optimal value of the objective function. To keep the exposition simple,
assume that at the first iteration, $x_0^1$ either is strictly less than the optimal value,
or equals the value of the objective function for a feasible solution that you have
recorded. If worse comes to worst, you can let $x_0^1 = -\infty$ if you have no
information at all about the problem. In addition to a lower bound, you have a master
list of linear programming problems that must be solved; the only differences
among these comprise revisions in the bounds (5). At iteration 1, the master list
contains a single problem consisting of (1), (2), (4), and (5).

The procedure at iteration $t$ is:

Step 1. Terminate the computations if the master list is empty. Otherwise,
remove a linear programming problem from the master list.

Step 2. Solve the chosen problem. If it has no feasible solution, or if the resultant
optimal value of the objective function is less than or equal to $x_0^t$, then let
$x_0^{t+1} = x_0^t$ and return to Step 1. Otherwise, proceed to Step 3.

Step 3. If the obtained optimal solution to the linear programming problem
satisfies the integer constraints, then record it, let $x_0^{t+1}$ be the associated optimal
value of the objective function, and return to Step 1. Otherwise, proceed to Step 4.

Step 4. Select any variable $x_j$, for $j = 1, 2, \ldots, p$, that does not have an
integer value in the obtained optimal solution to the chosen linear programming
problem. Let $b_j$ denote this value, and $[b_j]$ signify the largest integer less than
or equal to $b_j$. Add two linear programming problems to the master list. These
two problems are identical with the problem chosen in Step 1, except that in one,
the lower bound on $x_j$ is replaced by $[b_j] + 1$, and in the other, the upper bound
on $x_j$ is replaced by $[b_j]$. Let $x_0^{t+1} = x_0^t$, and return to Step 1.

At termination, if you have recorded a feasible solution yielding $x_0^t$, it is optimal;
otherwise, no feasible solution exists. As the example below demonstrates, you
may obtain an integer-valued solution prior to the last iteration. You do not know
that it is optimal, however, until the final iteration. For this reason, the algorithm
may be called an almost-dual method.

Illustration. An example will clarify the details of the procedure. Consider

(8)    maximize  $3x_1 + 3x_2 + 13x_3$

subject to

(9)    $-3x_1 + 6x_2 + 7x_3 \le 8$
       $6x_1 - 3x_2 + 7x_3 \le 8$,

where each $x_j$ must be a nonnegative integer. Suppose we specify the bounds on
each variable as

(10)    $0 \le x_j \le 5$    for $j = 1, 2, 3$.

As usual, let $x_0$ denote the value of the objective function. Find the optimal
solution by inspection.

At iteration 1, let the lower bound be $x_0^1 = 0$, since all $x_j = 0$ is feasible.
The master list contains only the linear programming problem (8), (9), and (10),
which is designated as Problem 1. Remove it in Step 1, and in Step 2 find the optimal
solution

(11)    $x_0 = 16$    $x_1 = x_2 = 2\tfrac{2}{3}$    $x_3 = 0$    (Problem 1).

Since the solution is not integer-valued, proceed from Step 3 to Step 4, and select
$x_1$. Then place on the master list

(12)    Problem 2: constraints (9)
            $3 \le x_1 \le 5$    $0 \le x_2 \le 5$    $0 \le x_3 \le 5$

        Problem 3: constraints (9)
            $0 \le x_1 \le 2$    $0 \le x_2 \le 5$    $0 \le x_3 \le 5$.

Returning to Step 1 with $x_0^2 = x_0^1 = 0$, remove Problem 2. As you can verify,
Step 2 establishes that Problem 2 has no feasible solution. Hence, put $x_0^3 = x_0^2 = 0$,
and return to Step 1.

Now remove Problem 3, and obtain in Step 2 the optimal solution

(13)    $x_0 = 15\tfrac{5}{7}$    $x_1 = x_2 = 2$    $x_3 = \tfrac{2}{7}$    (Problem 3),

which is not integer-valued. Therefore, go from Step 3 to Step 4, where $x_3$ is selected,
and place on the master list:

(14)    Problem 4: constraints (9)
            $0 \le x_1 \le 2$    $0 \le x_2 \le 5$    $1 \le x_3 \le 5$

        Problem 5: constraints (9)
            $0 \le x_1 \le 2$    $0 \le x_2 \le 5$    $0 \le x_3 \le 0$.

Observe that Problems 4 and 5 differ from Problem 3 only in the bounds on $x_3$.
Returning to Step 1 with $x_0^4 = 0$, remove Problem 4. The optimal solution is

(15)    $x_0 = 15$    $x_1 = x_2 = \tfrac{1}{3}$    $x_3 = 1$    (Problem 4).

This leads to Step 4; suppose you select $x_2$, yielding, as a consequence,

(16)    Problem 6: constraints (9)
            $0 \le x_1 \le 2$    $1 \le x_2 \le 5$    $1 \le x_3 \le 5$

        Problem 7: constraints (9)
            $0 \le x_1 \le 2$    $0 \le x_2 \le 0$    $1 \le x_3 \le 5$.

Note that Problems 6 and 7 differ from Problem 4 only in the bounds on $x_2$.

Returning to Step 1 with $x_0^5 = 0$, remove Problem 6. Check that Problems 5 and
7 remain on the master list. You will discover in Step 2 that Problem 6 has no
feasible solution, so return to Step 1 with $x_0^6 = 0$. Now remove Problem 7, giving the
optimal solution

(17)    $x_0 = 14\tfrac{6}{7}$    $x_1 = x_2 = 0$    $x_3 = 1\tfrac{1}{7}$    (Problem 7).

Because $x_3$ is fractional, this creates, in Step 4,

(18)    Problem 8: constraints (9)
            $0 \le x_1 \le 2$    $0 \le x_2 \le 0$    $2 \le x_3 \le 5$

        Problem 9: constraints (9)
            $0 \le x_1 \le 2$    $0 \le x_2 \le 0$    $1 \le x_3 \le 1$.

Verify that removing Problem 8 at iteration 7 gives an indication of no feasible
solution in Step 2, and removing Problem 9 at iteration 8 gives

(19)    $x_0 = 13$    $x_1 = x_2 = 0$    $x_3 = 1$    (Problem 9)

in Step 2. Therefore at Step 3, you record (19) and let $x_0^9 = 13$.

Returning to Step 1, you find that only Problem 5 remains on the master list.
The optimal linear programming solution is

(20)    $x_0 = 13$    $x_1 = 2$    $x_2 = 2\tfrac{1}{3}$    $x_3 = 0$    (Problem 5).

Since $x_0$ in (20) equals $x_0^9$, you return to Step 1 and terminate the computations,
as the master list is now empty. The optimal solution to the integer programming
problem is (19), which was recorded at iteration 8.

Note that arbitrary choices arise in the algorithm at two places: selection of the
problem to remove in Step 1 and of the variable $x_j$ to provide additional problems
in Step 4. The number of iterations to solve a model can vary considerably,
depending on how these selections are actually made. (For example, the choice of
Problem 4 instead of Problem 5 at iteration 4 turned out to be auspicious.) Auxiliary
numerical tests have been devised to assist in making these choices, but they will
not be discussed here, since they are of interest mainly to technical specialists.
By the same token, having a better bound $x_0^t$ lessens the likelihood that at Step 2
you will have to go on to Step 3 and Step 4.
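The four steps can be traced mechanically on the illustration (8) through (10).
The following Python sketch is one possible rendering, not a definitive
implementation. It assumes a pure integer model ($p = n$) with finite bounds on
every variable, treats the master list as a stack, and always branches on the first
fractional variable. The names solve_square, solve_lp, and branch_and_bound are
hypothetical, and the small linear programming routine merely enumerates vertices
with exact fractions, which is adequate only for very small models. Because the
arbitrary choices in Step 1 and Step 4 are resolved by these fixed rules, the sequence
of problems visited need not match the one traced above, although the optimal
value must agree.

```python
from fractions import Fraction
from itertools import combinations
from math import floor


def solve_square(A, b):
    """Solve A x = b exactly by Gauss-Jordan elimination; None if singular."""
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            return None
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                M[r] = [vr - M[r][col] * vc for vr, vc in zip(M[r], M[col])]
    return [M[r][n] for r in range(n)]


def solve_lp(c, A, b, lower, upper):
    """Maximize c.x subject to A x <= b and lower <= x <= upper.

    Assumption: the region is bounded, so an optimum lies at a vertex and
    brute-force vertex enumeration suffices. Returns (value, x) or None.
    """
    n = len(c)
    rows = [(list(a), bi) for a, bi in zip(A, b)]
    for j in range(n):
        unit = [1 if k == j else 0 for k in range(n)]
        rows.append((unit, upper[j]))                  # x_j <= U_j
        rows.append(([-u for u in unit], -lower[j]))   # -x_j <= -L_j
    best = None
    for active in combinations(rows, n):
        x = solve_square([a for a, _ in active], [bi for _, bi in active])
        if x is None:
            continue
        if all(sum(ai * xi for ai, xi in zip(a, x)) <= bi for a, bi in rows):
            value = sum(cj * xj for cj, xj in zip(c, x))
            if best is None or value > best[0]:
                best = (value, x)
    return best


def branch_and_bound(c, A, b, lower, upper):
    """Steps 1 to 4 of the text for a pure integer model; returns (value, x)."""
    best_value, best_x = None, None            # None plays the role of -infinity
    master = [(list(lower), list(upper))]      # master list of bound revisions
    while master:                              # Step 1
        lo, hi = master.pop()
        lp = solve_lp(c, A, b, lo, hi)         # Step 2
        if lp is None or (best_value is not None and lp[0] <= best_value):
            continue
        value, x = lp
        fractional = [j for j, xj in enumerate(x) if xj.denominator != 1]
        if not fractional:                     # Step 3: record the solution
            best_value, best_x = value, x
            continue
        j = fractional[0]                      # Step 4: branch on x_j
        down = floor(x[j])
        master.append((lo, hi[:j] + [down] + hi[j + 1:]))      # x_j <= [b_j]
        master.append((lo[:j] + [down + 1] + lo[j + 1:], hi))  # x_j >= [b_j] + 1
    return best_value, best_x


value, x = branch_and_bound([3, 3, 13], [[-3, 6, 7], [6, -3, 7]], [8, 8],
                            [0, 0, 0], [5, 5, 5])
print(value, x)
```

For the data in (8) through (10), the call returns the value 13 with
$x_1 = x_2 = 0$ and $x_3 = 1$, in agreement with (19).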


Recapitulation. The history of the iterations can be displayed by means
of a tree-like diagram, shown in Fig. 13.1. Notice each node in the tree diagram
represents a problem on the master list; each branch leads to one of the problems
added to the master list in Step 4. The pertinence of this graphical analogy explains
why the word “branch” is used in the algorithm’s name, “Branch-and-Bound.”
The word “bound” is suggested by the test in Step 2. (The technique might
alternatively be designated Branch-and-Prune.) Note that the branch for
Problem 5 is terminated, even though the optimal solution is not integer-valued. The
reason is that when you reach Problem 5 at $t = 9$, you already have a feasible
solution with an objective function value 13. Therefore, branching further by
imposing stronger restrictions can never improve on the previously obtained
feasible solution.

Figure 13.1. Branch-and-Bound Example.

Now you can see why the algorithm is called a backtracking method.
When you terminate certain branches (as at Problems 2, 6, 8, and 9 in the
example), you must go back up the branch in search of any unsolved problem that
remains on the tree.

Several approaches aid the optimizing computations in Step 2. For example, by
employing the transformation in Sec. 3.3, each integer variable can be redefined in Step 2 so
that its lower bound is 0 and its upper bound is $U_j - L_j$. This yields a linear
programming model with $m$ full-fledged linear constraints, corresponding to (2), and $p$
upper-bounded nonnegative variables. You can pick the initial basis to be the optimal variables
for the previous problem that generated the current one at Step 4. This starting basis no
longer yields a nonnegative solution, because of the revised bounds; but it satisfies the
conditions for initiating the Dual Simplex Algorithm, given in Sec. 5.8. (An
Upper-Bounded Variables Algorithm in Sec. 5.10 can also be used.)

Concluding remarks. This method shares with the Cutting-Plane 
Algorithm an important computational merit: it can be efficiently coded into a 
computer routine, and therefore run without manual intervention at any iteration. 
The method works well in problems containing a few integer-valued variables, 
but if the number of variables is large, or if the linear programming solution to the 
problem is far from optimal (as was the case for the example), then the number of 
iterations may be too large for a practical application of the algorithm. 

Of course, the method is applicable to combinatorial problems containing only
zero-one integer variables; then each $L_j = 0$ and $U_j = 1$. In this case, the two
problems added at Step 4 actually imply the alternatives $x_j = 0$ and $x_j = 1$.
You can often improve on the efficiency of the Branch-and-Bound Algorithm,
however, by adapting it to take account of a model’s special structure. In the remainder
of the chapter, you will see several ways to do this.

13.6 TRAVELING SALESMAN PROBLEM 

We now illustrate several ways the Branch-and-Bound Algorithm of the
preceding section can be modified to take advantage of the special structure of
combinatorial problems. We have selected the traveling salesman problem for this purpose,
and exhibit three different versions of the branch-and-bound approach. Many of
the formulations in Sec. 13.2 that contain zero-one variables can be solved by one
or more of these approaches, after suitable adaptation. The practical consequence
of taking advantage of a model’s special structure is that you then can handle
moderately large problems. For example, traveling salesman problems containing
up to 40 cities have been solved in a reasonable amount of computer time using the
methods below.

It is convenient to characterize the traveling salesman problem by the
formulation in Sec. 13.2:

(1)    minimize  $\sum_{i=1}^{n} \sum_{j=1}^{n} c_{ij} x_{ij}$,    where $c_{ii} = \infty$ for $i = 1, 2, \ldots, n$

subject to

(2)    $\sum_{j=1}^{n} x_{ij} = 1$    for $i = 1, 2, \ldots, n$    (departure)

(3)    $\sum_{i=1}^{n} x_{ij} = 1$    for $j = 1, 2, \ldots, n$    (arrival)

(4)    $x_{ij}$ nonnegative integer    for all $i$ and $j$

(5)    solution is a tour.


Recall $x_{ij} = 1$ implies that the salesman travels from City $i$ directly to City $j$,
and $c_{ij} \ge 0$ is the corresponding distance. When (5) is dropped, (1) through (4)
constitute an assignment model (Sec. 6.4), which can be solved by any of the
techniques explained in Appendix I. (Note that the objective here is “minimization,”
whereas it was “maximization” in the preceding section. The methods below will
differ in certain obvious details from the Branch-and-Bound Algorithm to
accommodate this change.)

A particular example will be used to illustrate the three different
branch-and-bound versions; the data are given in Fig. 13.2. Observe that the unique solution
to the associated assignment model is $x_{15} = x_{51} = x_{23} = x_{34} = x_{42} = 1$, which does
not satisfy (5) as it contains two cycles. Therefore, the sum of the distances for
this solution, namely 60, is strictly less than the length of an optimal tour. There
are $(n - 1)! = 4! = 24$ possible tours; before going on, see if you can spot an
optimal one.

                     Distances $c_{ij}$

                          To City
    From City      1     2     3     4     5
        1          ∞    10    25    25    10
        2          1     ∞    10    15     2
        3          8     9     ∞    20    10
        4         14    10    24     ∞    15
        5         10     8    25    27     ∞

    Optimal Assignment: $c_{15} + c_{23} + c_{34} + c_{42} + c_{51} = 60$

Figure 13.2. Traveling Salesman Problem.

Method of excluded subtours. Of the three methods to be discussed,
this version makes the smallest modification of the Branch-and-Bound Algorithm
in the preceding section. At the beginning of an iteration $t$, you have an upper
bound $x_0^t$ on the optimal value of the objective function. You can let $x_0^1$ be any
suitably large number, such as the sum $(c_{12} + c_{23} + \cdots + c_{n1})$ corresponding to
the tour City 1 to City 2 to . . . City $n$ to City 1. In addition, you have a master
list containing a number of assignment models. All of these are of the form (1)
through (4), but differ from each other in that various $c_{ij}$ values have been revised
to equal $\infty$. At iteration 1, the master list consists of the assignment model (1)
through (4), which we designate as Problem 1. Define a subtour as a cycle
containing fewer than all $n$ cities, such as $x_{15} = x_{51} = 1$, and $x_{23} = x_{34} = x_{42} = 1$.

The procedure at iteration $t$ is

Step 1. Terminate the computations if the master list is empty. Otherwise,
remove a problem from the master list.

Step 2. Solve the chosen assignment model. If the optimal value of the objective
function (which may be $\infty$) is greater than or equal to $x_0^t$, then let
$x_0^{t+1} = x_0^t$, and return to Step 1. Otherwise, proceed to Step 3.

Step 3. If the obtained optimal solution to the chosen assignment model is a
tour, then record it, let $x_0^{t+1}$ be the associated optimal value of the objective
function, and return to Step 1. Otherwise, proceed to Step 4.

Step 4. Select in the obtained optimal solution of the chosen assignment model
a subtour that contains the smallest number of cities. For each of the $x_{ij} = 1$
in the selected subtour, add a problem to the master list, and set the corresponding
$c_{ij} = \infty$; leave all the other costs the same as in the problem chosen in Step 1.
Let $x_0^{t+1} = x_0^t$, and return to Step 1.

The details of the method are clarified by the example shown in Fig. 13.2. At
iteration 1, let $x_0^1 = 65$, corresponding to the tour City 1 to City 2 to . . . City 5
to City 1. The master list contains only the assignment model with the $c_{ij}$ given
in Fig. 13.2, so you must choose this problem at Step 1. You find that the optimal
solution in Step 2 yields an objective-function value 60, as shown in Fig. 13.2,
bringing you to Step 3. Since the obtained optimal solution contains two subtours, you
proceed to Step 4. Select the subtour $x_{15} = x_{51} = 1$, because this contains one less
city than does the subtour $x_{23} = x_{34} = x_{42} = 1$. Add two problems to the master
list, letting one have $c_{15} = \infty$, and the other $c_{51} = \infty$; leave all the remaining
$c_{ij}$ unchanged. Let $x_0^2 = x_0^1 = 65$.

The justification for Step 4 is elementary. Since the subtour $x_{15} = x_{51} = 1$ is
infeasible, either $x_{15} = 0$ or $x_{51} = 0$ (or both) in an optimal tour. The problems
added in Step 4 allow for each of these possibilities. The reasoning applies with
equal validity to any subtour. (Using subtours as the source for generating new
problems at Step 4 leads to a noteworthy difference between this version and the
Branch-and-Bound Algorithm in the preceding section. Here, several problems
on the master list may turn out to be identical. This occurrence is easily
recognizable when performing manual computations on a small-sized example; the
redundant problems can be discarded when they are discovered. If the method
is computerized, it may be preferable to perform the redundant calculations
rather than to check for duplications.)

Returning to Step 1, select the problem with $c_{15} = \infty$, which we designate as
Problem 2. The optimal value of the objective function for the corresponding
assignment model is 65. This represents a lower bound on any tour that excludes
the route City 1 to City 5. Because you already have a tour with length 65, there
is no need to consider this problem any further. Thus, according to Step 2, let
$x_0^3 = x_0^2 = 65$, and return to Step 1. Select the remaining problem with $c_{51} = \infty$,
which we designate as Problem 3. Now the optimal value of the objective
function for the assignment model is 62, taking you to Step 3. You obtain the associated
optimal solution

(6)    minimum distance = 62,    $x_{15} = x_{52} = x_{23} = x_{34} = x_{41} = 1$,

which is a tour. Therefore, record (6), let $x_0^4 = 62$, and return to Step 1, where you
find the master list is empty, thereby terminating the iterations. A tree diagram
for this example is given in Fig. 13.3.

Figure 13.3. Method of Excluded Subtours.
Optimal Tour: $x_0 = 62$, $x_{15} = x_{52} = x_{23} = x_{34} = x_{41} = 1$ (Problem 3).
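The method of excluded subtours can be sketched in a few lines of Python. The
version below is illustrative only: the names solve_assignment, subtours, and
excluded_subtours are hypothetical, cities are numbered from 0 rather than 1, and
the assignment model is solved by enumerating all permutations, which is tolerable
for five cities but not for realistic sizes (the techniques of Appendix I would be
used instead). The distance array is transcribed from Fig. 13.2.

```python
from itertools import permutations

INF = float("inf")

# Distances c_ij from Fig. 13.2 (row = from city, column = to city, 0-based).
FIG_13_2 = [
    [INF, 10, 25, 25, 10],
    [1, INF, 10, 15, 2],
    [8, 9, INF, 20, 10],
    [14, 10, 24, INF, 15],
    [10, 8, 25, 27, INF],
]


def solve_assignment(cost):
    """Brute-force assignment model: returns (value, successor tuple)."""
    n = len(cost)
    best = (INF, None)
    for perm in permutations(range(n)):
        total = sum(cost[i][perm[i]] for i in range(n))
        if total < best[0]:
            best = (total, perm)
    return best


def subtours(successor):
    """Split an assignment (city -> next city) into its cycles."""
    seen, cycles = set(), []
    for start in range(len(successor)):
        if start not in seen:
            cycle, city = [], start
            while city not in seen:
                seen.add(city)
                cycle.append(city)
                city = successor[city]
            cycles.append(cycle)
    return cycles


def excluded_subtours(cost):
    n = len(cost)
    # Initial upper bound: the tour City 1 to City 2 to ... City n to City 1.
    best = tuple((i + 1) % n for i in range(n))
    upper = sum(cost[i][best[i]] for i in range(n))
    master = [cost]
    while master:                                   # Step 1
        current = master.pop()
        value, successor = solve_assignment(current)    # Step 2
        if value >= upper:
            continue
        cycles = subtours(successor)
        if len(cycles) == 1:                        # Step 3: a tour, record it
            upper, best = value, successor
            continue
        shortest = min(cycles, key=len)             # Step 4: smallest subtour
        for i in shortest:
            revised = [row[:] for row in current]
            revised[i][successor[i]] = INF          # exclude one route of the subtour
            master.append(revised)
    return upper, best


length, successor = excluded_subtours(FIG_13_2)
print(length, successor)
```

Applied to Fig. 13.2, the sketch returns length 62 with the successor tuple
(4, 2, 3, 0, 1), that is, City 1 to City 5 to City 2 to City 3 to City 4 to City 1
in the numbering of the text.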

Method of designated routes. The version above succeeds in limiting 
the number of branches to be explored by calculating an effective lower bound 
on the objective function for any tour emanating from each problem. But to 
obtain such a bound, you had to perform an assignment model optimization. 
Now you will see how to find more easily calculated bounds. The price you pay 
for this simplification, however, is that you will have to explore more branches in 
the associated tree for this method. 

At the beginning of any iteration $t$, you have an upper bound $x_0^t$ on the optimal
value of the objective function; you can obtain $x_0^1$ as before. In addition, you have
a master list of problems in which a subset of $c_{ij}$ values have been revised to equal
$\infty$ and a subset of $x_{ij} = 1$. There are no subtours among the $x_{ij} = 1$. At iteration 1,
the master list consists of two problems; in one, a selected $c_{ij}$ value has been
revised to equal $\infty$ and in the other, the corresponding $x_{ij} = 1$ and $c_{ji} = \infty$.
(Letting $c_{ji} = \infty$ rules out letting $x_{ji} = 1$, which would give the subtour City $i$
to City $j$ to City $i$.)

The status of each problem in the master list can be conveniently summarized
as follows. Take the array of $c_{ij}$, as in Fig. 13.2. Cross out the $h$th row and $k$th
column if $x_{hk} = 1$. The remaining $c_{ij}$ (some of which equal $\infty$) correspond to
$x_{ij}$ not yet assigned. A lower bound on the optimal value of the objective function for
any tour containing the specified subset of $x_{ij} = 1$ can be calculated in a variety of
ways. In general, the larger the lower bound, the fewer the branches that will
have to be examined.

One simple, but moderately effective, bounding technique is based on the
observation that the distance must be at least equal to the sum of the $c_{ij}$ where $x_{ij} = 1$,
plus the sum of the smallest $c_{ij}$ in each of the rows that have not been crossed
out. You can (and should) raise the bound even further by subtracting the smallest
$c_{ij}$ in each row that has not been crossed out from every other $c_{ij}$ in that row.
This yields the so-called diminished distances. Then add to the previous sum the
smallest diminished distance in each of the columns that have not been crossed
out. An example is shown in Fig. 13.4. Observe that $x_{23}$ has been designated equal
to 1. The top table indicates the minimum distances for each row, excluding the
second row. Each of these minima is subtracted from every element in the
corresponding row to yield the bottom table.
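The arithmetic of this bounding technique is easy to mechanize. The Python
sketch below is only an illustration under stated assumptions: the function name
lower_bound and the dictionary convention (a key h mapped to k stands for
$x_{hk} = 1$, with cities numbered from 0) are hypothetical, and the distance array
is transcribed from Fig. 13.2. It adds the designated distances, the row minima of
the rows not crossed out, and the column minima of the diminished distances.

```python
INF = float("inf")

# Distances c_ij from Fig. 13.2 (row = from city, column = to city, 0-based).
FIG_13_2 = [
    [INF, 10, 25, 25, 10],
    [1, INF, 10, 15, 2],
    [8, 9, INF, 20, 10],
    [14, 10, 24, INF, 15],
    [10, 8, 25, 27, INF],
]


def lower_bound(cost, assigned):
    """Bound for any tour containing x_hk = 1 for every (h, k) in `assigned`."""
    n = len(cost)
    rows = [i for i in range(n) if i not in assigned]            # rows not crossed out
    cols = [j for j in range(n) if j not in assigned.values()]   # columns not crossed out
    bound = sum(cost[h][k] for h, k in assigned.items())
    row_min = {i: min(cost[i][j] for j in cols) for i in rows}
    if any(value == INF for value in row_min.values()):
        return INF                     # some city has no admissible departure
    bound += sum(row_min.values())
    for j in cols:
        # smallest diminished distance in column j
        bound += min(cost[i][j] - row_min[i] for i in rows)
    return bound


# Problem with x_23 = 1 designated (0-based: city 1 -> city 2) and c_32 revised.
revised = [row[:] for row in FIG_13_2]
revised[2][1] = INF
print(lower_bound(revised, {1: 2}))
```

With $x_{23} = 1$ designated, the row minima of rows 1, 3, 4, and 5 are 10, 8, 10,
and 8; the only positive column minimum among the diminished distances is 12
(column 4); and $c_{23} = 10$, so the sketch gives the bound 10 + 36 + 12 = 58.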

(Actually, you can obtain a lower bound by solving an assignment model,
formed from the $c_{ij}$ in the rows and columns that have not been crossed out, and
then adding the resultant optimal value of the objective function to the sum of the
$c_{ij}$ for which the specified subset of $x_{ij} = 1$. This approach is probably
computationally more burdensome than the previous version of the algorithm or the
simple bounding technique just described.)

Figure 13.4. Bounding Technique for Designated Routes.

The procedure at iteration $t$ is

Step 1. Terminate the computations if the master list is empty. Otherwise,
remove a problem from the master list.

Step 2. Find a lower bound on the objective function for any tour emanating
from the chosen problem. If the lower bound is greater than or equal to $x_0^t$, then
let $x_0^{t+1} = x_0^t$, and return to Step 1. Otherwise, proceed to Step 3.

Step 3. If the current solution is a tour, then record it, let $x_0^{t+1}$ be the associated
value of the objective function, and return to Step 1. Otherwise, proceed to Step 4.

Step 4. If possible, select a variable $x_{hk}$ not in the current solution, such that
$c_{hk} < \infty$, provided that $x_{hk} = 1$ does not produce a subtour among the variables
already in the solution. Given the selection, add two problems to the master list.
Let each be identical with the problem chosen in Step 1, except that in one,
make the revision $c_{hk} = \infty$, and in the other, include the stipulation $x_{hk} = 1$
and the revision $c_{kh} = \infty$. Let $x_0^{t+1} = x_0^t$, and return to Step 1.

Note the two principal differences between this and the previous modification
of the Branch-and-Bound Algorithm. In the present version, Step 2 calculates a
lower bound for the chosen problem, but does not perform an optimization.
Further, Step 4 may add two problems to the master list, but adds no problems if there
is no $x_{hk}$ satisfying the stated stipulations. In the previous version, Step 4 creates
between 2 and $n/2$ problems.

The method, applied to the example in Fig. 13.2, gives the tree displayed in
Fig. 13.5. Of course, the branches explored in this tree depend on the choice of
problems from the master list in Step 1 and on the selection of $x_{hk}$ in Step 4. The
rule for Step 1 was to select the highest-numbered problem that remained on the
list, where in Step 4 we used the convention of assigning the larger number to the
problem with $x_{hk} = 1$. The optimal tour was obtained at iteration $t = 8$ for
Problem 12.

The number of branches can often be diminished by performing, at Step 4, a few
more revisions of the $c_{ij}$ in the problem added for $x_{hk} = 1$. We already indicated the
revision $c_{kh} = \infty$, to rule out the subtour City $h$ to City $k$ to City $h$. But other $c_{ij}$ values
can be set equal to $\infty$ to rule out longer subtours. For example, if $x_{12} = x_{23} = 1$ is in the
current solution and $x_{34}$ is selected, then you can make the revisions $c_{41} = c_{42} = \infty$.
If this procedure is used throughout, you need not state at Step 4 the restriction, “$x_{hk} = 1$
does not produce a subtour ...,” because the restriction $c_{hk} < \infty$ rules out such a
possibility.

Method of partial tours. The principal reason for presenting this
version is to exhibit an approach that extends to many other kinds of
combinatorial models, such as sequencing problems (in which $x_{ij} = 1$ corresponds to
putting the $i$th object in the $j$th position of a sequence). We will need to employ
the notion of a partial tour, which is defined as a sequence of less than $n$ distinct
cities, starting with City 1 (such as City 1 to City $i$ to City $j$ to City $k$, where
$n > 4$).

At the beginning of any iteration $t$, you have an upper bound $x_0^t$ on the optimal
value of the objective function; you can obtain $x_0^1$ as usual. In addition, you have
a master list of problems, each of which contains a subset of $x_{ij} = 1$, representing a
partial tour, and a subset of $c_{ij}$ values that have been revised to equal $\infty$. At iteration 1,
the master list contains $n - 1$ problems, one associated with each $x_{1j} = 1$ and the
revision $c_{j1} = \infty$ for $j = 2, 3, \ldots, n$. You employ the same technique as in the
method of designated routes to calculate a lower bound on the optimal value of the
objective function for any tour that is a completion of the partial tour.
(Alternatively, you can perform an assignment model optimization on the $c_{ij}$ in the rows
and columns not associated with the subset of $x_{ij} = 1$ in the partial tour.)

Figure 13.5. Method of Designated Routes.

The procedure differs from the method of designated routes only at Step 4.

Step 4. For each City $k$ not already in the partial tour of the problem chosen in
Step 1, add a problem to the master list extending the partial tour from City $j$,
which is the last city listed in the partial tour, to City $k$; also make the revision
$c_{kj} = \infty$. Let $x_0^{t+1} = x_0^t$, and return to Step 1.

In this method, if a partial tour contains $m$ cities (including City 1), then Step 4
adds $n - m$ problems to the master list. Notice there is no arbitrary selection to
be made in Step 4. (Redundant calculations can occur in this method when there
are several problems on the master list composed of the same $m$ cities in their
partial tours, for example, City 1 to City 2 to City 3, and City 1 to City 3 to
City 2. In this event, the bounding computations in Step 2 are duplicated.)

Applied to the example in Fig. 13.2, the algorithm produces the tree displayed
in Fig. 13.6. In this case, the branches explored depend on the selection of
problems from the master list in Step 1. (We used an auspicious procedure in solving
this example.)

Figure 13.6. Method of Partial Tours.
*Diminished Distances give a bound 55; but 58 is valid, since it is a bound for any
tour emanating from Problem 4.
Optimal Tour: $x_0 = 62$, $x_{15} = x_{52} = x_{23} = x_{34} = x_{41} = 1$ (Problem 8).
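A compact Python sketch of the method of partial tours follows. It is offered as an
illustration under several assumptions, not as the procedure used to draw Fig. 13.6:
the name partial_tours is hypothetical, cities are numbered from 0, the master list
is handled as a stack, and the bound in Step 2 uses only the simpler half of the
bounding technique (the distance already traveled plus the smallest admissible
distance out of every city that has not yet been left). A sequence containing all
$n$ cities is treated as a leaf and closed by returning to the starting city.

```python
INF = float("inf")

# Distances c_ij from Fig. 13.2 (row = from city, column = to city, 0-based).
FIG_13_2 = [
    [INF, 10, 25, 25, 10],
    [1, INF, 10, 15, 2],
    [8, 9, INF, 20, 10],
    [14, 10, 24, INF, 15],
    [10, 8, 25, 27, INF],
]


def partial_tours(cost):
    n = len(cost)
    # Initial upper bound: the tour City 1 to City 2 to ... City n to City 1.
    best = list(range(n))
    upper = sum(cost[i][(i + 1) % n] for i in range(n))
    # Iteration 1: n - 1 problems, one for each first step out of the start city.
    master = [[0, k] for k in range(1, n)]
    while master:                                        # Step 1
        tour = master.pop()
        length = sum(cost[a][b] for a, b in zip(tour, tour[1:]))
        if len(tour) == n:                               # complete: close the tour
            total = length + cost[tour[-1]][0]
            if total < upper:                            # Step 3: record it
                upper, best = total, tour
            continue
        unvisited = [k for k in range(n) if k not in tour]
        # Step 2: the last city must go to an unvisited city; each unvisited
        # city must go to another unvisited city or back to the start.
        bound = length + min(cost[tour[-1]][k] for k in unvisited)
        bound += sum(min(cost[i][k] for k in unvisited + [0] if k != i)
                     for i in unvisited)
        if bound >= upper:
            continue
        master.extend(tour + [k] for k in unvisited)     # Step 4: n - m problems
    return upper, best


length, tour = partial_tours(FIG_13_2)
print(length, tour)
```

On the data of Fig. 13.2 the sketch returns length 62 and the sequence
[0, 4, 1, 2, 3], which is City 1 to City 5 to City 2 to City 3 to City 4 (and back
to City 1) in the numbering of the text.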

Concluding remarks. As you can easily discover by making infelicitous
choices in Step 1 and Step 4 of the three methods, the computational burden
depends critically on how well you resolve the arbitrary selections. You will often
find it worthwhile in branch-and-bound procedures to perform extra calculations
in Step 2 to obtain a truly good bound on the optimal value of the objective
function. Your effort will be rewarded by having to explore fewer branches. Also, you
can usually take advantage in Step 2 of the calculations already done on the problem
that gave rise to the current one. The practical success of applying a
branch-and-bound approach to solve an actual combinatorial problem depends
considerably on exploiting the special structure of your model in order to implement the
foregoing suggestions.

13.7 PARTIAL (IMPLICIT) ENUMERATION ALGORITHM

Many important integer programming models can be characterized by the
problem

(1)    maximize  $\sum_{j=1}^{n} c_j x_j$

subject to

(2)    $\sum_{j=1}^{n} a_{ij} x_j \le b_i$    for $i = 1, 2, \ldots, m$,

where the integer-value stipulations are simply

(3)    $x_j = 0$ or 1    for $j = 1, 2, \ldots, n$.

Assume each $c_j$ is an integer (a condition that is always achievable by a proper
scaling of the objective function, provided the original coefficients are rational
numbers).

Capital budgeting models frequently can be put in the form (1) through (3).
In addition, many pure integer programming models can be suitably transformed
so that each variable satisfies (3). To illustrate, suppose a variable $x$ has a
determinable upper bound $U$. Then wherever $x$ appears, you may substitute an
equivalent binary representation

(4)    $x = 1w_1 + 2w_2 + 4w_3 + \cdots + 2^{k-1} w_k$,

where each of $w_1, w_2, \ldots, w_k$ equals 0 or 1, and you choose a value of $k$ such that
$2^k - 1 \ge U$. For example, if you know a priori that $x_1 \le 6$, and the model contains
the constraint $3x_1 + x_2 \le 25$, then let $x_1 = 1w_1 + 2w_2 + 4w_3$ and write the
constraint as $3w_1 + 6w_2 + 12w_3 + x_2 \le 25$. (Observe in the example that the
transformation expressed in (4) loses some information. You have formally introduced the
possibility of $w_1 = w_2 = w_3 = 1$, corresponding to a maximum value of 7 for $x_1$,
whereas you know that in fact $x_1$ cannot be larger than 6. This loss can easily
be eliminated by using a coefficient appropriately smaller than $2^{k-1}$. In the
example, you can use the coefficient 3 for $w_3$ instead of 4. Give a general rule for the
coefficient of $w_k$ so as to keep the transformation equivalent to the initial
information about the upper bound of the variable.)
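If you would like to check a candidate rule numerically, a short Python sketch
may help. The rule tried here is an assumption, not a statement from the text: keep
the coefficients $1, 2, \ldots, 2^{k-2}$ for the first $k - 1$ binary variables, take $k$ as
small as possible, and give $w_k$ the coefficient $U - (2^{k-1} - 1)$. The function names
binary_coefficients and reachable_values are hypothetical.

```python
from itertools import product


def binary_coefficients(upper):
    """Coefficients of w_1, ..., w_k under the assumed rule (upper >= 1)."""
    k = upper.bit_length()                   # smallest k with 2**k - 1 >= upper
    coefficients = [2 ** q for q in range(k - 1)]
    coefficients.append(upper - (2 ** (k - 1) - 1))   # trimmed coefficient of w_k
    return coefficients


def reachable_values(coefficients):
    """All values of sum(coefficient * w) over zero-one choices of w."""
    return {
        sum(c * w for c, w in zip(coefficients, bits))
        for bits in product((0, 1), repeat=len(coefficients))
    }


print(binary_coefficients(6))                      # the example with x_1 <= 6
print(sorted(reachable_values(binary_coefficients(6))))
```

For $U = 6$ the sketch yields the coefficients 1, 2, and 3, matching the coefficient 3
for $w_3$ suggested above, and the attainable values of $x_1$ are exactly 0 through 6.
Some values then have more than one binary representation, which is harmless for
the equivalence of the model.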

Recall that if you applied the Branch-and-Bound Algorithm of Sec. 13.5
to (1) through (3), you would have to solve a sequence of linear programming
problems. The alternative version below exploits the zero-one conditions (3), and
thereby limits the arithmetic to additions (and subtractions). For this reason, the
method is sometimes referred to as the Additive Algorithm.

If you take only (3) into consideration, there are $2^n$ possible assignments of
values to $(x_1, x_2, \ldots, x_n)$. Naturally, many of these are not feasible in the linear
constraints (2), and very few are optimal. Consider a subset of the $x_j$, where each
such $x_j$ has an assigned value (either zero or one). This subset is called a
partial solution. The $x_j$ not included in a partial solution are termed
free variables. Any specific assignment of values to the free variables is
denoted as a completion of the associated partial solution. If a partial solution
contains $s$ variables, then there are $2^{n-s}$ completions. In this algorithm, each
problem on the master list corresponds to a partial solution [not necessarily feasible in
(2)], and the possible completions give rise to the branches.

The following example will illustrate the terms in the preceding paragraph.
Suppose $n = 5$. Then ($x_2 = 1$, $x_4 = 0$) is one partial solution, and the
corresponding free variables are $x_1$, $x_3$, and $x_5$. Letting $x_1 = 0$, $x_3 = 1$, and $x_5 = 1$ yields one
completion of the partial solution. Altogether, there are $2^{5-2} = 8$ completions of
($x_2 = 1$, $x_4 = 0$).
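The count of completions can be confirmed by listing them. In the Python sketch
below, the function name completions and the use of a dictionary keyed by the
variable index $j$ are illustrative assumptions.

```python
from itertools import product


def completions(n, partial):
    """Yield every completion of `partial`, a dict {j: 0 or 1} with 1 <= j <= n."""
    free = [j for j in range(1, n + 1) if j not in partial]
    for values in product((0, 1), repeat=len(free)):
        # Merge the assigned values with one choice for the free variables.
        yield {**partial, **dict(zip(free, values))}


found = list(completions(5, {2: 1, 4: 0}))
print(len(found))
for solution in found:
    print([solution[j] for j in range(1, 6)])
```

The list contains $2^{5-2} = 8$ entries, among them the completion
$x_1 = 0$, $x_3 = 1$, $x_5 = 1$ mentioned above.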

Suppose you have a lower bound on the optimal value of the objective
function; assume you have recorded a feasible solution yielding this lower bound.
Then given a partial solution, you need not branch further if somehow you can
show that there is no feasible completion having an objective-function value
greater than the current lower bound. In this case, the partial solution is said to be
fathomed. When you fathom a partial solution containing $s$ variables, you
have implicitly enumerated $2^{n-s}$ possible assignments satisfying (3).

There is a close connection between the central idea of this method and the
Principle of Optimality in dynamic programming. The corresponding
proposition is that given a partial solution, the values of the remaining variables must be
optimized accordingly in order for a completion to be optimal. If there are no
values of the remaining variables that yield a feasible solution or if the resultant
optimal values for these variables yield a solution that is worse than a previously
found solution, then there is no optimal solution containing the given partial
solution.

At any iteration $t$, you have a lower bound $x_0^t$ on the optimal value of the
objective function. The value of $x_0^1$ can be chosen as in the Branch-and-Bound Algorithm,
Sec. 13.5. In addition, you have a master list of problems, with each problem
specifying a different partial solution. At iteration 1, the master list contains two
problems formed by selecting an $x_k$ and letting the partial solution for one be
$x_k = 0$, and for the other be $x_k = 1$.

It is helpful to give a few motivating remarks before explaining the procedure.
For a specified partial solution, there are several tests you can apply to find out
whether there can be any feasible completion that has an objective-function value
greater than the current lower bound. One simple test is suggested by the two
examples below. If the model contains the constraint

(5)    $-1x_1 + 2x_2 - 6x_3 + 4x_4 - 1x_5 \le 0$,

then there is no completion of the partial solution ($x_1 = 1$, $x_3 = 0$, $x_4 = 1$) that
is feasible in (5). Explain why. Suppose that the objective function is

(6)    $2x_1 + 1x_2 + 5x_3 + 3x_4 - 1x_5$,

and $x_0^t = 7$. Then, as you should also check, a completion must satisfy the
constraint

(7)    $2x_1 + 1x_2 + 5x_3 + 3x_4 - 1x_5 \ge 8$
   or  $-2x_1 - 1x_2 - 5x_3 - 3x_4 + 1x_5 \le -8$,

in order to make an improvement on $x_0^t$. No completion of the partial solution
($x_2 = 1$, $x_3 = 0$, $x_5 = 1$) is feasible in (7). Explain why.

The logic underlying the preceding examples can be systematized as follows.
Given a partial solution, view the linear constraints as

(8) $\sum_{\text{all free variables}} a_{ij} x_j \le b_i - \sum_{\text{partial solution variables}} a_{ij} x_j$ for $i = 0, 1, 2, \ldots, m$,

where each $x_j$ in the partial solution has its assigned value in the summations on
the right-hand side of (8), and the coefficients in the restriction for $i = 0$ are
$a_{0j} = -c_j$ and $b_0 = -x_0^t - 1$. Then there is no feasible completion having an
objective-function value greater than the lower bound if

(9) $\sum_{\text{all free variables}} \min(a_{ij}, 0) > b_i - \sum_{\text{partial solution variables}} a_{ij} x_j$ for any $i$.


The term on the left-hand side of (9) is simply the sum of all the negative
coefficients of the free variables. If such a sum is larger than the right-hand side,
then even letting $x_j = 1$ for every free variable where $a_{ij} < 0$ will not satisfy the
$i$th constraint in (8). To check your understanding, apply (9) to the examples
(5) and (7).

Another observation of importance is that given a partial solution, you can
sometimes conclude that a free variable must have a particular value for any
feasible completion with an objective-function value greater than the current lower
bound. To illustrate, verify that if the model contains the constraint

(10) $1x_1 - 1x_2 + 2x_3 - 1x_4 \le 0$,

and the partial solution is ($x_1 = 1$, $x_2 = 1$), then any completion must have $x_3 = 0$
to be feasible in (10). This example suggests a computation analogous to (9).
Specifically, for any free variable $x_k$, if

(11) $\sum_{\text{all free variables}} \min(a_{ij}, 0) + |a_{ik}| > b_i - \sum_{\text{partial solution variables}} a_{ij} x_j$ for any $i$,

then $x_k = 0$ if $a_{ik} > 0$, and $x_k = 1$ if $a_{ik} < 0$.
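
The tests (9) and (11) can be sketched in a few lines of Python. This is only an illustrative sketch: the function names and the dictionary representation of a partial solution are assumptions made here, and the partial solution used for (5) is assumed to be ($x_1 = 1$, $x_3 = 0$, $x_4 = 1$).

```python
def no_feasible_completion(a, b, partial):
    """Test (9) for one constraint sum_j a[j] * x[j] <= b.

    `partial` maps a 1-based variable index to its assigned value (0 or 1).
    Returns True when even the most favorable completion violates the constraint.
    """
    fixed = sum(a[j - 1] * v for j, v in partial.items())
    free = [j for j in range(1, len(a) + 1) if j not in partial]
    best_free = sum(min(a[j - 1], 0) for j in free)
    return best_free > b - fixed


def forced_values(a, b, partial):
    """Test (11): free variables whose value is implied by the constraint."""
    fixed = sum(a[j - 1] * v for j, v in partial.items())
    free = [j for j in range(1, len(a) + 1) if j not in partial]
    best_free = sum(min(a[j - 1], 0) for j in free)
    forced = {}
    for k in free:
        # Variables with a zero coefficient are never forced by this row.
        if a[k - 1] != 0 and best_free + abs(a[k - 1]) > b - fixed:
            forced[k] = 0 if a[k - 1] > 0 else 1
    return forced


# Constraint (5) with the partial solution (x1 = 1, x3 = 0, x4 = 1).
print(no_feasible_completion([-1, 2, -6, 4, -1], 0, {1: 1, 3: 0, 4: 1}))   # True
# Constraint (7), written in <= form, with (x1 = 1, x3 = 0, x5 = 1).
print(no_feasible_completion([-2, -1, -5, -3, 1], -8, {1: 1, 3: 0, 5: 1}))  # True
# Constraint (10) with (x1 = 1, x2 = 1) forces x3 = 0.
print(forced_values([1, -1, 2, -1], 0, {1: 1, 2: 1}))                       # {3: 0}
```

Both functions use only additions and comparisons on the coefficients of a single row, so they can be applied row by row to $i = 0, 1, \ldots, m$.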

[The tests (9) and (11) can also be applied to composite constraints, formed
by adding positive combinations of the constraints (2), namely,

$\sum_{j=1}^{n} \left( \sum_{i=1}^{m} w_i a_{ij} \right) x_j \le \sum_{i=1}^{m} w_i b_i$,

where $w_i \ge 0$. To see how such composite or surrogate constraints can be
useful, consider the constraints $-x_1 - x_2 - x_3 \le -1$, $2x_2 - 2x_4 + 2x_5 \le 0$,
$2x_3 + 2x_4 - 2x_5 \le 0$. Given the partial solution $x_1 = 0$, (9) applied to each
constraint does not indicate infeasibility; but check that (9) applied to the sum of
the constraints shows there is no feasible completion. Similarly, given $x_1 = 1$,
(11) applied to each constraint yields no information about the values of the free
variables. But (11) applied to the sum of the constraints shows that $x_2 = x_3 = 0$
in any feasible completion.]
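
The surrogate-constraint remark can be checked numerically. The sketch below assumes unit weights (a plain sum of the three constraints) and reads the three constraints as $-x_1 - x_2 - x_3 \le -1$, $2x_2 - 2x_4 + 2x_5 \le 0$, and $2x_3 + 2x_4 - 2x_5 \le 0$; the helper names are hypothetical.

```python
def fixed_and_free(a, partial):
    """Split a row into the fixed contribution and the list of free indices."""
    fixed = sum(a[j - 1] * v for j, v in partial.items())
    free = [j for j in range(1, len(a) + 1) if j not in partial]
    return fixed, free


def infeasible(a, b, partial):
    """Test (9) on the row sum_j a[j] * x[j] <= b."""
    fixed, free = fixed_and_free(a, partial)
    return sum(min(a[j - 1], 0) for j in free) > b - fixed


def forced(a, b, partial):
    """Test (11) on the row sum_j a[j] * x[j] <= b."""
    fixed, free = fixed_and_free(a, partial)
    low = sum(min(a[j - 1], 0) for j in free)
    return {k: (0 if a[k - 1] > 0 else 1)
            for k in free
            if a[k - 1] != 0 and low + abs(a[k - 1]) > b - fixed}


rows = [([-1, -1, -1, 0, 0], -1), ([0, 2, 0, -2, 2], 0), ([0, 0, 2, 2, -2], 0)]
# Surrogate constraint: the sum of the three rows (all weights equal to 1).
surrogate = ([sum(col) for col in zip(*(a for a, _ in rows))],
             sum(b for _, b in rows))

print([infeasible(a, b, {1: 0}) for a, b in rows])  # [False, False, False]
print(infeasible(*surrogate, {1: 0}))               # True
print([forced(a, b, {1: 1}) for a, b in rows])      # [{}, {}, {}]
print(forced(*surrogate, {1: 1}))                   # {2: 0, 3: 0}
```

The summed row is $-x_1 + x_2 + x_3 \le -1$, which is why it detects what no single row can.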

The procedure at iteration $t$ is

Step 1. Terminate the computations if the master list is empty. Otherwise,
remove a problem from the master list.

Step 2. If you can find free variables that must have particular values for any
feasible completion with an objective-function value greater than $x_0^t$, augment
the chosen partial solution accordingly. If you can determine that there is no
feasible completion having an objective-function value greater than $x_0^t$, then let
$x_0^{t+1} = x_0^t$, and return to Step 1. Otherwise, continue to Step 3.

Step 3. If the (augmented) partial solution is complete (contains all $n$ variables),
record it, let $x_0^{t+1}$ be the associated value of the objective function, and
return to Step 1. Otherwise, proceed to Step 4.

Step 4. Select any free variable $x_k$ not in the (augmented) partial solution. Add
two problems to the master list. In one, set $x_k = 0$ in the (augmented) partial
solution, and in the other, set $x_k = 1$. Let $x_0^{t+1} = x_0^t$, and return to Step 1.

At termination, if you have recorded a feasible solution yielding $x_0^t$, it is optimal;
otherwise, no feasible solution exists. The method must terminate in a finite
number of iterations.
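
Steps 1 through 4 can be sketched as follows. This is a minimal illustration under stated assumptions, not an efficient code: the data are assumed to be integers, the master list is a plain Python list used as a stack, the master list starts with the single empty partial solution (Step 4 then creates the first two problems), and the initial lower bound is a value below every attainable objective value rather than one taken from a recorded solution. The function name and the small test instance are hypothetical.

```python
def partial_enumeration(c, A, b):
    """Maximize c.x subject to A x <= b with every x_j in {0, 1}."""
    n = len(c)
    x0 = sum(min(cj, 0) for cj in c) - 1     # lower bound with no recorded solution
    best = None
    master = [{}]                            # partial solutions: index -> 0 or 1

    while master:                            # Step 1
        partial = dict(master.pop())
        # Row i = 0 is the objective restriction -c.x <= -x0 - 1.
        rows = [([-cj for cj in c], -x0 - 1)] + list(zip(A, b))
        fathomed, changed = False, True
        while changed and not fathomed:      # Step 2, repeated after each augmentation
            changed = False
            for a, rhs in rows:
                fixed = sum(a[j] * v for j, v in partial.items())
                free = [j for j in range(n) if j not in partial]
                low = sum(min(a[j], 0) for j in free)
                if low > rhs - fixed:        # test (9): no improving completion
                    fathomed = True
                    break
                for k in free:               # test (11): implied values
                    if a[k] != 0 and low + abs(a[k]) > rhs - fixed:
                        partial[k] = 0 if a[k] > 0 else 1
                        changed = True
                if changed:
                    break                    # re-test every row with the new values
        if fathomed:
            continue
        if len(partial) == n:                # Step 3: record the completion
            x0 = sum(c[j] * partial[j] for j in range(n))
            best = [partial[j] for j in range(n)]
            continue
        k = next(j for j in range(n) if j not in partial)   # Step 4
        master.append({**partial, k: 0})
        master.append({**partial, k: 1})
    return (x0, best) if best is not None else (None, None)


# A small hypothetical instance with three zero-one variables.
print(partial_enumeration([5, 4, 3], [[2, 3, 1], [4, 1, 2], [3, 4, 2]], [5, 11, 8]))
# (9, [1, 1, 0])
```

The choice of which problem to remove in Step 1 and which variable to select in Step 4 is arbitrary here (last in, first out, and lowest free index); other rules change the amount of branching but not the answer.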

Example. Consider the problem

(12) maximize $3x_1 + 6x_2 + 3x_3 + 6x_4 + 13x_5$
subject to

(13) $-3x_1 - 6x_2 + 6x_3 + 12x_4 + 7x_5 \le 8$ $(i = 1)$

(14) $6x_1 + 12x_2 - 3x_3 - 6x_4 + 7x_5 \le 8$ $(i = 2)$

(15) $x_j = 0$ or $1$ for $j = 1, 2, \ldots, 5$.

This is the example in Sec. 13.5 transformed to include only zero-one variables.

Let $x_0^1 = 0$, corresponding to the feasible solution in which all $x_j = 0$. Assume
that the master list contains Problem 1 with partial solution $x_5 = 1$, and Problem 2
with $x_5 = 0$.

At Step 1, remove Problem 1. A feasible completion having an objective-function
value greater than $x_0^1$ must satisfy (13) through (15) as well as

(16) $-3x_1 - 6x_2 - 3x_3 - 6x_4 - 13x_5 \le -1$ $(i = 0)$.

In the first part of Step 2, you try to find any free variables that must have a
particular value in order to satisfy (13) through (16), given that $x_5 = 1$. Verify that
applying (11) to $i = 1$ and $i = 2$ yields $x_4 = 0$ and $x_2 = 0$. Then, given that
$x_5 = 1$, $x_2 = x_4 = 0$, a second application of (11) to $i = 1$ and $i = 2$ yields
$x_3 = 0$ and $x_1 = 0$. Thus you have obtained an augmented solution containing
all the variables. In the second part of Step 2 you check for feasibility in $i = 0, 1, 2$,
which can be formally accomplished by applying (9). [Since no free variables are
left, the calculation amounts to testing whether the augmented solution is feasible
in (13) through (16). Verify that it is.]

This brings you to Step 3, where you record the completion,

(17) objective function = 13, $x_1 = x_2 = x_3 = x_4 = 0$, $x_5 = 1$.

Let $x_0^2 = 13$, and return to Step 1.

Next remove Problem 2, where ($x_5 = 0$). The constraint (16) is revised to

(18) $-3x_1 - 6x_2 - 3x_3 - 6x_4 - 13x_5 \le -14$ $(i = 0)$.

Applying (11) to $i = 0, 1, 2$ does not result in augmenting the partial solution,
nor does applying (9) to $i = 0, 1, 2$ rule out the possibility of a feasible completion
with an objective-function value greater than 13. Hence, proceed to Step 3
and then to Step 4, since the partial solution is not complete. In Step 4, select the
variable $x_1$ and add to the master list Problem 3, in which ($x_5 = 0$, $x_1 = 1$), and
Problem 4, in which ($x_5 = 0$, $x_1 = 0$).

Returning to Step 1 with $x_0^3 = 13$, remove Problem 3. Verify that applying (11)
to $i = 0, 1$ does not indicate any necessary values for the free variables. Going
on to $i = 2$, check that the test in (11) indicates $x_2 = 0$ for any feasible completion.
Augmenting the solution accordingly and reapplying (11) to $i = 0$, verify
that there is no feasible completion. Therefore, Step 2 is complete, and you return
to Step 1 with $x_0^4 = 13$.

Removing Problem 4, in which ($x_1 = x_5 = 0$), the calculation (11) for $i = 0$
shows that $x_3 = 1$ for any feasible completion. Applying (11) to $i = 1, 2$, you
find that $x_4 = 0$ and $x_2 = 0$. Then testing (9) for $i = 0$, you obtain that there is
no feasible completion having an objective-function value greater than $x_0^4$. Returning
to Step 1, you terminate the iterations as the master list is now empty. The
optimal solution is (17).
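
As an independent check on the example, all $2^5 = 32$ assignments can be enumerated explicitly. Note that this is plain exhaustive enumeration, not the partial enumeration method itself; it is given only as a sketch to confirm that (17) is optimal for (12) through (15).

```python
from itertools import product

c = [3, 6, 3, 6, 13]                     # objective (12)
A = [[-3, -6, 6, 12, 7],                 # constraint (13)
     [6, 12, -3, -6, 7]]                 # constraint (14)
b = [8, 8]


def value(x):
    """Objective-function value of a complete zero-one assignment."""
    return sum(cj * xj for cj, xj in zip(c, x))


def is_feasible(x):
    """True when x satisfies every row of A x <= b."""
    return all(sum(aj * xj for aj, xj in zip(row, x)) <= rhs
               for row, rhs in zip(A, b))


feasible = [x for x in product((0, 1), repeat=len(c)) if is_feasible(x)]
best = max(feasible, key=value)
print(best, value(best))                 # (0, 0, 0, 0, 1) 13
```

Explicit enumeration is practical only because $n = 5$; the point of the fathoming tests is to avoid visiting most of these assignments.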



Figure 13.7. Partial Enumeration Approach. [Diagram not reproduced; it marks the
optimal solution $x_0 = 13$, $x_1 = x_2 = x_3 = x_4 = 0$, found in Problem 1.]

A diagram of the algorithm applied to the example is shown in Fig. 13.7. As
you saw in Problem 3, each time the test (11) results in augmenting the partial
solution, it may be auspicious to repeat the test on all the restrictions ($i = 0,
1, \ldots, m$), employing the newly augmented partial solution.

Concluding remarks. You should note two points of difference between
this method and the Branch-and-Bound Algorithm in Sec. 13.4. First, the method
here requires only additions and subtractions (no multiplications or divisions).
[You can guide the selections in Steps 1 and 4 with information garnered from an
optimal linear programming solution to (1), (2), and the constraints $0 \le x_j \le 1$.]

Second, each partial solution satisfies the integer stipulations, but may not
satisfy the linear inequalities (2), in contrast to the linear programming approach.
By clever selection rules in Steps 1 and 4, the algorithm may determine a feasible
and nearly optimal solution to all the constraints at an early iteration.

If you terminate the iterations short of convergence (before the master list is
empty), you can ascertain what fraction of the $2^n$ possibilities have been
enumerated (explicitly or implicitly). To obtain this fraction, each time you return to
Step 1 from Steps 2 or 3, accumulate the fraction $(\frac{1}{2})^s$, where $s$ equals the number
of variables that were contained in the partial solution at Step 1. In the example
above, after completing Step 3 on Problem 1, you have enumerated $(\frac{1}{2})^1 = \frac{1}{2}$ of all
the $2^5$ possibilities. Then completing Step 2 on Problem 3, you have enumerated
an additional $[(\frac{1}{2})^2 = \frac{1}{4}]$ of the possibilities, or altogether $\frac{3}{4}$ so far. The remaining
$[(\frac{1}{2})^2 = \frac{1}{4}]$ of the possibilities are enumerated when Problem 4 is terminated.
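
The bookkeeping for this fraction is simple enough to sketch with exact rational arithmetic. The function name is an assumption; the sizes 1, 2, 2 are the numbers of variables in Problems 1, 3, and 4 when each was removed at Step 1.

```python
from fractions import Fraction


def enumerated_fraction(sizes):
    """Sum of (1/2)**s over the partial solutions that have been disposed of."""
    return sum(Fraction(1, 2) ** s for s in sizes)


print(enumerated_fraction([1]))        # 1/2  after Problem 1
print(enumerated_fraction([1, 2]))     # 3/4  after Problem 3
print(enumerated_fraction([1, 2, 2]))  # 1    after Problem 4
```

Problem 2 contributes nothing directly, since it was split at Step 4 into Problems 3 and 4 rather than fathomed or completed.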

As you have observed several times already in this chapter, the computational
burden is closely related to the number of integer-valued variables in the problem.
In this method, the burden manifests itself primarily in terms of the number of
problems placed on the master list. Note that additional constraints (especially
composite or surrogate constraints) may actually be beneficial in reducing the
magnitude of the computations. Although you must perform more testing at
Step 2, you may more frequently return directly to Step 1, without adding new
problems to the master list.

Computational experience has demonstrated that this approach (when properly
modified by the addition of sophisticated surrogate constraints and programmed
efficiently on a high-speed computer) can solve certain practical problems with up
to 100 variables and 50 constraints. This size is modest as compared to the
present-day capability for solving linear programming problems having several thousand
variables and constraints. But the state of the art is in rapid flux; it is reasonable
to expect that models containing several hundred integer-valued variables will
be solvable in the near future.

► Efficient computerized versions of the algorithm incorporate a number of sophisticated
modifications, including a streamlined notation for designating a partial solution as well
as a condensed way of representing the master list. In addition, more powerful tests
than (9) and (11) in Step 2, as well as good selection rules in Steps 1 and 4, are usually
employed to avoid excessive branching. In the same vein, Step 3 is sometimes amplified
to test whether you obtain a feasible solution after setting each free variable $x_j = 1$
if $c_j > 0$. If so, this is the best feasible completion, and should be recorded in the standard
way.

The Partial Enumeration Algorithm can be modified as follows to solve the mixed
integer programming problem in which only the variables $x_j$, for $j = 1, 2, \ldots, p$ $(< n)$
must be zero-one, and the remaining variables are restricted by $0 \le x_j \le 1$, for
$j = p + 1, \ldots, n$. A partial solution is restricted to include only the integer-valued variables;
but the free variables consist of all those that are not contained in a partial solution.
The tests (9) and (11) remain valid, except that the test in (11) is to be applied only
to the free integer-valued variables; also $b_0 = -x_0^t$ instead of $-x_0^t - 1$. Finally, the
instructions are altered to read:

"Step 3. If the (augmented) partial solution contains all $p$ integer-valued variables,
solve the resultant linear programming problem for associated optimal values of the
remaining free variables (where the constraints of the linear programming problem are
stated to account for the assigned values of the integer variables in the partial solution).
If there is no feasible solution, let $x_0^{t+1} = x_0^t$, and return to Step 1. Otherwise, record the
obtained optimal solution, let $x_0^{t+1}$ be the associated optimal value of the objective
function, and return to Step 1. If the (augmented) partial solution contains fewer than $p$
integer-valued variables, proceed to Step 4."

The algorithms in this section and Sec. 13.4 are easily adapted to solve, at least in
principle, certain optimization models that contain continuous and integer-valued
variables along with a nonlinear objective function and nonlinear constraints. To
illustrate, consider the problem

(i) maximize $c(x_1, x_2, \ldots, x_n)$
subject to

(ii) $a_i(x_1, x_2, \ldots, x_n) \le 0$ for $i = 1, 2, \ldots, m$

(iii) $x_j$ integer-valued and $L_j \le x_j \le U_j$ for $j = 1, 2, \ldots, p$ $(\le n)$

(iv) $x_j \ge 0$ for $j = p + 1, \ldots, n$ (if $p < n$),

where the functions in (i) and (ii) may be nonlinear. In Chaps. 14 and 15 you will
study how to solve such problems without integer stipulations. If the nonlinear functions
satisfy the shape postulates given for the various methods in Chaps. 14 and 15, the
integer constraints can be accommodated by a straightforward modification of the
algorithm in Sec. 13.4. Specifically, you find an optimal solution to (i) through (iv) by
applying the algorithms in Chaps. 14 and 15 to solve a sequence of nonlinear programming
problems, each containing only continuous variables, and which differ as to the bounds
on the integer variables.

Suppose next that (iii) and (iv) are modified to

(v) $x_j = 0$ or $1$ for $j = 1, 2, \ldots, n$

and that each function $a_i(x_1, x_2, \ldots, x_n)$ can be written in the form

(vi) $a_i(x_1, x_2, \ldots, x_n) = f_i(x_1, x_2, \ldots, x_n) - g_i(x_1, x_2, \ldots, x_n)$,

where all $f_i(x_1, x_2, \ldots, x_n)$ and $g_i(x_1, x_2, \ldots, x_n)$ are nondecreasing monotone functions
[that is, $f_i(x_1, x_2, \ldots, x_n) \ge f_i(y_1, y_2, \ldots, y_n)$ if $x_j \ge y_j$ for all $j$, and likewise for each
function $g_i$]. Then the algorithm in this section is directly applicable. Only the tests at
Step 2 require modification.

To illustrate, let $F_i$ be that value of $f_i(x_1, x_2, \ldots, x_n)$ where each variable in the
partial solution has its assigned value, and every free variable is set equal to zero;
similarly, let $G_i$ be the value of $g_i(x_1, x_2, \ldots, x_n)$ where each variable in the partial
solution has its assigned value and every free variable is set equal to one. Verify that
there is no feasible completion if

(vii) $F_i - G_i > 0$ for any $i$,

where $i = 1, 2, \ldots, m$. This test reduces to (9) when an $a_i(x_1, x_2, \ldots, x_n)$ is linear.
If the objective function $c(x_1, x_2, \ldots, x_n)$ can also be split into the difference of two
nondecreasing monotone functions,

(viii) $c(x_1, x_2, \ldots, x_n) = -f_0(x_1, x_2, \ldots, x_n) + g_0(x_1, x_2, \ldots, x_n)$,

then the test in (vii) can be extended to include

(ix) $F_0 - G_0 + (x_0^t + 1) > 0$ $(i = 0)$.

Let $F_{ik}(x_k)$ be the function derived from $f_i(x_1, x_2, \ldots, x_n)$ by letting each variable in
the partial solution have its assigned value and letting every free variable, except $x_k$,
equal zero. Similarly, let $G_{ik}(x_k)$ be derived from $g_i(x_1, x_2, \ldots, x_n)$ by letting each
variable in the partial solution have its assigned value and every free variable, except
$x_k$, equal one. Verify that for any free variable $x_k$,

(x) if $F_{ik}(0) - G_{ik}(0) > 0$, then $x_k = 1$; if $F_{ik}(1) - G_{ik}(1) > 0$, then $x_k = 0$, for any $i$,

to yield a feasible completion. When (viii) is valid, then (x) can be extended to include
$i = 0$, where the constant $(x_0^t + 1)$ is added to the left-hand sides, as in (ix). The test in
(x) simplifies to (11) when an $a_i(x_1, x_2, \ldots, x_n)$ is linear. ◄
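
Test (vii) can be sketched for a single nonlinear constraint. The constraint below, $3x_1x_2 + 2x_3 - (x_3x_4 + 2x_4) \le 0$, is a hypothetical illustration chosen so that both parts are nondecreasing on zero-one values; the function name is likewise an assumption.

```python
def fathom_by_vii(f, g, n, partial):
    """Test (vii): True when F - G > 0, so no completion has f(x) - g(x) <= 0.

    `partial` maps a 1-based variable index to its assigned value (0 or 1).
    """
    lo = [partial.get(j, 0) for j in range(1, n + 1)]   # free variables set to zero
    hi = [partial.get(j, 1) for j in range(1, n + 1)]   # free variables set to one
    return f(lo) - g(hi) > 0


# Hypothetical constraint a(x) = f(x) - g(x) <= 0 with nondecreasing f and g.
f = lambda x: 3 * x[0] * x[1] + 2 * x[2]
g = lambda x: x[2] * x[3] + 2 * x[3]

print(fathom_by_vii(f, g, 4, {1: 1, 2: 1}))        # False: F = 3, G = 3
print(fathom_by_vii(f, g, 4, {1: 1, 2: 1, 3: 1}))  # True:  F = 5, G = 3
```

Because $f$ is smallest and $g$ is largest at these extreme settings of the free variables, $F - G$ is the smallest value the constraint function can take over all completions.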

REVIEW EXERCISES 

1 (a) Calculate the number of tours in the traveling salesman problem for 3 cities. 
For 5 cities. 

(b) Consider the traveling salesman problem with five cities, but assume that the 
salesman must return to City 1 after visiting two cities and then set out again for 
the remaining two cities. Calculate the number of possible sequences. 

(c) Calculate the number of possible sequences in the machine scheduling problem 
with three items and one machine. With three items and two machines. With 
two items and three machines. With two items and four machines. Show how your 
answers change if the items must be processed in the same sequence on each 
machine. 

(d) Consider a one-period product-mix selection model with activities $x_j$, $j =
1, 2, \ldots, n$, each of which incurs a setup cost $K_j$. Calculate how many different
patterns of setups there are (assume all $x_j = 0$ is one of the possibilities).

(e) Consider the constraint $x_1 + x_2 + x_3 + \cdots + x_n = N$, where each $x_j$ must be
a nonnegative integer. Calculate the number of different feasible solutions if
$n = 6$ and $N = 1$. If $n = 7$ and $N = 1$. If $n = 6$ and $N = 2$. If $n = 6$ and $N = 6$.
If $n = 7$ and $N = 6$. If $n = 6$ and $N = 6$ and the restriction is stated as an
inequality ($\le$). If $n = 6$ and $N = 1$, and the restriction is stated as an inequality
($\le$). If $n = 6$ and $N = 2$, and the restriction is stated as an inequality ($\le$).

(f) Consider a capital budgeting problem in which there are three categories of
projects. Category 1 contains eight projects, and two are to be selected. Category
2 contains ten projects, and four are to be selected. Category 3 contains seven
projects and five are to be selected. Calculate the total number of distinct selection
combinations that are possible. Show how this number changes if the specified
number for each category is only an upper limit on the number that can be
selected.

(g) A company wants to schedule three trucks to visit eight stores. Each truck can 
stop at one or more stores, but no store is to be visited by more than one truck. 
Calculate the number of different ways to assign the trucks to the stores. 

(h) A firm performs preventive maintenance on four major pieces of equipment. 
Such work can start during any one of three weeks on Machine 1, during any 
one of six weeks on Machine 2, during any one of two weeks on Machine 3, and 
in any one of five weeks on Machine 4. Calculate the total number of distinctly 
different maintenance schedules (that is, “starting weeks” for each of the four 
machines). 

(i) A corporation is working out a strategy for introducing three new products into 
four marketing areas. The strategy is strictly “sequential” in that the company 
first selects one area, then selects the sequence in which to introduce the products 
(one every two weeks), then afterwards selects the next area, followed by a chosen
sequence, etc. Calculate the number of different strategies for introducing the new 
products. 

(j) A company plans to build six new plants on sites that have already been selected. 
It wishes to accomplish this expansion over the next five years. Conceivably, it 
could build all the plants in Year 1 or wait to build all the plants in Year 5. 
More likely, it will build some plants each year. Determine the number of different 
patterns for building these plants. The company is also concerned with the number 
of plants it builds each year (in addition to which plants it builds each year). 
Determine all the possible patterns for the numbers of plants built over the five 
years (such as six in Year 1 and none in any other year, or six in Year 5 and 
none in any other year, etc.). 

2 Consider the problem 

maximize 3xi -f- 6 x 2 -f 

subject to 

Xi -f- 2^2 + 2x3 ^ 2| 

Xj -f" 2x2 H" 3x3 ^ 2j 
every $x_j$ a nonnegative integer.

(a) What is an optimal solution if the integer-value stipulation on every $x_j$ is dropped?
Is there more than one optimal solution?

(b) Determine by inspection an optimal integer-valued solution. Does your answer
in part (a) round to this solution?

(c) Draw a solution space representation of the problem, letting $x_3 = 0$ (that is,
draw the implied constraints on $x_1$ and $x_2$). Indicate all the feasible integer-valued
solutions.

3 Suppose a company has five manufacturing plants that ship to 100 warehouses. If
the company continues to operate all of its plants and warehouses, it faces a cost
minimization problem that can be characterized by a standard transportation problem,
like that described in the beginning of Chap. 6.

(a) Suppose, however, that the total plant capacity is considerably more than is
required at the warehouses, and each Plant $i$ incurs a large yearly overhead cost
$P_i$. Suggest a practical way to determine which plants, if any, ought to be shut
down to minimize the sum of annual overhead and transportation costs.

(b) Suppose, instead, that the company wants to shut down some warehouses (the
associated demand requirements would be reallocated to the warehouses that
remain open). The company thereby saves overhead costs of operating some of
the warehouses, but increases transportation costs as a result. Would your
suggestion in part (a) be practical for this problem? Explain.

4 Goforth Manufacturing Company (Sec. 13.2). Consider the following data:

$m = 3$   $n = 4$   $S_1 = 40$   $S_2 = 50$   $S_3 = 60$
$D_1 = 10$   $D_2 = 15$   $D_3 = 20$   $D_4 = 25$.

(a) Write in full detail all the linear constraints in the problem.

(b) Explain why (7) and (8) together ensure that^* and Zij are each cither 0 or 1. 

(c) Explain why it is never essential for optimality to have = 1 and Xij = 0. 

*(d) Suppose allF< = 100, = l./='o = 0 . What is an optimal solution if the integer- 

value stipulations are dropped? Are there alternative optimal solutions?

(e) Explain why $U_{ij} = \text{minimum}(S_i, D_j)$ is sufficiently large in (9).


5 Consider the plant location model described in Sec. 13.2. 

(a) If every = 0, would it be optimal to let every = 1 ? Explain. 

(b) Assume every= 1 and moreover that the sum of the supplies Si equals the sum 

of the demands $D_j$. Will an optimal solution necessarily use $m + n - 1$ routes?

If every $F_{ij} = F$ (a constant), is an optimal solution the same as the one for the
standard transportation problem in which the $F_{ij}$ are not present ($F = 0$)? (If
you answer "yes" to a question, provide a supporting argument; if your answer
is "no," provide a small example showing why.)


6 Fixed Charge Problem. Consider the model

minimize $\sum_{j=1}^{n} C_j(x_j)$

subject to 

for j=l,2,. 
j-i 

every $x_j \ge 0$,

where

$C_j(x_j) = 0$ for $x_j = 0$, and $C_j(x_j) = K_j + c_j x_j$ for $x_j > 0$,

and each $K_j \ge 0$. The $K_j$ are called fixed charges.

(a) Show how to formulate this problem as a mixed integer programming model.

*(b) Discuss the difficulty that can arise if the objective is to maximize instead of
minimize (or if not all $K_j \ge 0$). (Hint: how small can $x_j$ be, given that it is
positive?)

(c) Suppose all $K_j = K > 0$. Give a sufficient condition such that an optimal
solution is the same as one for the linear programming problem in which the $K_j$ are
not present (all $K_j = 0$).

(d) Show how to alter your formulation if the constraints are of the form

$\sum_{j=1}^{n} a_{ij}(x_j) \le b_i$ for $i = 1, 2, \ldots, m$,

where

$a_{ij}(x_j) = 0$ for $x_j = 0$, and $a_{ij}(x_j) = A_{ij} + a_{ij} x_j$ for $x_j > 0$,

with every $A_{ij} \ge 0$.

*(e) Discuss the difficulty that can arise in part (d) if an $A_{ij} < 0$, or if an inequality
for a linear constraint is ($\ge$).

7 Consider a product-mix selection problem, like that in Sec. 2.2, which is of the form

maximize $\sum_{j=1}^{n} c_j x_j$

subject to

$\sum_{j=1}^{n} a_{ij} x_j \le b_i$ for $i = 1, 2, \ldots, m$

every $x_j \ge 0$,

where every Cj > 0 and n'^m, 

(a) Suppose the firm wants to produce no more than $k$ different products. Show how
to formulate this stipulation in terms of integer-valued variables.

(b) What difficulties can arise if the firm wants to produce at least $k$ different products?
(Hint: how small can $x_j$ be if it is positive?)

8 Solution Space Representation. In each part below, graph the region of feasible solutions
for the set of alternative constraints. Assume $x_1 \ge 0$ and $x_2 \ge 0$.

(a) Either Xj -|- 2x2 ^ 1, or 2xi + Xj < 1, or both. 

(b) Either Xi + X 2 ^ 1, or Xj -f X 2 ^ 2, or both. 

(c) Either the pair of constraints Xj ^ 1 and X 2 < 1 holds, or the pair Xi > 1 and 
X 2 ^ 1 holds, or both pairs hold. 

(d) At least one of the following constraints must be satisfied: Xi ^ 1, x. < 1, and 

+ ^2 ^ 2. At least two must hold. All three must hold. 


9 Consider the approach for representing alternative constraints as described by (12),
(13), and (14) in Sec. 13.2. In each part below, write in full detail the representation
(13) and (14). (Hint: calculate a value for $U_i$ by examining the constraints; let $U_i$
be the smallest value that will work.)

(a) $x_1 + 2x_2 + x_3 \le 8$
$4x_1 + x_2 + x_3 \le 20$
$x_1 + x_2 + 3x_3 \le 15$.

Show the formulation for $k = 1$ and $k = 2$.

(b) Same constraints as in part (a) augmented by X 2 — ^ 0. Show the formulation
for $k = 3$.

(c) In parts (a) and (b) explain in detail why the representation and the stipulation
that each $y_i$ be either 0 or 1 solve the problem (that is, ensure that at least $k$ out
of the $p$ constraints are satisfied).

(d) Explain why (13) can just as well be stated as an equality.

*(e) Discuss the difficulty in ensuring that exactly $k$ out of $p$ linear constraints (12) are
satisfied.

10 In each part below, show how to formulate the problem in terms of an integer
programming model.

n 

(a) maximize subject to 

n 

S aijX, ^ it for «■ = 1, 2,.. ., m 
every ^ 0 

and either the three constraints Xj < 1, a *2 < 1, and + Xo ^ 1.5 hold, or the 
three constraints Xi^ \,X 2 > 1, and Xi + X 2 '^ 2.5 hold. 

(b) maximize [maximum (xj -f .^ 2 , 2 xi -h X 2 , Xi + 2 ^: 2 )] subject to 

n 

OuXj > it for t = 1, 2,.. ., m 
every xj > 0 

(that is, find values for xj that satisfy the constraints and make as large as possible 
the maximum value for the three designated linear functions). 

(c) How would you go about finding numerical solutions to the problems in parts 

(a) and (b)? 

11 Exercise 1, part (j), describes a company that plans to start building six new plants 
on sites that have already been selected, and wishes to complete construction within 
five years. 

(a) Characterize the selection problem in terms of linear constraints involving
integer-valued variables. (Be sure to define your symbols and explain the meaning of each
restriction.) In each part below, show how the stated stipulation can be
encompassed by an integer programming formulation. Consider each of the parts
separately.

(b) Plants 1, 2, and 3 must be started no later than Year 3. 

(c) No more than two plants can be started in any year. 

(d) No more than three plants can be started in the first two years, and no more than 
five plants in the first four years. 

(e) Exactly three plants must be started in the first two years, and five plants in the 
first four years. 

(f) Plant 2 cannot be started before Plant 1 (they can be started in the same year). 

(g) Plants 4, 5, and 6 cannot be started before Plants 1, 2, and 3 are started (all six 
plants can be started in the same year). 

*(h) Either Plants 1, 2, 3 must be started before Plants 4, 5, 6, or vice versa (all six
plants cannot be started in the same year).

(i) Plants 1 and 2 must be started in the same year. Similarly, Plants 3, 4, and 5
must be started in the same year.

(j) If Plants 1 and 2 are started in the same year, then no other plants can be started 
in that year. 

(k) In Year 1, the company starts either Plants 1 and 2 or Plants 3 and 4. (Assume 
one of these two possibilities must occur.) 

(l) In the first two years, the company starts either Plants 1 and 2 or Plants 3 and 4. 
(Assume one of these two possibilities must occur.) 

(m) Plant 1 can be started in Year 1 only if either Plant 2 or Plant 3 is started, but 
not if both Plants 2 and 3 are started. 

(n) Suppose the company has two possible sites for Plant 1 (instead of a single site 
already selected). Show how to modify your previous formulations to account for 
this site selection problem. 

(o) Suppose the company has only five sites, instead of six, and Plants 1 and 3 compete
for the same site; that is, the company plans to build only Plant 1 or Plant 3 on
a specified site. Show how to modify your previous formulations to account for
this site selection problem. (You may find it necessary to reinterpret the
assumptions in some of the parts to preserve consistency; make explicit mention when
this happens.)

12 Consider a traveling salesman problem with n = 4 cities. 

(a) Using the formulation (19) through (23) in Sec. 13.2, write all the linear
constraints in full detail.

(b) Show how the tour City 1 to City 3 to City 2 to City 4 to City 1 leads to values
for $x_{ij}$ and $u_i$ that satisfy all the restrictions in part (a).

(c) Show that a subtour City 3 to City 2 to City 3 cannot lead to values for $x_{ij}$ and
$u_i$ that satisfy all the restrictions in part (a). Do the same for the subtour City 1
to City 2 to City 3 to City 1.


*13 Consider the problem 
subject to 


maximize 3x + 7y 
2x+ ^:^25 


x + 2y<6, 

where $y \ge 0$, and $x$ can equal only the values 0, 1, 4, and 6.

(a) Formulate the problem in terms of an equivalent integer programming model. 

(b) Suppose the objective function contained the terms 3x*, instead of 3x. Revise 
your answer to part (a) accordingly. 


*14 Consider a problem with the linear constraints in exercise 13, where and 

^ 0, but assume that the objective function is to maximize c{x) -}- 7y, where 


Ax 

for 


14- 3x 

for 


5 

for 

3 ^ X < 3.5 

-2 + 4x 

for 

3.5 ^ ^ 5 

-22 + 8x 

for

(a) Plot the function $c(x)$.

(b) Formulate the problem in terms of an equivalent integer programming model, 
using the approach in (24) through (30) of Sec. 13.2. 

(c) Explain in detail why the restrictions (29) and (30) ensure that exactly one 
equals 1, thereby guaranteeing in (28) that no more than two of the weights 
wii and can be positive, and that they must sum to 1 according to (25). 
Use the formulation in part (b) to illustrate your reasoning. 

♦(d) Ignore the constraints (28), (29), and (30), and determine an optimal solution. 
Is this solution truly optimal for the original nonlinear problem? Explain. 

(e) Show how the formulation in part (b) changes if the $c(x)$ function over the
interval $0 \le x \le 2$ is $4x - 2$, instead of $4x$. (Hint: plot the function before giving
an algebraic formulation.)

15 As in Sec. 13.4, let $[d]$ designate the largest integer that is less than or equal to the
real number $d$.

(a) What are the values of

[12.8] [12] [-12.8] [-12]

[0.8] [0] [-0.8]

[3|] [3] [|] [-§]. [-3] [-3|]. 

(b) What are the fractional parts of the numbers in brackets in part (a) ? 

16 (a) Explain in detail what justifies, in Sec. 13.4, writing the inequalities in (10) and 

(11), the equality in (12), and making the stipulation that x in (12) must be 
integer-valued. 

(b) Show the algebraic steps that justify the statement that subtracting (8) from (12) 
yields (14). 

17 Consider the numerical illustration in Sec. 13.4. Suppose the right-hand-side constant 

in (2) is 5, instead of 3. 

(a) What is an optimal linear programming solution (ignoring the integer-value 
stipulations) ? 

(b) Use the cutting-plane idea (in an ad hoc fashion) to derive an optimal
integer-valued solution.

18 Consider the problem 

maximize Xj -|- 2 x 2 

subject to 

5xi + 7x2<21 

—Xi + 3X2 ^ 8 

Xi and X 2 nonnegative integers. 

(a) What is an optimal linear programming solution (ignoring the integer-value 
stipulations) ? 

(b) Use the cutting-plane idea (in an ad hoc fashion) to derive an optimal integer¬ 
valued solution. 

19 In solving the numerical example (20), (21), and (22) of Sec. 13.4 by the Cutting-Plane 
Algorithm, verify that 

(a) The restriction (14) applied to Row 1 in (24) leads to the constraint (25). 

(b) The linear programming model (24) augmented by (25) has the optimal solution 
displayed in (26). 

(c) A cut formed from Row 2 in (26) is that given in (27). 

(d) The optimal linear programming solution to (26) and (27) is the one displayed 
in (28). 

(e) Justify, verbally, that the solution in (28) is optimal for the original problem (20), 
(21), and (22). 

20 Apply the Cutting-Plane Algorithm to the numerical example (20), (21), and (22) 
in Sec. 13.4, where you let 

(a) The right-hand-side constant in (21) be 15, instead of 13. 

(b) The coefficient of $x_1$ in (21) be 6, instead of 7. 

*21 The special material at the end of Sec. 13.4 describes how to express cuts in terms of 
the original variables and how to show graphically the progress of the algorithm. 
Perform this analysis of the cuts and draw the associated graphical representation 
for exercise 20, part (a). Part (b). 

22 Study the application of the Branch-and-Bound Algorithm to the example (8) and 
(9) in Sec. 13.5. Trace in detail the calculations at each iteration. Specifically, verify 
optimal solutions in (11), (13), (15), (17), (19), and (20). Check how the constraints 
in each of the nine problems are generated. Verify that Problems 2, 6, and 8 have no 
feasible solution. Explain in detail the rationale permitting the calculations to stop 
with Problem 5 and the logic behind asserting that (19) is the optimal solution. 

23 Consider the example in Sec. 13.5. Suppose at iteration 4 you select Problem 5 instead 
of Problem 4. Complete the steps of the Branch-and-Bound Algorithm with this 
choice. Construct a tree analogous to that in Fig. 13.1 to show the progress of the 
iterations. 

24 Consider the example in Sec. 13.5, where the right-hand-side constants in (9) are 
both 22, instead of 8. Apply the Branch-and-Bound Algorithm. Construct a tree 
analogous to that in Fig. 13.1 to show the progress of the iterations. 

25 Consider how the traveling salesman problem example in Sec. 13.6 is solved by the 
method of excluded subtours. Suppose, at the first iteration, you add three problems 
associated with the subtour $x_{23} = x_{34} = x_{42} = 1$, instead of with the subtour $x_{15} = x_{51} = 1$. 
Carry out the remainder of the algorithm, and construct a tree analogous to 
that in Fig. 13.3 to show the progress of the iterations. 

26 Consider how the traveling salesman problem example in Sec. 13.6 is solved by the 
method of designated routes. The rule that was used for Step 1 was to select the highest- 
numbered problem that remained on the list. Hence, Problem 2 was selected at 
iteration 1. Carry out the algorithm using at Step 1 the rule of selecting the lowest- 
numbered problem that remains on the list. (Hence, at iteration 1, begin with Problem 
1.) Employ the same convention at Step 4 of assigning the larger number to the problem 
with = 1. Construct a tree analogous to that in Fig. 13.5 to show the progress of 
the iterations. 

27 Apply the method of designated routes to the traveling salesman problem example in 
Sec. 13.6, where you let Problem 1 designate $c_{15} = \infty$, and Problem 2 designate $x_{15} = 1$. 
(Use the same selection rule at Step 1 and numbering rule at Step 4 as in Fig. 13.5.) 
Construct a tree analogous to that in Fig. 13.5 to show the progress of the iterations. 

28 Perform the analysis in exercise 27, where Problems 1 and 2 refer to $c_{51} = \infty$ and 
$x_{51} = 1$, respectively. 

29 Consider how the traveling salesman problem example in Sec. 13.6 is solved by the 
method of partial tours. Given that a tour consists of a single cycle through all the cities, 
any one of the cities can be designated as the “starting point.” Assume that the tour 
starts at City 2 and apply the method. Construct a tree analogous to that in Fig. 13.6 
to show the progress of the iterations. 

30 Suppose you know that an integer-valued variable x has an optimal solution value 
in the interval $0 \le x \le U$, where U is an integer. Indicate a way to represent x, 
analogous to that in (4) of Sec. 13.7, such that the largest possible value for this 
representation is exactly U. 

31 Consider an integer programming model like that described at the beginning of 
Sec. 13.7. Suppose one of the linear constraints (2) is 

$x_1 - x_2 + 7x_3 - 10x_4 + 3x_5 - 6x_6 \le 0$. 

In each part below, apply the tests (9) and (11) to discover whether there is no 
feasible completion for the indicated partial solution, or whether one or more variables 
must take on a determinable value. 

(a) $x_4 = x_6 = 0$. 

(b) $x_1 = x_3 = 1$ and $x_4 = 0$. 

(c) $x_1 = x_3 = 1$ and $x_6 = 0$. 

(d) $x_1 = x_3 = x_5 = 1$. 

(e) $x_1 = x_3 = x_5 = 1$ and $x_6 = 0$. 

(f) $x_3 = 1$ and $x_2 = x_6 = 0$. 


32 Consider an integer programming model like that described at the beginning of 
Sec. 13.7. Suppose the values of variables Xq, xj, . . ., x„ are fixed in a partial solution 
and the remaining terms in the objective function to be maximized are 
$2x_1 - 3x_2 + 6x_3 - 7x_4 + 10x_5$; assume that xJ = 7. In each part below, apply the 
tests in (9) and (11) to discover whether the indicated augmentation of the partial 
solution can lead to a complete solution that is an improvement, and whether one or 
more variables must take on a determinable value in order to improve on the value 

4 = 7 . 

(a) Xi = 1. 

(b) X, = 0 . 

(c) X, = Xj = 1. 

33 Consider the application of the Partial Enumeration Algorithm to the example (12) 
through (15) in Sec. 13.7. Verify all the intermediate calculations at each step. Show 
all the tests in detail. 

34 Consider the example (12) through (15) in Sec. 13.7. Apply the Partial Enumeration 
Algorithm, where Problem 1 is the partial solution $x_1 = 1$ and Problem 2 is $x_1 = 0$. 
Let Xq = 0, corresponding to all xj = 0. Construct a tree like that in Fig. 13.7 to 
show the progress of the iterations. 


35 Explain your understanding of the following terms: 


divisibility assumption 
integer (diophantine, discrete) programming problem 
pure integer programming problem 
mixed integer programming problem 
combinatorial optimization problem 
lattice point 
zero-one variable 
alternative constraints 
“go-no-go” alternatives 
traveling salesman problem 
tours, subtours, and partial tours 
Cutting-Plane Algorithm (Method of Integer Forms) 
backtrack algorithm 
*convex hull 
*convex combination 
integer part 
fractional part 
cut 
*massive degeneracy 
Branch-and-Bound Algorithm (Branch-and-Prune) 
Partial (Implicit) Enumeration Algorithm 
Additive Algorithm 
binary representation 
partial solution 
free variables 
completion 
fathomed partial solution 
composite (or surrogate) constraint. 

36 Knapsack Problem. Consider the following illustration of a knapsack problem, as 
described in exercises 22 and 47 of Chap. 10: 

maximize $60x_1 + 60x_2 + 40x_3 + 10x_4 + 20x_5 + 10x_6 + 3x_7$ 

subject to 

$3x_1 + 5x_2 + 4x_3 + 1x_4 + 4x_5 + 3x_6 + 1x_7 \le 10$ 

every $x_j = 0, 1$. 

(a) Solve by the Cutting-Plane Algorithm in Sec. 13.4. 

(b) Solve by the Branch-and-Bound Algorithm in Sec. 13.5. 

(c) Solve by the Partial Enumeration Algorithm in Sec. 13.7. 

*(d) Solve by a dynamic programming recursion. 
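
For part (d), one common recursion works over the remaining capacity, considering the items one at a time. The Python sketch below is one way to organize it and is offered as a check on hand calculations rather than as the text's own procedure. The function name `knapsack` is hypothetical, and the data lines assume the coefficients read 60, 60, 40, 10, 20, 10, 3 in the objective and 3, 5, 4, 1, 4, 3, 1 in the constraint, with a capacity of 10.

```python
def knapsack(values, weights, capacity):
    """Zero-one knapsack by dynamic programming over the capacity."""
    # best[w] = largest value attainable with total weight at most w
    best = [0] * (capacity + 1)
    chosen = [[] for _ in range(capacity + 1)]
    for j, (v, wt) in enumerate(zip(values, weights), start=1):
        # scan capacities downward so that each item is used at most once
        for w in range(capacity, wt - 1, -1):
            if best[w - wt] + v > best[w]:
                best[w] = best[w - wt] + v
                chosen[w] = chosen[w - wt] + [j]
    return best[capacity], chosen[capacity]

# Data as assumed above (item j corresponds to variable x_j).
values = [60, 60, 40, 10, 20, 10, 3]
weights = [3, 5, 4, 1, 4, 3, 1]
print(knapsack(values, weights, 10))
```

The same optimal value should emerge from parts (a), (b), and (c), which makes the recursion a convenient cross-check.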

37 Consider the argument in Sec. 13.4 establishing the validity of the cut (11) and (12), 
given the equation (8). 

(a) Derive a cut analogous to (11) and (12) when (8) is multiplied by a constant 

(P ^ 0). 

(b) Suppose (8) is 

2^^:, + 31^2 = 

What cut inequality do you obtain if p = 1? 11^ = ^? Ifp = 2? If/^ = 2J? 
If/> = 3}? Are all of these cuts equally strong? Explain. 

(c) Let the symbol designate the smallest integer greater than or equal to the 

real number d. Show that from (8) you can derive ^ 

38 Apply the Cutting-Plane Algorithm in Sec. 13.4 to solve each of the following problems, 
assuming here that all the variables must be nonnegative integers. 

(a) One-Potato, Two-Potato Problem in Sec. 1.6. (Note: let the coefficients of the 
slack variables be .1, so that the slack variables are also integer-valued.) 

(b) Knox Mix Company Problem, described in Sec. 2.2 and solved by the simplex 
method in Sec. 4.4. 

*(c) Giant Electric Company, described in the special material at the end of Sec. 10.3. 
Use the data in exercise 30 of Chap. 10. 

(d) Exercise 10, parts (b), (c), (f), and (g), of Chap. 3. 

(e) Exercise 20, part (a), of Chap. 4. 

(f) Example (8) and (9) in Sec. 13.5. 

(g) maximize $x_1 + x_2$ subject to 

$4x_1 - x_2 \le 10$ 
$2x_1 + 5x_2 \le 10$ 
$4x_1 - 3x_2 \le 6$. 

(h) maximize $10x_1 - x_2$ subject to 

$-25x_1 + 22x_2 \le 33$ 

$20x_1 - 5x_2 \le 24$. 

*(i) In parts (a), (d), (g), and (h), express the cuts in terms of the original variables, 
using the approach described in the special material at the end of Sec. 13.4. 
Show the progress of the algorithm on a solution space diagram. 

39 Apply the Branch-and-Bound Algorithm in Sec. 13.5 to solve each of the following 
problems, assuming here that all the variables must be nonnegative integers. Construct 
a tree diagram like that in Fig. 13.1 to show the progress of the algorithm. (You may 
want to employ the Upper-Bounded Variables Algorithm in Sec. 5.10 on the larger 
problems.) 

(a) One-Potato, Two-Potato Problem in Sec. 1.6. 

(b) Knox Mix Company Problem, described in Sec. 2.2 and solved by the simplex 
method in Sec. 4.4. 

(c) Exercise 18, part (a), of Chap. 10. 

*(d) Exercise 18 of Chap. 10, with the right-hand-side constant in the first equation 
equal to 2, instead of 3. 

*(e) Exercise 18 of Chap. 10, with the right-hand-side constant in the second equation 
equal to 3, instead of 4. Equal to 5, instead of 4. 

*(f) Exercise 18 of Chap. 10, with the right-hand-side constant in the first equation 
equal to 2, instead of 3, and in the second equation equal to 3, instead of 4. 

*(g) Giant Electric Company, described in the special material at the end of Sec. 10.3. 
Use the data in exercise 30 of Chap. 10. 

(h) Exercise 20, part (a), of Chap. 4. 

(i) Example (20), (21), and (22) in Sec. 13.4. 

40 Apply the Partial Enumeration Algorithm in Sec. 13.7 to solve each of the following 
problems. Unless the problem already is stated in terms of zero-one variables, convert 
the model into an equivalent problem with all zero-one variables before applying the 
algorithm. Construct a tree diagram like that in Fig. 13.7 to show the progress of the 
algorithm. 

(a) The following modified version of the Knox Mix Company Problem, described 
in Sec. 2.2 and solved by the simplex method in Sec. 4.4: 

maximize $4x_1 + 28x_2 + 5x_3 + 9x_4 + 63x_5 + 11x_6$ 

subject to 

$1x_1 + 7x_2 + 1x_3 + 1x_4 + 7x_5 + 1x_6 \le 15$ 

$7x_1 + 49x_2 + 5x_3 + 3x_4 + 21x_5 + 2x_6 \le 120$ 

$3x_1 + 21x_2 + 5x_3 + 10x_4 + 70x_5 + 15x_6 \le 100$ 

every $x_j = 0, 1$. 

(b) Exercise 18, part (a), of Chap. 10. 

(c) Exercise 18 of Chap. 10, with the right-hand-side constant in the first equation 
equal to 2, instead of 3. 

(d) Exercise 18 of Chap. 10, with the right-hand-side constant in the second equation 
equal to 3, instead of 4. Equal to 5, instead of 4. 

(e) Exercise 18 of Chap. 10, with the right-hand-side constant in the first equation 
equal to 2, instead of 3, and in the second equation equal to 3, instead of 4. 

*(f) Giant Electric Company, described in the special material at the end of Sec. 10.3. 
Use the data in exercise 30 of Chap. 10. 

(g) Exercise 20, part (a), of Chap. 4. 

(h) Example (20), (21), and (22) in Sec. 13.4. 

(i) Exercise 38, part (g). 

(j) Exercise 38, part (h). (Assume $0 \le x_j \le 4$.) 

(k) maximize — 4xi + 6 x 2 + x^ ^ 8 x 4 -f- Ixj subject to 

— IXi 2x2 ^ ""2 

—4xi -p 5x2 4- 1^3 — 3x4 -f- 1 x 5 ^ —2 

2xi — 3x2 — 2x3 4- 6x4 — 2x5 ^ 1 

every xj = 0, 1. 

41 Consider the traveling salesman problem given by the following data: 

$c_{12} = 20$   $c_{13} = 4$   $c_{14} = 10$ 

$c_{23} = 5$   $c_{25} = 10$ 

$c_{34} = 6$   $c_{35} = 6$ 

$c_{45} = 20$, 

where $c_{ij} = c_{ji}$, and there is no route between Cities i and j if a value for $c_{ij}$ is not 
shown above. Solve the problem by the 

(a) Method of excluded subtours. 

(b) Method of designated routes. 

(c) Method of partial tours. 
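
With only five cities, the result of any of the three methods in exercise 41 can be confirmed by complete enumeration. The Python sketch below is such a check and is not one of the methods of Sec. 13.6. The function names are hypothetical, and the cost table assumes the data read $c_{12} = 20$, $c_{13} = 4$, $c_{14} = 10$, $c_{23} = 5$, $c_{25} = 10$, $c_{34} = 6$, $c_{35} = 6$, $c_{45} = 20$, with no route for any pair not listed.

```python
from itertools import permutations

# Symmetric costs as assumed above; pairs not listed have no route.
COST = {(1, 2): 20, (1, 3): 4, (1, 4): 10, (2, 3): 5, (2, 5): 10,
        (3, 4): 6, (3, 5): 6, (4, 5): 20}

def leg(i, j):
    """Cost of travelling between Cities i and j, or None if no route exists."""
    return COST.get((min(i, j), max(i, j)))

def cheapest_tour(n=5):
    """Enumerate every tour that starts and ends at City 1."""
    best = None
    for order in permutations(range(2, n + 1)):
        tour = (1,) + order + (1,)
        legs = [leg(a, b) for a, b in zip(tour, tour[1:])]
        if None in legs:
            continue  # this tour would use a missing route
        total = sum(legs)
        if best is None or total < best[0]:
            best = (total, tour)
    return best  # (total cost, tour as a sequence of cities)

print(cheapest_tour())
```

Because the costs are symmetric, each tour appears twice in the enumeration, once in each direction; the sketch simply keeps the first one found.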


*42 Apply the Implicit Enumeration Algorithm to solve the following nonlinear zero-one 
problem. Use the approach given at the end of Sec. 13.7. 

maximize 6x, — 2xJ -f X|X 2 — XiXz + 2 xjX 3 — 5;«^ — ^ 2 ^ + ^^3 + ^0-^4 
subject to 

SxJ — 8x1X2 + ^^5 

^2 + 4 xJ -f ^4 — 6x4 ^ 3 


7;c3 _ ^ 3;^^ < 10 

every Xj = 0, 1. 


FORMULATION EXERCISES 

In exercises 43 through 62, you are to formulate integer programming models. 
Be sure to define all the symbols you use, justify each of the constraints, and develop 
the objective function. When an exercise supplies numerical data, write out the 
model in full detail. 

43 The scholarship and financial aids office at Yankee University is preparing its awards 
for the coming year. It has selected n students to receive awards, and wants to grant 
at least $M_i$ dollars to Student i, for i = 1, 2, ..., n. The office has s different scholarships 
available; Scholarship j confers the amount $a_j$ on its recipient. The office may 
have to award several grants to an individual student in order to provide him with at 
least $M_i$ dollars, but the office cannot reduce the amount of any scholarship award 
below the designated level $a_j$. If the office does not award Scholarship j for the year, 
the amount $a_j$ earns interest and is available for distribution in the following year. 

(a) Devise a model for granting scholarships that maximizes the amount of undistributed 
money, subject to giving each student at least the minimal specified level. 

(b) Show how to modify your answer in part (a) if each Student i is not permitted to 
receive funds from more than two different scholarships, and may not receive a 
total award in excess of $1.1 M_i$. 

44 A state government has asked for bids on n construction projects from each of n firms. 
No firm will be awarded more than one contract, so the decision problem can be 
viewed as an assignment model, as described in Sec. 6.4. For political reasons, the 
government officials want to award no more than N large contracts to firms that are 
located outside the state. Let Projects 1, 2,..., s, denote the large projects and Firms 
1, 2,..., t, the companies that are located out of state. The objective is to minimize 
total cost given the added stipulation. 

(a) Formulate this optimization problem. 

(b) Show that imposing this added stipulation to the assignment model (3) and (4) in 
Sec. 6.4 and then finding a linear programming solution (where the integer 
constraints are ignored) need not produce a feasible assignment. (Hint: consider 
n = 3, s = t = 2, N = 1, every = 0, $c_{13} = c_{31} = 1$, and all other $c_{ij} = 2$.) 

45 Multi-Index Problem. The governor of the state in exercise 44, Ray Gunn, is unhappy 
about the policy of limiting awards to out-of-state firms. He does, however, stipulate 
that one project must be completed in each three-month period, beginning six months 
from now, and ending $6 + 3n$ months from now. Accordingly, each contractor has 
resubmitted a bid stating the cost for each project depending on the designated 
period of completion. Formulate this optimization problem. 

46 Consider a classical transportation model, like that described in Sec. 6.2. Impose the 
restriction that no supply point can serve more than M different demand points, and 
no demand point can be served by more than N different supply points. Assume that 
these stipulations permit a feasible solution. Formulate this optimization problem. 

47 Fred Frosh, a first-year student at Multy University, has decided to take five courses 
in his first term; these courses are designated A, B, C, D, and E. Each course has four 
sections that meet at different times of the day; let A1, for example, denote Section 1 
of Course A, and let $t_{A1}$ be the hour at which it meets. For simplicity, assume that 
every course meets every day, and the starting times of courses are 8 a.m., 9 a.m., ..., 
4 p.m. Fred’s preferences for when he takes courses are influenced by the time of day 
and the reputation of the instructor. For example, let $p_{A1}$ denote Fred’s preference 
for Section 1 of Course A. Unfortunately, Fred cannot select his most preferred 
section of each course due to time conflicts. 

(a) Devise a model that selects a feasible course schedule which maximizes the sum 
of Fred’s preference ratings. 

(b) Indicate how to alter the formulation in part (a) to ensure that Fred has an 
hour for lunch at either noon or 1 p.m. 

*(c) Indicate how to alter the formulation if Fred’s objective function is to maximize 
the number of consecutive hours he has free either from 8 a.m. until his first class, 
or at the end of the afternoon. 

48 Consider the problem of Van Fuller, described in exercise 35 of Chap. 10. Formulate 
his optimization problem in terms of an integer programming model. 

49 Multiplant Expansion Problem. The Manne Agricultural Chemical Corporation operates 
three plants. At the beginning of Period 1, each plant has capacity 10. By the end of 
Period 3, Plant 1 must have capacity 11, Plant 2 must have capacity 12, and Plant 3 
must have capacity 13; these targets are set so that after three periods each plant has 
enough capacity to be self-sufficient in its own marketing area. The capacity of a plant 
can be expanded in unit amounts in any period; for simplicity, assume that capacity 
that is initiated in a period is also available to meet demand requirements during that 
period. Also assume that capacity does not deteriorate. Thus, there are three possible 
ways for Plant 1 to increase its capacity from 10 at the beginning of Period 1 to 11 
at the end of Period 3: (i) add 1 unit in Period 1, (ii) add 1 unit in Period 2, or (iii) 
add 1 unit in Period 3. (Verify that there are six possible ways for Plant 2 to increase 
its capacity from 10 at the beginning of Period 1 to 12 at the end of Period 3.) Let 
$c_{it}(p, q)$ denote the cost in Period t of adding p units of capacity to Plant i, when its 
capacity at the beginning of Period t is q units. 

Let $D_{it}$ be the demand requirement during Period t in the marketing region served 
by Plant i. If Plant i lacks the capacity to fill requirement $D_{it}$, then the firm must 
ship items into this region from one or both of the other plants. Let represent the 
cost of shipping an item from Plant j to Plant i during Period t. 

Formulate a model to determine a minimum cost policy for building capacity at 
each of the plants, and meeting all demands during the planning horizon. 

50 The Phil T. Grimes Company must draw up a preventive maintenance schedule for 
five of its major pieces of equipment; the schedule extends over the next eight weeks. 
Assume the pieces of equipment are indexed 1, 2, 3, 4, 5. Maintenance on Piece 1 
requires four units of labor (say, man-weeks) during the first week of maintenance, 
six units in the second week, and three units in the third week. The servicing can 
begin as early as Week 1 or as late as Week 4. Similar data for the other pieces of 
equipment are shown in Fig. 13.8. 



Equipment   First Week   Second Week   Third Week   Early Start   Late Start 
    1           4             6            3             1            4 
    2           3             2            5             1 
    3           7             1            1             2            5 
    4           1             3            6             2            6 
    5           8             9            2             3            5 

Figure 13.8 


In each part below, formulate an appropriate optimization model. 

(a) Assume that the maximum labor available is P in Week t. Find a feasible schedule 
(that is, determine the starting week for maintenance on each piece of equipment). 

*(b) Suppose Grimes wants to minimize the sum of the weekly fluctuations in labor 
utilization. [If, for example, each project is started as early as possible, the weekly 
labor utilization is 7, 16, 20, 16, 2, 0, 0, 0, so that the sum of the week-to-week 
fluctuations is (16 - 7) + (20 - 16) + (20 - 16) + (16 - 2) + (2 - 0) + 
(0 - 0) + (0 - 0) = 33.] 

*(c) Suppose Grimes wants to minimize the maximum labor utilized during any of 
the eight weeks. 

*(d) Suppose Grimes wants to minimize the maximum weekly fluctuation. (If each 
project is started as early as possible, the resultant maximum fluctuation is 16 — 2 
= 14, between the fourth and fifth week.) 

(e) Show how to modify your answer in part (a) if maintenance on Piece 3 cannot 
start before maintenance begins on Piece 1. If maintenance on Piece 4 has to start 
in the same period as maintenance on Piece 3 (hence, its “late start” week is 
really 5, instead of 6). If maintenance on Pieces 4 and 5 cannot start in the same 
week. If maintenance on Piece 5 cannot start until after maintenance on Piece 1 
is complete (that is, if service on Piece 1 starts in Week 1, for example, then service 
on Piece 5 can start no sooner than Week 4). 
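
The early-start figures quoted in parts (b) and (d) of exercise 50 can be reproduced mechanically, which is a useful check before any of the objectives is written as a model. The Python sketch below assumes the labor requirements and early-start weeks of Fig. 13.8 as we read them; the names `LABOR`, `EARLY`, `weekly_labor`, and `total_fluctuation` are our own.

```python
# Labor needed in the first, second, and third week of service, and the
# earliest starting week, for Pieces 1 through 5 (as read from Fig. 13.8).
LABOR = {1: (4, 6, 3), 2: (3, 2, 5), 3: (7, 1, 1), 4: (1, 3, 6), 5: (8, 9, 2)}
EARLY = {1: 1, 2: 1, 3: 2, 4: 2, 5: 3}

def weekly_labor(start, weeks=8):
    """Total labor used in each of Weeks 1..weeks for the given start weeks."""
    usage = [0] * weeks
    for piece, first_week in start.items():
        for offset, units in enumerate(LABOR[piece]):
            usage[first_week - 1 + offset] += units
    return usage

def total_fluctuation(usage):
    """Sum of the absolute week-to-week changes in labor utilization."""
    return sum(abs(b - a) for a, b in zip(usage, usage[1:]))

usage = weekly_labor(EARLY)
print(usage, total_fluctuation(usage))  # [7, 16, 20, 16, 2, 0, 0, 0] 33
```

Passing a different dictionary of starting weeks to `weekly_labor` evaluates any other candidate schedule in the same way.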

51 The Chick’n Lick’n Company operates a chain of home-delivery restaurants. Each 
store is franchised by the company, which tries to ensure a store owner of an adequate 
level of business. The company is considering n possible new sites in a major population 
area, and has estimated a store in Location j will result in an acceptable return Rj 
provided there is no other new store within a radius of 5 miles. Let $d_{ij} = 1$ if Sites i 
and j are within a 5-mile radius, and 0 otherwise. The company has calculated all 
the possible $d_{ij}$, and wants to choose a pattern of sites so as to maximize total return. 

(a) Formulate the problem as a mathematical optimization model. 

(b) Show how to modify your formulation in part (a) if the company also operates a 
chain of Dipp’n Donut Shops. It can use each of the n sites as either a donut shop 
or a Chick’n Lick’n Store. The company does not want to put two donut shops 
within a 10-mile radius. 

52 Covering Problem. The Trotter Poll Company keeps a variety of data stored on magnetic 
tape files. It wishes to compile statistics on m different population characteristics (such 
as the age distribution, income distribution, size of family dwelling, etc., in eight 
selected metropolitan areas). Assume that all the required information is recorded 
within n different files. Suppose $T_j$, for j = 1, 2, ..., n, is the computer time required 
to search File j, and assume this search time is independent of the number of characteristics 
to be summarized from the file. Several of the m characteristics of interest are 
recorded in more than one of the n files, that is, the files contain duplicate information. 
Let $a_{ij} = 1$ if the ith characteristic is recorded in the jth file, and = 0, otherwise. 
Thus, for example, $a_{13} = a_{18} = a_{19} = 1$ indicates that the first characteristic is recorded 
in Files 3, 8, and 9. 

(a) Formulate an integer programming model to determine which of the n files to 
search so that data on all of the characteristics are gathered in minimum searching 
time. 

(b) Explain how to modify your formulation if the search time for File j consists of 

if File j is searched at all, plus if Characteristic i is obtained during the 
search. 
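
For a small instance of the covering problem in part (a), the meaning of a feasible and an optimal selection of files can be seen by trying every subset. The Python sketch below does exactly that; it is a brute-force illustration, not an integer programming formulation. The function name `cheapest_cover` and the three-file data are hypothetical and are not taken from the exercise.

```python
from itertools import combinations

def cheapest_cover(times, records):
    """Brute-force search over subsets of files.

    times[j]   = search time of File j
    records[j] = set of characteristics recorded in File j
    Returns (total time, tuple of chosen files), or None if no cover exists.
    """
    needed = set().union(*records.values())  # every characteristic on file
    best = None
    files = sorted(times)
    for r in range(1, len(files) + 1):
        for subset in combinations(files, r):
            covered = set().union(*(records[j] for j in subset))
            if covered == needed:
                total = sum(times[j] for j in subset)
                if best is None or total < best[0]:
                    best = (total, subset)
    return best

# Hypothetical data: three files, four characteristics.
times = {1: 8, 2: 4, 3: 3}
records = {1: {1, 2, 3, 4}, 2: {1, 2}, 3: {3, 4}}
print(cheapest_cover(times, records))
```

The number of subsets doubles with each added file, which is why a model of the kind requested in part (a) is needed for realistic values of n.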

53 Delivery Truck Problem (Covering Problem). The Droppit Parcel Company has five 
customer deliveries to make today. It must unload a shipment of weight 1 at 
Customer A, of weight 2 at Customer B, of weight 3 at Customer C, of weight 5 
at Customer D, and of weight 8 at Customer E. The company has four different 
delivery vans available; Truck 1 has weight capacity 2, Truck 2 has weight capacity 6, 
Truck 3 has weight capacity 8, and Truck 4 has weight capacity 8. The cost of operating 
Truck j is Cj. Assume that a single truck cannot deliver to both Customers A and C; 
similarly, a single truck cannot deliver to both Customers B and D. 

(a) Formulate an integer programming model to determine the minimum cost 
allocation of delivery trucks for making all the shipments. 

(b) Show how to alter your formulation if there is an additional cost qj when the 
truck delivers to Customer i. 

(c) Show how to alter your formulation if a truck cannot make more than two 
deliveries during the day. 

(d) Explain the impact on the model formulation of imposing additional constraints 
on the truck routes. 

54 Trim Problem. Consider the model in exercise 25 of Chap. 2. In that description, 
all customer demands were met and only one size jumbo roll was used. 

(a) Suppose, instead, that the Fine-Webb Paper Company can manufacture its 
customer orders from jumbo rolls of 50-, 60-, and 70-inch widths. There is a 
setup cost, however, for using each different jumbo size roll (denote these costs 
by $K_{50}$, $K_{60}$, and $K_{70}$, where the cost units have been chosen to be commensurate 
with the value of trim loss). Show how to alter the formulation in exercise 25, 
Chap. 2, to account for the possibility of using different size jumbo rolls. 

(b) Suppose, instead, that the Fine-Webb Paper Company can ship each customer 
an amount within 20% of his order quantity; hence, the company need manufacture 
as few as 80 reels of 22-inch width, but may manufacture as many as 
120 reels. The foreman wants a schedule that uses no more than two different 
combinations (ordinarily, a linear programming solution would select three 
different combinations, since there are three linear constraints). Show how to 
formulate an optimization model satisfying these restrictions. 

55 The Mini Manufacturing Company produces four kinds of items. Let $D_{kt}$ be the 
demand requirement in Period t for Item k, where k = 1, 2, 3, 4, and t = 1, 2, ..., T; 
assume each is integer-valued. Let $x_{kt}$ denote the production level of Item k in 
Period t. Assume that all demand must be met each period, but it is possible to 
produce for inventory in one period to meet demand requirements in subsequent 
periods. Let $c_{kt} x_{kt}$ be the production cost associated with $x_{kt}$, and $h_k$ be the inventory 
holding cost per unit of Item k held at the end of a period. (Assume inventory at the 
beginning of Period 1 is 0.) So far, the situation for each product resembles the production 
and inventory model in Sec. 8.3. There is an additional constraint, however, 
that at most only one type of item can be produced during each period. 

(a) Formulate an optimization model taking account of this production limitation. 
*(b) Formulate an appropriate dynamic programming recursion. 

*56 Show how the variable cost structure in the inventory and production scheduling 
model of exercise 44, Chap. 8, can be characterized within an integer programming 
version of the problem. 

*57 Show how the variable cost structure in the inventory model of exercise 9, Chap. 9, 
can be characterized within an integer programming version of the problem. 

58 Consider an integer programming model containing only zero-one variables. 

(a) Suppose the model contains (in the objective function or a constraint) the term 

Show how a zero-one variable^ can be substituted for this term, provided 
that you add the constraints ^ 2 and —x^ — Xg — -f 3^ ^ 0. 

(b) How does the approach in part (a) generalize to handling the product term 
$(x_1)^{p_1} (x_2)^{p_2} \cdots (x_n)^{p_n}$, where each $p_j \ge 0$. [Be sure your formulation takes advantage 
of any $p_j = 0$, as occurred in part (a).] 

(c) Verify that $(x_j)^p$, where $p > 0$, can be replaced by $x_j$. 

*(d) Apply the observations in the above parts to exercise 42. 
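
The device in part (a) of exercise 58 can be verified by checking all zero-one combinations. The Python sketch below does so for a product of three zero-one variables. The names `x1`, `x2`, `x3`, and `y` are assumptions made for illustration, and the two linking constraints are taken in the standard form $x_1 + x_2 + x_3 - y \le 2$ and $-x_1 - x_2 - x_3 + 3y \le 0$, which may differ in its subscripts from the exercise as printed.

```python
from itertools import product

def feasible_y(x1, x2, x3):
    """Values of the zero-one variable y allowed by the two linking constraints."""
    total = x1 + x2 + x3
    return [y for y in (0, 1) if total - y <= 2 and -total + 3 * y <= 0]

for x in product((0, 1), repeat=3):
    # the only admissible y must equal the product of the three variables
    assert feasible_y(*x) == [x[0] * x[1] * x[2]]

print("y is forced to equal x1*x2*x3 in all 8 cases")
```

The first constraint forces y up to 1 when all three variables equal 1, and the second forces y down to 0 whenever at least one of them is 0.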

59 Quadratic Assignment Problem. The Ireson Engineering Company serves its industrial 
clients by advising on the layout of plants and office buildings. A typical problem of 
this sort involves a new plant that has been zoned into n different work areas. Each 
area is to be the location of one of n different production facilities. If Facility 1 is 
assigned, for example, to Work Area 3, and Facility 2 is assigned, for example, to 
Work Area 5, the arrangement gives rise to the cost $c_{13;25}$. The cost is due to the 
amount of “traffic” in materials and personnel flowing between Facilities 1 and 2 
and the distance between Work Areas 3 and 5. (Since there are $n^2$ pairings of facilities 
to work areas, there are altogether $n^4$ cost coefficients $c_{ij;kl}$.) Formulate a cost minimization 
model that assigns facilities to locations. (Hint: the objective function is quadratic.) 

60 Another integer programming formulation of the traveling salesman problem is based 
on using zero-one variables $x_{ijk}$. Letting $x_{ijk} = 1$ implies that k is the position in the 
tour at which the salesman visits City i, and from there goes to City j. In every tour, 
let k = 1 for City 1. Write in detail all the restrictions and the objective function 
appropriate to this formulation for a four-city problem. (Hint: the salesman must 
depart from every city. If he visits City i at Position k and goes to City j, then he visits 
City j at Position k + 1.) Be sure to explain why the formulation rules out subtours. 

*61 Critical Path Scheduling. Consider the Swift Building Company case described in Sec. 6.6 
and displayed in Fig. 6.13. Suppose each job requires a specified amount of a scarce 
resource, such as labor time, per period the job is in process. For example, given 
that it takes Li units of time to complete Job A, assume that units of labor are required 
each week that Job A is being completed. Assume that Rp is the maximum amount of 
labor available in Week p for all jobs in process. 

(a) Formulate the problem as a mathematical optimization model. Use the data in 
Fig. 6.13. The objective is still to complete the entire project as early as possible. 
[Hint: introduce zero-one variables that indicate the week a job starts. Express 
the time a job starts in terms of these variables, and rewrite the precedence ordering 
constraints (2) through (6) in Sec. 6.6 accordingly.] Explain how to modify 
your formulation if the labor time needed for a job differed from one week to the 
next while it was being completed. 

(b) Show how to modify your formulation in part (a) if there also is a second resource 
restriction. For example, assume that units of this other resource are required 
each week that Job A is being completed, and that Sp is the maximum amount of 
this resource available in Week p for all jobs in process. 

*62 Critical Path Scheduling. Consider the Swift Building Company case described in Sec. 6.6 
and displayed in Fig. 6.13. Suppose there are several alternative ways to construct 
the entire project. Specifically, instead of Job D, the company can use either Job G 
or Job H. If it uses Job G, the immediate predecessors for Job G are A and G, for 
Job D is B, and for Job E is B. The other data in Fig. 6.13 are unchanged. If the 
company uses Job H, there is no immediate predecessor for Job H, the immediate 
predecessor for Job E is B, and for Job F is C, E, and H. The other data in Fig. 6.13 
are unchanged. Revise the formulation (1) through (6) to account for the choice 
possibility between Jobs D, G, and H. The objective is still to complete the entire 
project as early as possible. 


MIND-EXPANDING EXERCISES 

63 Assembly-Line Balancing. An assembly line consists of locations, called work stations, 
and one or more jobs may be performed at each station in the process of completing 
the manufacture of an item. Assume there are six jobs in all, that a unit must go 
through the assembly line from Station 1, to Station 2, ..., to the final station. 

Each Job i requires $t_i$ minutes to perform. The 
combined time of all jobs assigned to each work 
station is not permitted to exceed T, which is called 
the cycle time of the line; in this example, let 
T = 10. Certain jobs must be performed before 
others; the precedence relations and times are 
shown in Fig. 13.9. 

The optimization problem is to open as few work 
stations as possible, consistent with the cycle time 
restriction and the precedence relations. Formulate 
the problem as a mathematical optimization 
model. {Hint: let be a zero-one variable which 
equals 1 if Job i is performed at Work Station j. 
Each job must be performed at a station, subject 
to the precedence restrictions. The total time for all jobs assigned to a station must 
not exceed T minutes.) 

64 Machine (Job-Shop) Scheduling Problem. Suppose that three items are to be sequenced 
through n machines. Each item must be processed first on Machine 1, then on Machine 
2, . . ., and finally on Machine n. The sequence of jobs may be different for each 
machine. Let $t_{ij}$ be the time required to perform the work on Item i by Machine j; 
assume each $t_{ij}$ is an integer. The objective is to minimize the total work span to 
complete all the items. 

(a) Formulate the problem as an integer programming model. Write the model in 
detail for n = 4. [Hint: One approach is to let $x_{ij}$ represent the time at which 
Item i begins processing by Machine j. In this formulation, you must make sure 
that no two items occupy the same machine at the same time, and that an item 
does not start processing on Machine j + 1 until the work is completed on 
Machine j. Another approach is to let a zero-one variable $x_{ijk}$ correspond to 
assigning Item i to the kth position of the sequence on Machine j.] 

(b) Indicate the simplifications, if any, that are possible when you stipulate that the 
items are to be processed in the same sequence on each machine. (This can be 
shown to be an optimal policy if there are two or three machines; in the case of 
two machines, there is a simple algorithm for finding an optimal solution.) 
Figure 13.9 
65 Machine (Job-Shop) Scheduling Problem. In exercise 64, stipulate that the items are to be 
processed in the same sequence on each of the n machines. Devise a technique of 
solution analogous to the method of partial tours in Sec. 13.6. [Hint: given a partial 
assignment of items to Position 1, 2, . . ., k of the sequence, a lower bound on the 
total time span to complete all the jobs can be found in terms of when the kth job is 
completed on each machine, and of the sum of the processing times on each machine 
required by the jobs remaining to be scheduled.] 
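
The bound described in the hint can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `completion_times` and `lower_bound` are ours, the processing time of Item i on Machine j is stored as `times[i][j]`, and the data in the test below are hypothetical. The sketch computes only the bound, not the full partial-tour search that the exercise asks you to devise.

```python
from typing import List, Sequence


def completion_times(times: Sequence[Sequence[int]], order: Sequence[int]) -> List[int]:
    """Time at which each machine finishes the items in `order`,
    processed in that same sequence on every machine."""
    n_machines = len(times[0])
    done = [0] * n_machines          # done[j]: when Machine j becomes free
    for i in order:
        prev = 0                     # when item i leaves the previous machine
        for j in range(n_machines):
            start = max(done[j], prev)
            done[j] = start + times[i][j]
            prev = done[j]
    return done


def lower_bound(times: Sequence[Sequence[int]], partial: Sequence[int]) -> int:
    """Lower bound on the total time span of any full sequence that begins
    with `partial`: on each machine, the completion time of the last fixed
    item plus the processing times of all items still to be scheduled."""
    remaining = [i for i in range(len(times)) if i not in partial]
    done = completion_times(times, partial)
    return max(done[j] + sum(times[i][j] for i in remaining)
               for j in range(len(done)))
```

A sequence fragment can be discarded as soon as its bound is no smaller than the span of the best complete sequence found so far.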

66 Knapsack Problem. Consider exercise 36, and devise an algorithm for solving such 
problems analogous to the method of designated routes in Sec. 13.6. [Hint: given a set of 
variables at specified levels, an upper bound on the value of the objective function for 
an optimal choice of the remaining variables can be found from solving an associated 
linear programming problem.] 

67 Fixed Charge Problem. Consider the model described in exercise 6. Devise an algorithm 
for solving such problems analogous to the method of designated routes in Sec. 13.6. 

68 Quadratic Assignment Problem. Review the case of the Ireson Engineering Company, 
described in exercise 59. Consider each possible pair of facilities: (Facility 1, Facility 2), 
(Facility 1, Facility 3), . . ., (Facility 1, Facility n), (Facility 2, Facility 3), (Facility 2, 
Facility 4), . . ., (Facility n - 1, Facility n). Similarly, consider each possible pair 
of work areas: (Area 1, Area 2), (Area 1, Area 3), . . ., (Area n - 1, Area n). Verify 
that altogether there are $n(n - 1)/2$ pairs of facilities, and the same number of pairs 
of work areas. 

Imagine a table having $n(n - 1)/2$ rows, one for each pair of facilities, and the 
same number of columns, one for each pair of work areas. Assume that the cost 
structure is such that the same cost arises if, for example, Facility 1 is assigned to Work 
Area 3 and Facility 2 is assigned to Work Area 5, or if Facility 1 is assigned to Work 
Area 5 and Facility 2 is assigned to Work Area 3. In other words, only the information 
that the pair (Facility 1, Facility 2) is assigned to the pair (Area 3, Area 5) is needed 
to determine the cost. Comment on why such a cost assumption might be reasonable. 

(a) Suppose that the cost associated with each pairing of facilities coupled with each 
pairing of work areas is inserted in this square table. Comment on whether solving 
an assignment problem using these costs yields an answer to the facility location 
problem. [Suggestion: let n = 4 and construct the square table, showing the row 
and column headings; then examine whether an assignment solution is always 
feasible for the location selection problem.] 

(b) Based on your observations in part (a), suggest ways to modify the several 
approaches used for solving the traveling salesman problem in Sec. 13.6 so as to 
find a solution to the location problem. 

69 Primal Cutting-Plane Algorithm. The cutting-plane approach in Sec. 13.4 can be characterized 
as a “dual” algorithm in that the dual constraints are satisfied throughout 
(after the simplex method has found an optimal solution ignoring the integer constraints). 
The successive solutions are not primal feasible until the final iteration, since 
only then are the integer constraints satisfied. A primal algorithm satisfies throughout 
the integer stipulations as well as the linear restrictions. The details of a primal 
algorithm that is guaranteed to converge go beyond the scope of this text. But a few 
of the important ideas can be illustrated by an example. 

Assume you have an initial nonoptimal primal feasible integer-valued solution. 
Apply the ordinary Criterion I of the simplex method to select a variable to enter the 
next solution. Apply Criterion II of the simplex method to determine the maximum 
level at which the selected variable can enter so as to keep all the other basic variables 
nonnegative. Denote this level by r > 0. If the ordinary pivot element does not equal 
1, you then construct a cut to be used as the row in which the pivoting will occur. 
The cut must have the following two properties: (i) The pivot element for the selected 
variable equals 1; hence, all the coefficients and the basic variables remain integer-valued 
after the pivot operation. (ii) The right-hand-side constant is no larger than r, 
so that the basic variables remain nonnegative. Here is a way to find such a cut. 

Repeat the ratio arithmetic done in Criterion II, now taking the integer part of each 
ratio. Select any row for which the integer part of the ratio is less than or equal to r. 
(If there are several such rows, select one having the largest integer part.) As a side 
calculation, divide the selected row by the coefficient of the entering variable, and then 
construct the cut given by (12) in Sec. 13.4. Augment the equations with this cut, 
pivot, and return to Criterion I. 

(If r < 1, then necessarily the right-hand-side constant for the cut is 0. Therefore, 
the objective function does not improve at such an iteration. To guarantee convergence, 
the method has to be amended in such a way as to rule out the possibility of 
an endless sequence of cuts having 0 right-hand sides. We do not present such a 
procedure here.) 

To see how the approach works, consider the following modified version of the 
Knox Mix Company Example (Secs. 2.2 and 4.4): 

$x_0 - 4x_1 - 9x_3 = 0$        Row 0 

$1x_1 + 1x_3 + x_5 = 12$        Row 1 

$3x_1 + 10x_3 + x_7 = 85$        Row 2. 

(Verify that $x_4$ and $x_6$ have been dropped, $x_2$ has been set equal to 3 and then 
suppressed, and the second constraint has been eliminated in the original version treated 
in Sec. 4.4.) 

On the first iteration, select $x_3$ to enter. Perform Criterion II and find that 
$r = \frac{85}{10} = 8.5$. Repeat the ratio arithmetic using integer parts to obtain the values 
$[\frac{12}{1}]$ and $[\frac{85}{10}]$. Since only $[\frac{85}{10}] \le r$, you must write a cut on the equation 
in Row 2. As a side calculation, divide Row 2 by 10 obtaining 

$\frac{3}{10}x_1 + x_3 + \frac{1}{10}x_7 = \frac{85}{10}$, 

which results in the cut 

$x_3 + x_8 = 8$        Row 3, 

where $x_8$ is the slack variable for the cut. Append Row 3 to the problem and pivot on 
$x_3$. 
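
The side calculation can be checked mechanically. The sketch below assumes that the cut (12) of Sec. 13.4 amounts to dividing the selected row by the coefficient of the entering variable and then replacing every coefficient and the right-hand side by its integer part; that reading is consistent with the cut obtained in the example above, but Sec. 13.4 remains the authority. The function name `primal_cut` and the dictionary layout are ours.

```python
from fractions import Fraction
from math import floor


def primal_cut(row, rhs, entering):
    """Build the cut from one tableau row.

    row      -- dict mapping variable name to its (integer) coefficient
    rhs      -- right-hand-side constant of the row
    entering -- name of the entering variable (positive coefficient assumed)

    Returns (cut coefficients, cut right-hand side); the slack variable
    of the cut is implied and enters with coefficient 1.
    """
    pivot = Fraction(row[entering])
    # Divide by the pivot coefficient, then keep only the integer parts.
    cut = {var: floor(Fraction(coef) / pivot) for var, coef in row.items()}
    return cut, floor(Fraction(rhs) / pivot)


# Row 2 of the example: 3*x1 + 10*x3 + x7 = 85, with x3 entering.
coefficients, constant = primal_cut({"x1": 3, "x3": 10, "x7": 1}, 85, "x3")
print(coefficients, constant)
```

Because the coefficient of the entering variable in the cut is exactly 1, pivoting on the cut keeps every tableau entry integer-valued.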

(a) Complete the algorithm until you obtain an optimal solution. [Hint: the approach 
requires three more cuts. The value of the objective function increases from 0 to 
72 (after pivoting on Row 3), then to 76, to 83, and remains 83 at the final 
iteration. You will introduce slack variables $x_9$ and $x_{10}$, and will eventually have 
to pivot on $x_8$ and $x_9$ to obtain convergence. When you pivot on $x_8$, use Row 1 
to generate the cut. Explain why.] 

(b) Apply the method to the original version of the Knox Mix Problem. 


70 Group Theoretic Algorithm. A powerful technique for solving integer programming problems stems 
from exploiting so-called group theoretic properties of the problem. A treatment of 
all the details of the approach goes beyond the intended level of this text, but the gist 
of the idea can be explained by means of an example. Consider the case of the Knox 
Mix Company Problem, described in Sec. 2.2 and solved by the simplex method in 
Sec. 4.4. Examine the equations (7) at the final iteration. Let an algebraic representation 
of an equation in (7) be $\sum_j A_j x_j = B$. Then we can write 

$A_j = [A_j] + f_j$, 

where $0 \le f_j < 1$, and $B = [B] + f$, where $0 \le f < 1$. In other words, $f_j$ is the 
fractional part of $A_j$, and $f$ is the fractional part of $B$. The equation, then, is 

$\sum_j [A_j] x_j + \sum_j f_j x_j = [B] + f$. 

Explain why it is necessary that $\sum_j f_j x_j - f$ be integer-valued for every feasible 
integer-valued solution. Check that if $x_j$ is a basic variable, then $f_j = 0$. Hence, 
explain why it is necessary that the $x_j$ must satisfy $\sum_j f_j x_j \equiv f \pmod 1$, that is, the 
left- and right-hand sides must differ by an integer for feasibility. 

Verify from (7) that, as a consequence, it is necessary to satisfy 

$\frac{5}{7}x_2 + \frac{2}{7}x_4 + \frac{3}{7}x_5 + \frac{6}{7}x_7 \equiv \frac{1}{7} \pmod 1$ 

$\frac{1}{7}x_2 + \frac{6}{7}x_4 + \frac{2}{7}x_5 + \frac{4}{7}x_7 \equiv \frac{3}{7} \pmod 1$ 

$\frac{2}{7}x_2 + \frac{5}{7}x_4 + \frac{4}{7}x_5 + \frac{1}{7}x_7 \equiv \frac{6}{7} \pmod 1$. 

Check that after multiplying through by 7, an equivalent statement is 

      $5x_2 + 2x_4 + 3x_5 + 6x_7 \equiv 1 \pmod 7$ 

(M)   $1x_2 + 6x_4 + 2x_5 + 4x_7 \equiv 3 \pmod 7$ 

      $2x_2 + 5x_4 + 4x_5 + 1x_7 \equiv 6 \pmod 7$. 


Verify that a feasible solution to these equations (M), when substituted back into 
(7), results in integer values for the basic variables, but such integer values may not 
be nonnegative. 

You also know from (7) that the objective-function value will equal 
$\frac{695}{7} - \frac{3}{7}x_2 - \frac{11}{7}x_4 - \frac{13}{7}x_5 - \frac{5}{7}x_7$. Therefore, you seek feasible values for these 
nonbasic variables that minimize $3x_2 + 11x_4 + 13x_5 + 5x_7$ (where we have multiplied 
through by 7 to eliminate fractions). 

Verify that given the modulus arithmetic in (M), $x_2 = 2$ has the same effect as 
$x_5 = 1$; $x_5 = 2$ has the same effect as $x_7 = 1$; $x_5 = x_7 = 1$ has the same effect as 
$x_4 = 1$. The underlying principle is that there is a group of seven columns, each 
column containing an entry for every row, that can be associated with any set of 
integer values for the $x_j$ in (M). In this example, you can generate these columns by 
picking any column of coefficients, say, that for $x_2$, which is $(5, 1, 2)^T$ in (M), and 
calculate the columns $(5x_2, 1x_2, 2x_2)^T$, for $x_2 = 0, 1, \ldots, 6$, using modulus 7 
arithmetic. Verify this process gives 

      0   5   3   1   6   4   2 
      0   1   2   3   4   5   6 
      0   2   4   6   1   3   5 
      ⓪   ①   ②   ③   ④   ⑤   ⑥, 

where the circled numbers underneath the columns will serve to identify them. 
Note that ③ is the right-hand side of (M). 

You can identify minimizing the above stated objective function subject to (M) 
in terms of a shortest-route problem. Designate the nodes ⓪, ①, . . ., ⑥ corresponding 
to the seven columns of numbers. Associate all $x_j = 0$ on the left-hand side of (M) 
with Node ⓪. Associate each $x_j = 1$ with an arc from Node ⓪ to another node. 
For example, $x_2 = 1$ yields an arc from Node ⓪ to Node ① with a cost 3; $x_4 = 1$ 
yields an arc from Node ⓪ to Node ⑥ with a cost 11; $x_5 = 1$ yields an arc from 
Node ⓪ to Node ② with a cost 13. What does $x_7 = 1$ yield? 

By the same token, draw four arcs leading out of each other node, except Node ③ 
which is the right-hand side of (M). (Also, you do not need to draw arcs leading back 
to Node ⓪.) For example, there is an arc with cost 3 from Node ① to Node ② 
corresponding to a unit of $x_2$; there is an arc with cost 13 from Node ① to Node ③ 
corresponding to a unit of $x_5$. The nodes and a few arcs are shown in Fig. 13.10. 
Complete the network diagram. The problem is to find a shortest route from Node ⓪ 
to Node ③. Solve this optimization problem in the completed network. 



A route such as ⓪ to ① to ② to ④ corresponds to letting $x_2 = 2$ and $x_5 = 1$. 
Verify that the shortest route in the completed network corresponds to letting $x_2 = 3$. 
What are the values for the basic $x_j$ in (7) when you let $x_2 = 3$? Explain why this is 
the optimal solution to the original problem. 
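
The network calculation can be sketched in Python as follows. The columns and right-hand side are those of (M), and the costs 3, 11, and 13 are the arc costs quoted above; the cost of 5 for a unit of $x_7$ is taken here as an assumption, and the answer for this example does not depend on it. The helper names are ours, and the route is found with a standard shortest-path routine (Dijkstra's method), which is one of several techniques that would serve.

```python
import heapq

MOD = 7
# Columns of (M) for each nonbasic variable, and the right-hand side.
COLUMNS = {"x2": (5, 1, 2), "x4": (2, 6, 5), "x5": (3, 2, 4), "x7": (6, 4, 1)}
COSTS = {"x2": 3, "x4": 11, "x5": 13, "x7": 5}   # x7 cost is an assumption
RHS = (1, 3, 6)


def add(u, v):
    """Componentwise addition using modulus 7 arithmetic."""
    return tuple((a + b) % MOD for a, b in zip(u, v))


def group_elements(generator):
    """The seven columns 0*g, 1*g, ..., 6*g; index k identifies Node k."""
    elements, current = [], (0, 0, 0)
    for _ in range(MOD):
        elements.append(current)
        current = add(current, generator)
    return elements


def shortest_route(target):
    """Cheapest way to reach `target` from the zero column.
    Returns (total cost, units of each variable used along the route)."""
    start = (0, 0, 0)
    best = {start: 0}
    usage = {start: {}}
    heap = [(0, start)]
    while heap:
        dist, node = heapq.heappop(heap)
        if dist > best[node]:
            continue                      # stale heap entry
        if node == target:
            break
        for var, column in COLUMNS.items():
            nxt, new_dist = add(node, column), dist + COSTS[var]
            if new_dist < best.get(nxt, float("inf")):
                best[nxt] = new_dist
                counts = dict(usage[node])
                counts[var] = counts.get(var, 0) + 1
                usage[nxt] = counts
                heapq.heappush(heap, (new_dist, nxt))
    return best[target], usage[target]


print(group_elements(COLUMNS["x2"]))
print(shortest_route(RHS))
```

The route tells you only which nonbasic variables to raise; you must still substitute back into (7) to confirm that the basic variables come out nonnegative.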

(a) Perform the algorithm on the example (20), (21), and (22) in Sec. 13.4. 

*(b) The problem in part (a) is an example of a knapsack model, described in exercises 
22 and 47 of Chap. 10. Use your analysis in part (a) to characterize an optimal 
solution for every possible value of the right-hand-side constant of the constraint. 

(c) Suppose you add to the Knox Mix Company Problem a variable $x_8$, having 
coefficients $a_1$, $a_2$, $a_3$ in Rows 1, 2, 3, respectively, where each $a_i$ is integer-valued. 
Show that the column of fractional parts, associated with $x_8$ in (7), would, after 
multiplication by 7, be one of the columns ⓪, . . ., ⑥. 

*(d) Establish the connection between the algorithm and the network representation 
given above and the algorithm and network implied by the recursion (ii) in the 
special material at the end of Sec. 10.2. 
71 A solution to the traveling salesman problem can be characterized in terms of a 
dynamic programming recursion. Observe that at any point of the salesman’s tour, 
when he is about to choose the next city, the associated state variable can be viewed 
as the city where he is currently located, and the set of cities he has visited previously 
(but not the order in which he has visited them). Develop a dynamic programming 
recursion on the basis of this observation. Comment on whether the approach is 
practical for obtaining numerical solutions. 
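
One possible reading of this observation is sketched below in Python. The state is taken to be a pair (S, j): the set S of cities already visited, not counting the home city, together with the city j in S where the salesman now stands. The function name `tsp_dp`, the choice of City 0 as home, and the distance data in the test are our assumptions; the sketch is meant as an aid to checking your own recursion rather than as the intended answer.

```python
from itertools import combinations


def tsp_dp(dist):
    """Length of a shortest tour that starts and ends at City 0.

    dist[i][j] is the distance from City i to City j (at least two cities).
    f[(S, j)] is the length of a shortest path that leaves City 0, visits
    every city in the set S exactly once, and ends at City j, a member of S.
    """
    n = len(dist)
    f = {(frozenset([j]), j): dist[0][j] for j in range(1, n)}
    for size in range(2, n):
        for subset in combinations(range(1, n), size):
            visited = frozenset(subset)
            for j in subset:
                earlier = visited - {j}
                # The order within `earlier` is irrelevant; only its last city matters.
                f[(visited, j)] = min(f[(earlier, i)] + dist[i][j] for i in earlier)
    everything = frozenset(range(1, n))
    return min(f[(everything, j)] + dist[j][0] for j in range(1, n))
```

Counting the entries stored in the table `f` as a function of the number of cities is a good starting point for the comment on practicality that the exercise requests.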

*72 The advanced material at the end of Sec. 13.2 suggests a way to treat a piecewise-linear 
function r(x) so that it may be incorporated into an integer programming 
problem. The formulation involves using (24), (25), and (27) through (30). The 
method is a variant of an approach discussed in Sec. 14.7. Another formulation 
can be based on an approach in the special material of Sec. 14.7, namely, (i) through 
(vii). Show how that approach can be formulated as an integer programming model 
so as to guarantee the property of adjacent weights. 
CHAPTER 14 


Optimization with a 
Nonlinear Objective Function¹ 


14.1 INTRODUCTION TO NONLINEAR PROGRAMMING 


Recall that any linear optimization model can be viewed as 

(1)   maximize $\sum_{j=1}^{n} c_j x_j$ 

subject to 

(2)   $\sum_{j=1}^{n} a_{ij} x_j \le b_i$   for $i = 1, 2, \ldots, m$ 

(3)   $x_j \ge 0$   for $j = 1, 2, \ldots, n$. 

You have already studied several nonlinear variations of this model, such as the 
dynamic programming examples in Chaps. 8, 9, and 10, as well as the combinatorial 
models in Chap. 13, in which several of the $x_j$ were restricted to having 
integer values. The term nonlinear programming usually refers to problems 
such as 

(4)   maximize $c(x_1, x_2, \ldots, x_n)$ 
subject to 

(5)   $a_i(x_1, x_2, \ldots, x_n) \le 0$   for $i = 1, 2, \ldots, m$, 

where both $c(x_1, \ldots, x_n)$ and $a_i(x_1, \ldots, x_n)$ are real-valued, nonlinear, but well-behaved 
functions of n real variables. Actually, (4) and (5) can be viewed as a 
canonical form for a nonlinear programming problem, since you can employ 
the techniques of Chap. 3 to treat the optimization as “maximize” and the constraints 
as inequalities with the direction given in (5). A nonnegativity restriction 
on a variable can be contained in (5), expressed as $-x_j \le 0$. Customarily, 
integer-valued stipulations on the $x_j$ are not encompassed in what are termed 
“nonlinear programming problems.” 

1. This chapter requires a knowledge of differential calculus. 

► A qualification is in order here, since you can always include in (5) a set of nonlinear 
constraints that will “trick” a model into allowing only integer-valued variables for 
feasible solutions. To illustrate, suppose $x_j$ is restricted to be either 0 or 1. Then this 
discrete-valued stipulation can be ensured by imposing the constraints $-x_j \le 0$, 
$x_j - 1 \le 0$, and $x_j(1 - x_j) \le 0$. Such tricks are not treated in this and the next chapter. 
The end of Sec. 13.6 briefly treats the solution of nonlinear programming problems 
containing integer-valued variables. 
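
A short numerical check makes the trick concrete. The sketch below, with a function name of our choosing, tests the three constraints on a grid of trial values; only 0 and 1 survive, because the first two constraints confine the variable to the interval from 0 to 1, where the product $x_j(1 - x_j)$ is nonnegative and so can satisfy the third constraint only by vanishing.

```python
def satisfies_binary_trick(x):
    """True when x meets all three constraints of the zero-one 'trick'."""
    return -x <= 0 and x - 1 <= 0 and x * (1 - x) <= 0


# Trial values from -0.5 to 1.5 in steps of 0.25.
grid = [k / 4 for k in range(-2, 7)]
feasible = [x for x in grid if satisfies_binary_trick(x)]
print(feasible)
```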

Need for nonlinear programming techniques. Even a quick 
review of the simple, yet illustrative, examples of linear programming models contained 
in Chap. 2 should suffice to raise your doubts that many actual situations 
really fit the mold of strict linearity. You may easily have gotten the impression 
that a linear approach simply ignores such phenomena as: efficiencies or inefficiencies 
of large-scale operations in a product-mix model; nonadditive volume 
relationships when chemicals are combined; the effect of sales quantity on sales 
price, and therefore on sales revenue. If you did get such an impression, the 
discussion below will clear up this misunderstanding. 

There is plenty of evidence that linear programming models have been applied 
very successfully in nonlinear environments. Since any model, inevitably, 
only approximates reality, the important question is “when does a linearized version 
provide an adequate representation of a nonlinear phenomenon?” What 
you must learn, then, is to distinguish between circumstances that are and are 
not amenable to a straightforward application of linear programming. 

In a particular application, if you have some knowledge about the range of 
values for an optimal solution, then you can usually restrict the solution to this 
range and employ a trustworthy linear approximation. It is only when the model 
has a wide region of feasible solutions, and you have very little idea about an 
optimal solution, that you may be unable to find a good enough linear approximation. 
These statements are tautological to some extent, because they relate to how 
you formulate the model in the first place, in terms of specifying the objective 
function and constraints. But for most applications, you will find that the issue of 
whether or not a linearized version suffices is more readily resolved than you might 
guess from the foregoing, admittedly vague, comments. The following two illustrations 
suggest when a linearized version is liable to be inadequate in a real 
situation. 

Consider a company in the initial stages of developing a model for corporate-wide 
long-range planning. A management scientist usually knows that, even if he 
is an experienced businessman, it is difficult to give accurate, detailed forecasts 
of optimal production levels and market penetrations for his company for the next 
10 or more years. Indeed, a major reason for an executive employing such a model 
is that he realizes how easily his intuition can be amiss when trying to fathom the 
influence of economic factors projected beyond the present. If production costs and 
revenues vary nonlinearly with the scale of operations, linearized guesses may not 
be good enough to give valid answers. (After the model has been used repeatedly, 
however, a linear version may prove workable, provided the model parameters do 
not change drastically over time.) 

The second example is a firm that schedules production by employing a 
dynamic, multi-item model that reflects the presence of significant machine setup 
times, limited machine-group capacities, and fluctuating demand requirements. 
Typically, in such a case, the essence of the optimization problem is to modulate 
the various nonlinear effects of scheduling decisions. Unless the production 
planner can make a very good guess at an optimal solution, any simple linearization 
of the problem is likely to do violence to the fundamental nature of the 
optimization. 

Even though a specific application may require you to build a nonlinear model, 
you may be able to use a solution technique that is only a slight variation of a linear 
model. If the nonlinearities are severe, however, because of their shape characteristics 
or their pervasiveness in the model description, you will have to resort 
to solution techniques that are truly more sophisticated than the simplex algorithm. 
This point will become clear below and in the next chapter. 

Managerial significance of nonlinear models. Today, the preponderance 
of mathematical programming applications to real managerial decision-making 
situations employ linear approximations, rather than explicit 
nonlinear formulations. But the importance of nonlinear programming applications 
is growing. This is due to the rapidly increasing sophistication of managers 
and operations researchers in implementing decision-oriented mathematical 
models, as well as to the growing availability of computer routines capable of 
solving large-scale nonlinear problems. 

Most nonlinearities encompassed in a programming model fall into two 
categories: 

(i) Empirically observed relationships, such as nonproportional variations in 
costs, process yields, and quality characteristics. 

(ii) Structurally derived relationships, which encompass postulated physical 
phenomena, mathematically deduced behavior, and managerially determined 
rules. 

Obviously, no hard-and-fast line separates these two categories, since with enough 
data you may be able to derive a structural relationship underlying an empirically 
observed phenomenon. 

An example of (i) occurs when over a period of years factory output does not 
rise proportionately with an increase in man-hours, whereas process scrap rises 
more than proportionately. An illustration of (ii) arises in a firm having to pay an 
electricity bill calculated from a nonlinear formula involving both the firm’s daily 
average and its peak-load requirements. In this instance, the firm knows of the 
nonlinear cost structure from a rate schedule negotiated with the utility company. 

Other situations in which nonlinearities are built into the programming models 
are: 

1. Gasoline Blending. A model for blending gasoline from so-called refinery raw 
stocks usually contains nonlinear constraints relating to each blend’s octane rating, 
since this quality characteristic varies nonlinearly with the amount of 
tetraethyl lead (TEL) added to the mix. 

2. Process Control. In a model for a steel-processing plant, a variable representing 
the temperature of a blast furnace can be described by a nonlinear function of 
variables indicating the amount and duration of heat energy applied. Each of 
these variables, in turn, is contained in other constraints as well as in the objective 
function. 

3. Sales Revenue. A company may face a responsive price-volume relationship 
for its products: the lower a product’s price, the greater the sales quantity, even in 
the face of resultant price decreases by competitors. Therefore, sales revenue does 
not vary proportionately with price, and this phenomenon is reflected in the 
objective function of a product-mix model by a nonlinear component. To illustrate, 
let $x(p)$ represent the sales quantity as a function of the price p; then $p \cdot x(p)$ is the 
associated sales revenue. Suppose the sales quantity function is linear, $x(p) = ap + b$, 
over the range of interest for p. Then the sales revenue component in the 
objective function is quadratic $(ap^2 + bp)$ in the decision variable p. 
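
A small numerical sketch shows the effect. The coefficient values used in the test (a = -2 and b = 100) are hypothetical, and the function names are ours. With a negative slope a, the quadratic revenue has its peak at p = -b/(2a); whether that price lies within the range of interest for p must be checked separately.

```python
def sales_quantity(p, a, b):
    """Linear price-volume relationship x(p) = a*p + b."""
    return a * p + b


def sales_revenue(p, a, b):
    """Revenue p * x(p) = a*p**2 + b*p, quadratic in the price p."""
    return p * sales_quantity(p, a, b)


def revenue_maximizing_price(a, b):
    """Price at which the quadratic revenue peaks (requires a < 0)."""
    if a >= 0:
        raise ValueError("a must be negative for revenue to have a maximum")
    return -b / (2 * a)
```

Doubling the price from 10 to 20 with these hypothetical coefficients raises revenue from 800 to 1200 rather than to 1600, which is exactly the nonproportional behavior a linear objective function cannot represent.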

4. Multi-Item Order Quantities. The inventory models in Chaps. 8 and 9 dealt 
with single-item situations. But frequently a wholesaler replenishes his inventory 
by ordering several items at one time from a single supplier, thereby taking 
advantage of economies in shipping costs, paper work, and quantity discounts 
offered by a supplier. This situation can be treated by a large-scale programming 
model in which the associated costs of replenishment appear as nonlinear functions 
of the several order quantities. 

5. Safety-Stock Inventory Levels. In most mathematical programming models used 
for corporate planning, the time periods are rarely less than three months and 
often are a year or more. Such a multiperiod model for a manufacturing company 
usually includes provision for inventory levels that are to act as safety-stocks to 
accommodate weekly fluctuations in sales. One approach employed in these 
models is to let the safety-stock level for an item be a function of both its forecasted 
sales quantity and the fraction of capacity utilization implied by this forecast. For 
example, c may be the weekly capacity available to produce an item, s the item’s 
forecasted average weekly sales, and $n \cdot s$ the item’s safety-stock level, where n 
represents the number of weeks’ sales depending on the capacity utilization factor 
$s/c$. To illustrate, suppose management has established the formula for n to be 
$[n = m + f \cdot (s/c)]$; then the resultant safety-stock level is a quadratic function 
$[ms + (f/c)s^2]$ of the item’s forecasted average weekly sales. This level may appear 
in several of the planning model’s constraints as well as in the objective function. 

6. Distribution of Effort. In Chap. 10 you saw several illustrations of the so-called 
Distribution of Effort Problem. The decision variables were assumed to be 
integer-valued, and the constraints were limited to one or two, in order to make the 
dynamic programming approach computationally feasible. If you eliminate the 
stipulation of integer-valued variables, a nonlinear model can also be employed 
to solve a Distribution of Effort Problem. Then you can apply nonlinear programming 
techniques, which are computationally practical even when considerably 
more than two constraints are imposed. 

7. Probabilistic Elements. Nonlinearities frequently arise when a programming 
model’s coefficients are viewed as random variables. The two illustrations below 
are described in brief. 

Consider, first, an ordinary transportation model like the one in Secs. 2.7 and 
6.2. Suppose, however, that the demand $D_j$ at each Destination j is specified by 
a probability distribution, and that the actual values of all $D_j$ are not known until 
after the shipments are made. If a total amount of $x_j$ is shipped to Destination j, 
the amount remaining after demand occurs is $(x_j - D_j)$, where a negative value 
indicates the amount of unfilled demand. Suppose the cost $c_j(x_j - D_j)$ is assigned 
to the event $(x_j - D_j)$. Explain why this cost function would be nonlinear. The 
model’s objective function, then, is to minimize the sum of transportation costs 
plus the sum of expected values for $c_j(x_j - D_j)$. Accordingly, the objective function 
displays a nonlinear dependence on the decision variables $x_j$. This model is discussed 
in detail in Sec. 16.6. 
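
To see the nonlinearity numerically, consider the following Python sketch for a single destination. Everything specific in it is an assumption made for illustration: the cost is taken to be a charge of 1 per unit left over and 4 per unit of unfilled demand, and demand is 8, 10, or 12 with probabilities 0.25, 0.5, and 0.25. The text does not prescribe this form for the cost function, and the fuller treatment is the one in Sec. 16.6.

```python
HOLDING = 1.0    # assumed cost per unit remaining after demand occurs
SHORTAGE = 4.0   # assumed cost per unit of unfilled demand
DEMAND = {8: 0.25, 10: 0.5, 12: 0.25}   # assumed distribution of demand


def cost(leftover):
    """Cost assigned to the event (shipment - demand) = leftover."""
    return HOLDING * leftover if leftover >= 0 else SHORTAGE * (-leftover)


def expected_cost(shipment):
    """Expected value of the cost over the demand distribution."""
    return sum(prob * cost(shipment - d) for d, prob in DEMAND.items())


for shipment in (8, 10, 12, 14):
    print(shipment, expected_cost(shipment))
```

Equal increases in the amount shipped change the expected cost by unequal amounts, first lowering it and then raising it, so the term cannot be written as a constant times the amount shipped.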

The preceding example can be formulated as a transportation model in which 
several right-hand-side coefficients of the corresponding constraints are random. 
Now consider any linear programming model in which the $b_i$ coefficients in the 
linear inequality constraints (2) above are random. To illustrate, suppose the model 
has two constraints i = 1, 2 in (2), and that the coefficients $b_i$ are independently 
distributed, where $G_i(b)$ represents the probability that the random variable $b_i$ is 
at least as large as b. Suppose you want to select the $x_j$ so that the joint probability 
of every constraint being satisfied is at least $\beta$: 

(6)   $P[\sum_{j=1}^{n} a_{1j} x_j \le b_1] \cdot P[\sum_{j=1}^{n} a_{2j} x_j \le b_2] \ge \beta$   $(0 < \beta < 1)$. 

Then, as you will see in Sec. 16.5, the programming constraints equivalent to (6) 
can be written as 

(7)   $-y_i + \sum_{j=1}^{n} a_{ij} x_j = 0$   for i = 1, 2 

(8)   $G_1(y_1) \cdot G_2(y_2) \ge \beta$, 

where the product in (8) leads to a nonlinear restriction on $y_1$ and $y_2$. 
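
The joint probability can be evaluated directly once a distribution is chosen for each random right-hand side. The Python sketch below assumes, purely for illustration, that each $b_i$ is normally distributed; the text makes no such assumption, and the function names and the data in the test are ours. For each constraint it forms the left-hand-side total, finds the probability that the random right-hand side is at least that large, and multiplies the two probabilities.

```python
import math


def survival_normal(b, mean, std):
    """G(b): probability that a normal random variable is at least b."""
    return 0.5 * math.erfc((b - mean) / (std * math.sqrt(2.0)))


def joint_probability(x, a, means, stds):
    """Probability that every constraint is satisfied, given independent
    normally distributed right-hand sides (an assumed distribution)."""
    prob = 1.0
    for row, mean, std in zip(a, means, stds):
        y = sum(coef * xj for coef, xj in zip(row, x))   # left-hand-side total
        prob *= survival_normal(y, mean, std)            # one factor per constraint
    return prob
```

A trial solution is acceptable when the value returned is at least $\beta$; because the value is a product of two functions of the decision variables, the restriction is nonlinear even though each left-hand side is linear.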
*8. Portfolio Selection. Financial analysts in banks and insurance companies 
have devoted considerable attention to mathematical models that assist in managing 
portfolios of common stocks, bonds, and other securities. Inherent in such 
models is an assessment of a proposed portfolio’s expected gain and the associated 
risk, or probable variation, in the actual gain. A full discussion of how to build 
these models goes beyond the main purpose of this chapter, but the ensuing simple 
version suggests how nonlinear programming formulations have proved useful. 

Let $x_j$ represent the proportion of available funds to be allocated to Security j. 
Suppose that at the end of the planning horizon, $a_j$ is the actual (random) gain 
per dollar invested in Security j, and $\alpha_j$ the associated expected gain. Assume further 
that you stipulate b to be the lowest acceptable expected gain per dollar invested 
in the entire portfolio. Then the constraints of the model are 

(9)   $\sum_{j=1}^{n} x_j = 1$   and   $x_j \ge 0$   for $j = 1, 2, \ldots, n$ 

(10)   $\sum_{j=1}^{n} \alpha_j x_j \ge b$, 

where the left-hand side of (10) represents the expected gain per dollar invested, 
since the expected value of a sum equals the sum of expected values. 

The consideration of risk is introduced by means of the objective function. 
Suppose the $x_j$ are to be selected so as to minimize the variance of the actual gain, 
subject to (9) and (10). Then the objective function involves a quadratic form: 

(11)   minimize $\sum_{i=1}^{n} \sum_{j=1}^{n} \sigma_{ij} x_i x_j$ 

where 

(12)   $\sigma_{ij} = E\big([a_i - \alpha_i] \cdot [a_j - \alpha_j]\big)$ 

symbolizes the covariance of gain between Securities i and j. 
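
The pieces of this model are easy to evaluate for a trial portfolio. In the Python sketch below, the function names are ours and the two-security data used in the test (expected gains of 0.10 and 0.05, variances of 0.04 and 0.01, zero covariance) are hypothetical. The sketch only evaluates a given allocation; finding the minimizing allocation requires a nonlinear programming routine.

```python
def expected_gain(x, alpha):
    """Expected gain per dollar invested: the weighted sum of expected gains."""
    return sum(a * xj for a, xj in zip(alpha, x))


def variance_of_gain(x, sigma):
    """Variance of the actual gain: the quadratic form in the covariances."""
    n = len(x)
    return sum(sigma[i][j] * x[i] * x[j] for i in range(n) for j in range(n))


def is_feasible(x, alpha, b, tol=1e-9):
    """Proportions nonnegative and summing to 1, with expected gain at least b."""
    return (all(xj >= -tol for xj in x)
            and abs(sum(x) - 1.0) <= tol
            and expected_gain(x, alpha) >= b - tol)
```

With the hypothetical data, splitting funds equally gives a variance of 0.0125, well below the 0.04 obtained by placing everything in the first security, at the price of a lower expected gain; this is the tradeoff that a sensitivity analysis on b traces out.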

In more sophisticated versions of the model, there can be additional constraints 
on the composition of the portfolio, several time periods, and other measures of 
risk. Frequently, the computations for these models include a thorough sensitivity 
analysis showing the tradeoffs between expected gain and risk. Needless to say, 
these models must be applied cautiously since the values of $\alpha_j$ and $\sigma_{ij}$ are based 
largely on historical data, and hence may be subject to error when applied to 
future events. 


Summary. As you have seen, many different circumstances lead to nonlinear 
formulations of the constraints or objective functions in mathematical 
programming models. Naturally, you pay a price for introducing nonlinearities 
into a programming model. If the nonlinearities are few, or minor, the additional 
computational burden may be insignificant. Then your only inconvenience is to 
find a computer routine capable of solving the nonlinear model. Otherwise, the 
added complexity may substantially increase the amount of computations necessary 
to find a solution, and you may have to curtail the number of constraints and 
variables comprising the model. 


In any particular situation, you must gain a proper perspective on an appropriate 
size and complexity for the model by assessing its intended impact on 
decisions. For instance, you may find that you need to employ only a moderate-size 
nonlinear programming model to obtain useful information about the region 
containing an optimal solution. Then you can apply a larger and more detailed 
linear model, making approximations based on the previous answers. This two-step 
approach is especially helpful if the context of the application is strategic 
planning rather than day-to-day operating decisions. 

14.2 DIRECTION AND FOCUS 

As elsewhere in the book, this and the next chapter concentrate on the subject’s 
fundamental concepts. Here we set the stage for the technical discussion to follow. 

Selection of techniques. The underlying theory of nonlinear programming 
is still evolving, and computational experience with specific algorithms is 
limited. Hence, it would be rash to single out two or three approaches, call them 
eminently successful, and claim they are as important for nonlinear programming 
as the simplex method is for linear programming. Practice has indicated that, 
although it may be possible to discard some suggested techniques as inferior, there 
most likely will never be only two or three methods that are superior for all 
nonlinear programming problems. So you should consider the algorithms presented 
in these chapters to be merely a limited selection of solution techniques. 

A systematic comparison between methods is complicated by the fact that a 
nonlinear method can be very effective for one type of problem and yet fail miserably 
for another. Furthermore, the success of an algorithm in a particular problem 
can depend significantly on the model formulation itself. For example, transforming 
the measurement scales of the variables can dramatically affect an algorithm’s 
rate of convergence! (This phenomenon is sometimes called form sensitivity.) And 
lastly, a relative evaluation of algorithms must compare the computational burden 
of methods requiring only a few iterations, but each involving many calculations, 
with those taking many iterations, but each involving only a few calculations. 
Such comparisons also hinge, in part, on the characteristics of the specific computer 
to be used. For these reasons, this text does not contain relative evaluations of the 
techniques. 

In sharp contrast with linear and dynamic programming techniques, most of 
the nonlinear programming algorithms that are easy to motivate and explain are 
so computationally inefficient as to be impractical. And, unfortunately, the 
algorithms that have proved most powerful in practice are too intricate to include 
in an introductory text. Therefore, in trying to give a presentation of nonlinear 
programming techniques that is honest and yet readable by beginners, we have 
selected methods that can be explained in elementary terms, and, if appropriately 
modified for an electronic computer, can be made efficient. But we leave such 
modifications, which are indeed essential for successful application to actual 
problems, to texts that specialize in numerical analysis. Also, we only emphasize 
techniques that are effective for solving medium- to large-scale problems, and 
give almost no attention to those procedures that might work well if the model 
contained merely a few variables and constraints. 

Crux of the computational problems. We suggest that you take 
a few minutes to review the discussion of the algorithmic method given in Sec. 4.3 
(pp. 99-102). Four interrelated characteristics were highlighted as being descriptive 
of an algorithm: 

• Completeness 

• Domain of applicability 

• Convergence properties 

• Computational requirements. 

Most of these characteristics are considerably harder to ascertain for nonlinear 
programming techniques than with linear programming algorithms. Such considerations 
preoccupy mathematicians investigating the theory of nonlinear 
programming. But apart from the theoretical questions, there are computational 
problems in applying nonlinear techniques to a specific situation. 

For example, suppose you employ an algorithm that does completely specify 
how to proceed, step by step. You may find that there are mathematical conditions 
(either necessary, or sufficient, or both) to be satisfied by a model’s objective 
function and constraints in order to guarantee that the algorithm converges. 
But in a real application, these conditions may be too complex to verify. You may 
actually have to try the algorithm to determine whether it will solve the problem. 

Further, you may know that the technique converges for an easy-to-recognize 
class of problems, yet you may not have a very good prediction of the number of 
iterations that will be required to obtain a usable answer. 

What is more, you may sometimes find it advantageous to apply a method that 
is indeed theoretically shaky—the algorithm may not always converge to a correct 
answer. But in practice, this method may work quite well, often converging to a 
truly optimal solution in relatively few iterations. 

► Research in the development of nonlinear programming techniques is advancing on 
two fronts. At the theoretical level, research is concentrated on establishing weak, and 
hopefully verifiable, conditions on the objective function and constraints that are 
sufficient to guarantee that an algorithm converges to the optimal value of the objective 
function, regardless of the starting point. Research at the computational level is focused 
on how well an algorithm solves actual problems. The nub of the difficulties at both 
of these levels is that most nonlinear programming methods do not always converge in 
a finite number of iterations (unless the starting point happens to be optimal). A technique 
may ensure that the objective function strictly improves from one iteration to 
the next. But even if the method is known to converge, in an actual application the trial 
solutions may oscillate wildly as the objective function improves. Moreover, unless the 
technique has been shown to be theoretically sound, the objective function may converge 
to a value that is not optimal. (This fact is demonstrated in Sec. 14.4 below.) ◄ 

Organization of topics. The next few sections lay the necessary 
groundwork by explaining how to find an optimal value for a function of a single 
variable, and, afterwards, for a function of several unconstrained variables. As 
you will discover, most of the fundamental notions for obtaining an unconstrained 
optimal solution need only to be suitably embellished in order to handle the 
imposition of constraints. 

The rest of this chapter deals mainly with techniques for solving optimization 
models having linear constraints and special, but important, types of nonlinear 
objective functions. Several algorithms for more general nonlinear programming 
models are presented in Chap. 15. It is noteworthy that many of the methods 
explained in both chapters for treating constrained problems turn out to employ 
the solution of related linear programming problems as part of the computations. 
Chapter 15 also provides the mathematical theory for nonlinear programming that 
corresponds to the notion of duality in linear programming. 

14.3 OPTIMIZING A NONLINEAR FUNCTION OF 
A SINGLE VARIABLE 

There are two motivating reasons for studying how to optimize a function of 
only a single variable. The first is that one-dimensional problems can illustrate 
many of the difficulties that must be handled in nonlinear optimization. There 
is no need to resort to multidimensional examples, which are sometimes less 
apparent, to explain these new problems. The second reason is that many of the 
techniques for solving several-variable nonlinear optimization problems actually 
employ single-variable optimization in one of the steps. You will have to know 
methods for one-dimensional optimization, therefore, in order to apply the approaches 
designed for more general optimization problems. 

To begin, it is convenient to postulate “maximization” as the sense of optimization 
throughout the following discussion. [If the real problem is to minimize an 
objective function f(x), then you can reformulate the model so as to maximize 
−f(x).] 

Consider, first, the problem of maximizing a linear function c(x) = c_0 + c_1 x 
of a single variable x. The answer to this problem is so trivial that it was not even 
discussed in the chapters on linear programming, but we raise it here as a point 
of departure for the treatment of nonlinear optimization. If x is completely 
unconstrained, then the objective function can be made arbitrarily large for any 
c_1 ≠ 0. (State how.) If, instead, x is constrained to be in the interval [a_1, a_2] (that 
is, a_1 ≤ x ≤ a_2), then x = a_1 is optimal if c_1 < 0, and x = a_2 is optimal if c_1 > 0. 
Hence an end-point or extreme-point solution is always optimal. As soon as 
you drop the assumption that c(x) is linear, other situations can occur, and several 
are illustrated in Fig. 14.1. 

One possibility is that c(x) may reach its maximum for a finite value of x, even 
if x is completely unconstrained. This happens in Case 2 in Fig. 14.1, where x = 2 
is optimal. And even if x is constrained to be in the interval [a_1, a_2], an 
extreme-point solution may not be optimal. To illustrate, if x must be in the 
interval [0, 3], then the optimal x in Case 2 does not occur at either extreme 
point of the interval, x = 0 or x = 3, but inside the interval at x = 2, as before. 

Figure 14.1. Illustrations of Nonlinear Functions. 

In Case 3, as in Case 1, the value of c(x) grows without bound provided x is 
unconstrained. Unlike Case 1, an arbitrarily large value for c(x) can now be achieved 
by letting x be either sufficiently large or small. Case 4 shows that it is also possible 
that c(x) can be made arbitrarily large by values for x that are constrained, 
say, to be in the interval [0, 1]. Here there is a discontinuity in c(x) at x = 1; 
by letting x approach 1 from below, c(x) grows without bound. 

Finally, c(x) may be bounded from above, and yet there may be no finite value 
for x that maximizes c(x). This occurs in Case 5, where x is unrestricted. Notice 
c(x) is less than 1 for all finite x, but can be made arbitrarily close to 1 by letting 
x be sufficiently large. [The value 1 is called the supremum of c(x), which is 
defined as the least upper bound for c(x).] A second example is Case 6. How 
large can you make c(x) here? 

To summarize, it is possible that c(x) grows without bound, irrespective of 
whether x is constrained, and that c(x), although bounded from above, does not 
attain a maximum, also irrespective of whether x is constrained. Any exhaustive 
treatment of algorithmic approaches for maximizing a nonlinear function of a 
single variable would have to take account of all these various possibilities. But 
we find it convenient for the purpose of this chapter to assume: 

(i) x lies within a finite interval I = [a_1, a_2], that is, a_1 ≤ x ≤ a_2, 
where −∞ < a_1 < a_2 < ∞. 

(ii) c(x) is continuous and bounded from above for all x in the interval I. 

An important implication of these two assumptions is that c(x) does attain its 
maximum value for a value of x in the interval I. 

To test your understanding, explain why assumption (ii) does not permit I to 
include x = 1 in Case 4, or x > 1 in Case 5, or x < 1 in Case 6. Also observe in 
Fig. 14.1 how the shape of c(x) and the specification of I affect the determination 
of an optimal x. For example, in Case 3, an optimal x is always one of the end 
points x = a_1 or x = a_2, whereas such an extreme-point solution is optimal in Case 2 
only if x = 2 is not contained in the interior of I. These observations suggest that 
an algorithm for optimizing c(x) over x in I should exploit the orientation and shape 
characteristics of c(x). 

Consider first what happens when c(x) is monotonic over the interval I. Then 
c(x) always attains its maximum over x in I at an extreme point, either x = a_1 
or x = a_2. The same is also true when c(x) is a convex function for x in I, that is, 
if, for any x_1 and x_2 in I, where x_1 < x_2, and for all p, 0 ≤ p ≤ 1, c(x) satisfies 

(1)    p c(x_1) + (1 − p) c(x_2) ≥ c(p x_1 + (1 − p) x_2)    Convex. 

Verify that c(x) in Case 3 is a convex function, and so is always maximized at an 
extreme point of I. 

► More generally, c(x) always attains its maximum at an extreme point of I when c(x) 
is a quasi-convex function over I, that is, if, for any x_1 and x_2 in I, where x_1 < x_2, 
and for all p, 0 ≤ p ≤ 1, c(x) satisfies 

(i)    maximum [c(x_1), c(x_2)] ≥ c(p x_1 + (1 − p) x_2)    Quasi-convex. 

If c(x) is either monotonic or convex, then (i) holds. Two examples of quasi-convex 
functions that are neither monotonic nor convex are the following modifications of c(x) 
in Case 3. The first is c(x) = x^2 − 4x + 3 for x ≤ 3, and c(x) = 0 for x > 3. The second 
is c(x) = x^2 − 4x + 3 for x ≠ 2 and c(2) = −2. Draw these two examples, and verify 
that (i) holds. ◄ 

If the shape of c(x) does not ensure that an extreme-point solution is optimal, 
then you must also search the interior of the interval I for an optimal x. To do 
this, you can divide I into a grid of equally spaced points, and test each of these 
points. If c(x) is “ill-behaved,” the best obtained value for x may be far from an 
optimal x, because the search is like trying to find the proverbial “needle in a 
haystack.” But even if the function is well behaved (if it does not vary wildly 
within small intervals of x), you can usually improve on such a grid procedure 
by employing a so-called adaptive search technique. Instead of prespecifying a 
sequence of values of x to be tried, you use the information from the previously 
tested x values to determine the next trial value for x. This means you must be 
able to make a valid inference about the possible values for an optimal x given the 
limited information about c(x) furnished by previous trials. This notion is explored 
next. 
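
The grid procedure described above can be sketched in a few lines. The function name `grid_search` and the tent-shaped test function c(x) = min(x, 12 − x) are assumptions made only for illustration; the sketch simply evaluates c(x) at equally spaced points and keeps the best one.

```python
def grid_search(c, a1, a2, n_points):
    """Evaluate c at n_points equally spaced points of [a1, a2]; return the best.

    Returns (best_x, best_value). Requires n_points >= 2.
    """
    best_x, best_value = a1, c(a1)
    for k in range(1, n_points):
        x = a1 + (a2 - a1) * k / (n_points - 1)
        value = c(x)
        if value > best_value:
            best_x, best_value = x, value
    return best_x, best_value


def tent(x):
    # Hypothetical test function: rises with slope 1 up to x = 6, then falls.
    return min(x, 12.0 - x)


print(grid_search(tent, 0.0, 10.0, 11))   # grid 0, 1, ..., 10 contains x = 6
print(grid_search(tent, 0.0, 10.0, 4))    # a coarse grid misses the peak
```

Note that every grid point costs one evaluation of c(x), whether or not earlier evaluations already showed that the point lies in an unpromising region. An adaptive search avoids that waste.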

Adaptive search. Consider the function c(x) shown in Fig. 14.2. The 
maximum value for c(x) is at x = f. As you might imagine, an adaptive procedure 
for maximizing c(x) is not likely to work well, except by accident, on a function 
of this sort. To illustrate, suppose the first trial value of x is less than a. Then it 
is possible that the successive values for x would get closer and closer to a, but 
never beyond a, since c(a) is a local maximum. Suppose, instead, that the 
first trial value for x is in the interval [d, e]. Then an adaptive method is likely 
to terminate, for there would not seem to be any possible improvement in c(x) 
by making x either a little smaller or larger. The adaptive methods to be explained 
below cannot be guaranteed to eventually narrow the search to a small interval 
around f in Fig. 14.2. But c(x) taken over the interval [0, g] is really a more general 
situation than we need to consider for the purpose at hand, namely, to provide 
adaptive search procedures for single-variable optimization that are to be used 
in multidimensional nonlinear optimization problems. Hence, we impose an 
additional postulate on the shape of c(x). 

Figure 14.2. Example with Multiple Local Optima. 

Specifically, consider a so-called unimodal function with the following properties: 
if \bar{x} is the unique optimal value of x over the interval I, then c(x) strictly 
increases for x < \bar{x} and strictly decreases for x > \bar{x}. If the maximum value of 
c(x) = \bar{c} over I occurs for an interval of values for x, then c(x) strictly increases 
to \bar{c}, remains at the value \bar{c} over the interval of optimal x, and then strictly 
decreases. A formal and more precise definition is the following: a function c(x) 
is unimodal over the interval I if, for any x_1 and x_2 in I, where x_1 < x_2 and 
c(x_1) < c(\bar{x}) = \bar{c}, c(x) satisfies 

(2)    c(x_1) < c(x_2)    for x_1 < x_2 ≤ \bar{x} 
                                                        Unimodal. 
       c(x_1) > c(x_2)    for \bar{x} ≤ x_1 < x_2 


A function is unimodal whenever it is concave, that is, if for any x_1 and x_2 in I, 
where x_1 < x_2, and for all p, 0 ≤ p ≤ 1, c(x) satisfies 

(3)    p c(x_1) + (1 − p) c(x_2) ≤ c(p x_1 + (1 − p) x_2)    Concave. 

If c(x) possesses a well-defined second derivative d^2c/dx^2, then an equivalent 
condition for concavity is d^2c/dx^2 ≤ 0 for all x. [If d^2c/dx^2 < 0 for all x, then c(x) is 
strictly concave.] Assume from here on that, in addition to (i) and (ii), 


(iii) c(x) is unimodal for x in the interval I. 


Check that assumption (iii) rules out [b, g] as an interval I in Fig. 14.2, but allows 
the intervals [b, e] and [e, g]. Now you will see how adding assumption (iii) permits 
an adaptive technique for finding an interval of prespecified width that contains 
a value of x maximizing c(x). 

Suppose you know that an optimal value of x lies in the interval [A_1, A_2], and 
you have evaluated c(x) at two points, x_LFT and x_RT, in this interval, where 
x_LFT < x_RT (the first point is on the “left” and the second on the “right”). If 
c(x_LFT) ≥ c(x_RT), then c(x) cannot increase for x > x_RT, and so an optimal x 
must lie in the interval [A_1, x_RT]. Make a sketch and explain why. If c(x_LFT) < 
c(x_RT), then c(x) cannot increase for x < x_LFT, and so an optimal x must lie in the 
interval [x_LFT, A_2]. Explain why. In either case, the comparison of c(x_LFT) and 
c(x_RT) has reduced the length of the interval in which you know an optimal value 
of x must lie. As you would imagine, there are numerous adaptive search procedures 
you can devise to exploit this inferential property of a unimodal function. 

The following approach, sometimes termed the Method of Golden Sections, 
has the advantage of being both simple and fairly efficient. (The approach 
is closely related to what is named the Fibonacci Search Method.) Define 

(4)    r = .5(\sqrt{5} − 1) = .618033 .... 

The significance of this constant will be made clear after giving the algorithm for 
maximizing c(x), where x lies in the interval I = [a_1, a_2]. 

Step 1. Let A_1 = a_1, A_2 = a_2, H = A_2 − A_1, x_LFT = A_1 + r^2 H, and x_RT = A_1 + rH. 

Step 2. Compare c(x_LFT) with c(x_RT). Go to Step 3 or Step 4, whichever is 
appropriate. 

Step 3. If c(x_LFT) ≥ c(x_RT), let A_2 = x_RT and H = A_2 − A_1. Stop if H is 
sufficiently small. Otherwise, let the new x_RT be the previous x_LFT, and let the new 
x_LFT = A_1 + r^2 H. Return to Step 2. 

Step 4. If c(x_LFT) < c(x_RT), let A_1 = x_LFT and H = A_2 − A_1. Stop if H is 
sufficiently small. Otherwise, let the new x_LFT be the previous x_RT, and let the new 
x_RT = A_1 + rH. Return to Step 2. 
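
The four steps above can be sketched as a short routine. This is a minimal sketch, not a production implementation: the function name `golden_sections`, the tolerance argument `tol` (standing in for "H is sufficiently small"), the iteration cap, and the tent-shaped test function are all assumptions introduced for illustration.

```python
import math

R = 0.5 * (math.sqrt(5.0) - 1.0)   # the constant r = .618033...


def golden_sections(c, a1, a2, tol, max_iter=200):
    """Narrow [a1, a2] around a maximizer of a unimodal function c.

    Returns (A1, A2, evaluations), where evaluations counts calls to c.
    """
    # Step 1: initial interval and the two interior trial points.
    A1, A2 = a1, a2
    H = A2 - A1
    x_lft, x_rt = A1 + R * R * H, A1 + R * H
    c_lft, c_rt = c(x_lft), c(x_rt)
    evaluations = 2
    for _ in range(max_iter):
        # Step 2: compare the two trial values.
        if c_lft >= c_rt:
            # Step 3: an optimal x cannot lie to the right of x_rt.
            A2 = x_rt
            H = A2 - A1
            if H <= tol:
                break
            x_rt, c_rt = x_lft, c_lft          # reuse the old left point
            x_lft = A1 + R * R * H
            c_lft = c(x_lft)
        else:
            # Step 4: an optimal x cannot lie to the left of x_lft.
            A1 = x_lft
            H = A2 - A1
            if H <= tol:
                break
            x_lft, c_lft = x_rt, c_rt          # reuse the old right point
            x_rt = A1 + R * H
            c_rt = c(x_rt)
        evaluations += 1                        # one new value of c per pass
    return A1, A2, evaluations


def tent(x):
    # Hypothetical unimodal test function with its maximum at x = 6.
    return min(x, 12.0 - x)


print(golden_sections(tent, 0.0, 10.0, tol=2.5))
```

The sketch carries the function values along with the trial points, so each pass after the first costs only one new evaluation of c(x).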

Given the value of r specified in (4), the algorithm has three important properties. 
First, after each iteration, the new value for H is r times the previous 
value for H. Hence, after n iterations, you can infer that an optimal x lies in a 
specified interval of width r^n(a_2 − a_1). A short table of r^n is given in Fig. 14.3. 

Second, at each iteration, x_LFT and x_RT are the same distance from their respective 
interval end points; that is, x_LFT − A_1 = A_2 − x_RT. 

Third, in performing Step 2 (after the first 
iteration), only one of the points x_LFT and x_RT 
is really new, so there is only a single additional 
value of c(x) to compute. 

An example of the algorithm appears in Fig. 
14.4; the value x = 6 is the unique optimal 
solution. The initial interval is [a_1, a_2] = [0, 10]. 
Observe that by iteration 2, the interval has 
narrowed to [A_1, A_2] = [3.82, 7.64]. The trial 
value x_LFT = 5.28 is 1.46 (= 5.28 − 3.82) above 
A_1, and the trial value x_RT = 6.18, which was 
already examined at iteration 1, is 1.46 (= 7.64 
− 6.18) below A_2. Verify that if you complete 
a fourth iteration, the new value for x_RT to be 
evaluated at Step 2 is 6.74. Since a_2 − a_1 = 10, 
the interval [A_1, A_2] at the end of the nth 
iteration has length r^n · 10, where r^n is exhibited 
in Fig. 14.3. 
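
The reduction factors referred to here are easy to regenerate. The sketch below tabulates r^n alongside (.5)^{n+1}, the corresponding factor for interval bisection; the function name `reduction_factors` is an assumption made for illustration.

```python
import math

R = 0.5 * (math.sqrt(5.0) - 1.0)   # r from (4)


def reduction_factors(n_max):
    """Return rows (n, r**n, 0.5**(n + 1)) for n = 1, ..., n_max."""
    return [(n, R ** n, 0.5 ** (n + 1)) for n in range(1, n_max + 1)]


for n, golden, bisection in reduction_factors(10):
    print(f"{n:2d}   {golden:.4f}   {bisection:.4f}")
```

The constant r also satisfies r^2 = 1 − r, which is why the two trial points sit symmetrically inside the interval and why one of them can be reused at the next iteration.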

► Note that the adaptive procedure only guarantees information about an interval in 
which an optimal x must lie. Except by accident, it does not find an optimal x in a finite 
number of iterations. In fact, if the calculations maintain accuracy at a finite number 
of decimal places, then the method may never be able to determine \bar{x} exactly. For example, 
if c(x) = −.25x^4 + 2x and I = [1, 2], then the unique optimal x is \bar{x} = 2^{1/3} = 
1.2599 . . ., which can never be determined precisely with only a finite number of 
decimals. 

     n      r^n      (.5)^{n+1} 

     1     .6180      .25 
     2     .3820      .125 
     3     .2361      .0625 
     4     .1459      .0313 
     5     .0902      .0156 
     6     .0557      .0078 
     7     .0344      .0039 
     8     .0213      .0020 
     9     .0132      .0010 
    10     .0081      .0005 

    r = .5(\sqrt{5} − 1) 

Figure 14.3. Method of Golden Sections and Interval Bisection. 

Furthermore, the method provides no information about the maximum value of c(x). 
[Given that you are going to terminate the search as soon as the interval H is below a 
specified width, you can always construct a function for which the maximum value of 
c(x) is arbitrarily larger than the best c(x) you have obtained during the search.] But 
suppose that you know c(x) is actually concave and has well-defined derivatives at 
x = A_1 and x = A_2, where A_1 < A_2. In this case you can calculate an upper bound 
on c(x): 


(ii)    c(x) ≤ c(A_1) + \frac{dc(A_1)}{dx} \cdot \frac{c(A_2) − c(A_1) − (A_2 − A_1)\, dc(A_2)/dx}{dc(A_1)/dx − dc(A_2)/dx}    Concave 


for any x in I. To check your understanding of (ii), verify that this bound in the example 
for Fig. 14.5 is c(x) ≤ 6 at every iteration. 
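
One way to obtain an upper bound of this kind for a concave function is to intersect the tangent lines drawn at the two end points, since a concave function never rises above any of its tangents. The sketch below follows that construction; it is an illustration under that assumption rather than a transcription of the text's formula, and the names `tangent_upper_bound`, `c`, and `dc` are hypothetical.

```python
def tangent_upper_bound(c, dc, A1, A2):
    """Upper bound on a concave c over [A1, A2] from the two end-point tangents.

    c  -- the concave function
    dc -- its derivative, assumed well defined at A1 and A2
    Requires dc(A1) > dc(A2), so that the two tangent lines cross.
    """
    s1, s2 = dc(A1), dc(A2)
    if s1 <= s2:
        raise ValueError("tangents do not cross: need dc(A1) > dc(A2)")
    # Point where the tangent at A1 meets the tangent at A2.
    x_cross = (c(A2) - c(A1) + s1 * A1 - s2 * A2) / (s1 - s2)
    # Height of the crossing point; c cannot exceed it anywhere.
    return c(A1) + s1 * (x_cross - A1)


# Example from the text: c(x) = -.25 x^4 + 2x on the interval [1, 2].
c = lambda x: -0.25 * x ** 4 + 2.0 * x
dc = lambda x: -x ** 3 + 2.0
print(tangent_upper_bound(c, dc, 1.0, 2.0), c(2.0 ** (1.0 / 3.0)))
```

The bound tightens as the interval [A_1, A_2] shrinks, so it can be recomputed at each iteration to judge how far the best value found so far might be from the true maximum.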

An alternative adaptive search procedure, sometimes called the Method of 
Interval Bisection, can be devised that employs information about the value of 
derivatives of c(x) for a nonoptimal x. The technique exploits the fact that for a unimodal 
function the derivative of c(x) never changes sign more than once. Suppose that c(x) is 
not simply monotonic. Hence the derivative of c(x) is nonnegative at x = a_1 and remains 
so, as x increases, until c(x) reaches its maximum value; then for further increases in x, 
the derivative is nonpositive. Therefore, an optimal x occurs where the derivative changes 
sign; the approach repeatedly narrows the intervals in which this sign change must 
occur. [The method is also valid when c(x) is monotonic.] 

In order to use the approach, you must determine the formula for the derivative 
dc/dx and assume, in addition, that the shape of c(x) is such that dc/dx is both well defined 
and equal to 0 only for optimal x. (It is easy to devise unimodal functions over a finite 
interval I that do not satisfy these shape assumptions. Such examples include functions 
that, for nonoptimal x, are piecewise linear, or have an inflection point, or have a point 
with an unbounded value for dc/dx.) Each application of the bisection method halves 
the length of the interval in which an optimal x must lie. After n + 1 calculations of the 
derivative dc/dx, the interval is (.5)^{n+1}(a_2 − a_1), whereas after n + 1 evaluations of c(x), 
the Method of Golden Sections has reduced the interval to only r^n(a_2 − a_1). Thus the 
factor (.5)^{n+1} is to be compared with r^n and is shown in Fig. 14.3. There may be an 
offsetting computational disadvantage, however, if dc/dx is a more complex expression 
than c(a) [for example, when c{x) = x-«*]. 

The idea of the algorithm is that if dc/dx > 0 at a value x_trial, then there is an optimal 
x > x_trial. Similarly, if dc/dx < 0, then there is an optimal x < x_trial. Specifically, the 
technique is 

Step 1. Let A_1 = a_1, A_2 = a_2, and H = A_2 − A_1. 

Step 2. Let x_trial = A_1 + .5H. Compute dc/dx at x_trial. Stop if dc/dx = 0, for x_trial is 
optimal. Otherwise, go to Step 3 or Step 4, whichever is appropriate. 

Step 3. If dc/dx < 0, let A_2 = x_trial and H = x_trial − A_1. Stop if H is sufficiently small. 
Otherwise, return to Step 2. 

Step 4. If dc/dx > 0, let A_1 = x_trial and H = A_2 − x_trial. Stop if H is sufficiently small. 
Otherwise, return to Step 2. 
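
The bisection steps above translate almost line for line into code. In this sketch the function name `interval_bisection`, the tolerance `tol` (standing in for "H is sufficiently small"), the iteration cap, and the derivative of a tent-shaped test function are assumptions made for illustration.

```python
def interval_bisection(dc, a1, a2, tol, max_iter=200):
    """Narrow [a1, a2] around a maximizer using only the sign of dc/dx.

    Returns (A1, A2, evaluations), where evaluations counts calls to dc.
    """
    # Step 1: initial interval.
    A1, A2 = a1, a2
    H = A2 - A1
    evaluations = 0
    while evaluations < max_iter:
        # Step 2: evaluate the derivative at the midpoint.
        x_trial = A1 + 0.5 * H
        slope = dc(x_trial)
        evaluations += 1
        if slope == 0:
            return x_trial, x_trial, evaluations   # x_trial is optimal
        if slope < 0:
            A2 = x_trial        # Step 3: an optimal x lies to the left
        else:
            A1 = x_trial        # Step 4: an optimal x lies to the right
        H = A2 - A1
        if H <= tol:
            break
    return A1, A2, evaluations


def tent_slope(x):
    # Derivative of the hypothetical test function min(x, 12 - x), away from x = 6.
    return 1.0 if x < 6.0 else -1.0


print(interval_bisection(tent_slope, 0.0, 10.0, tol=0.625))
```

Only the sign of the derivative is used, never its magnitude, which is why the interval is cut exactly in half at every evaluation.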
The Method of Interval Bisection is illustrated in Fig. 14.5 for the same example as 
in Fig. 14.4. After the fourth evaluation of dc/dx, the interval length for an optimal x 
has been reduced to .625, as compared with an interval of length 2.36 after four evaluations 
of c(x) by the Method of Golden Sections. 



Iteration 
    n      Step    Calculations 

    0       1      A_1 = 0, A_2 = 10, H = 10, x_LFT = 3.82, x_RT = 6.18 

    1       2      c(3.82) = 3.82, c(6.18) = 5.82 
            4      A_1 = 3.82, (A_2 = 10), H = 10 − 3.82 = 6.18, x_LFT = 6.18, 
                   x_RT = 3.82 + (.618)6.18 = 7.64 

    2       2      c(6.18) = 5.82, c(7.64) = 4.36 
            3      (A_1 = 3.82), A_2 = 7.64, H = 7.64 − 3.82 = 3.82, x_RT = 6.18, 
                   x_LFT = 3.82 + (.382)3.82 = 5.28 

    3       2      c(5.28) = 5.28, c(6.18) = 5.82 
            4      A_1 = 5.28, (A_2 = 7.64), H = 7.64 − 5.28 = 2.36 

Figure 14.4. Example of Method of Golden Sections. 

Iteration 
    n      Step    Calculations 

    0       1      A_1 = 0, A_2 = 10, H = 10 

    1       2      x_trial = 5, dc/dx = 1 > 0 
            4      A_1 = 5, (A_2 = 10), H = 10 − 5 = 5 

    2       2      x_trial = 5 + .5(5) = 7.5, dc/dx = −1 < 0 
            3      (A_1 = 5), A_2 = 7.5, H = 2.5 

    3       2      x_trial = 5 + .5(2.5) = 6.25, dc/dx < 0 
            3      (A_1 = 5), A_2 = 6.25, H = 1.25 

    4       2      x_trial = 5 + .5(1.25) = 5.625, dc/dx > 0 
            4      A_1 = 5.625, (A_2 = 6.25), H = .625 

Figure 14.5. Example of Method of Interval Bisection. 

14.4 MAXIMIZING A NONLINEAR FUNCTION OF 
SEVERAL UNCONSTRAINED VARIABLES 

This and the next section show how to maximize a real-valued function 
c(x_1, x_2, . . . , x_n), where each x_j is permitted to have any real value, positive or 
negative. (This assumption on the x_j is frequently expressed by saying that the set 
of values x_1, x_2, . . . , x_n lies in Euclidean n-space, abbreviated E^n, or equivalently, 
that each x_j lies on the real line, abbreviated R^1.) There are two motivating 
reasons for studying this problem. First, an analysis of the multidimensional, 
unconstrained, nonlinear maximization problem sets the stage for the analyses 
of constrained models. The algorithmic difficulties to be overcome here are also 
present in the constrained cases, and the techniques presented below can be 
suitably modified when constraints are imposed. Second, a constrained problem 
can often be solved by first converting it to an unconstrained problem. An algorithm 
utilizing this approach is treated in Sec. 15.9. 

As in the preceding section, in order to guarantee that the solution techniques 
are valid, we have to impose certain assumptions on the objective function. Loosely 
put, we postulate that c(x_1, x_2, . . . , x_n) is smooth and possesses a finite maximum 
value, occurring at the finite values \bar{x}_1, \bar{x}_2, . . . , \bar{x}_n. Abbreviating a set of values 
for x_1, x_2, . . . , x_n by the symbol x, and the expression c(x_1, x_2, . . . , x_n) by the symbol 
c(x), these assumptions can be stated more precisely as: 

(i) For all values of x, c(x) is uniquely defined and finite. 

(ii) For all values of x, every partial derivative \partial c(x)/\partial x_j is uniquely defined, 
finite, and continuous, and hence c(x) is continuous. 

(iii) c(x) possesses a finite maximum \bar{c}. 

(iv) For any possible value of c(x), say c, there exists an associated finite number 
M_c such that every |x_j| ≤ M_c if c(x) ≥ c. 

You will find it useful to copy these four assumptions, as we repeatedly refer to 
them throughout the rest of the chapter. 

Given the first three assumptions, (iv) implies that c(x) takes on its maximum 
value \bar{c} for finite values of \bar{x}_1, \bar{x}_2, . . . , \bar{x}_n. Actually, assumption (iv) implies an 
even stronger property: given any possible value c for c(x), larger values for c(x) 
occur only when each x_j lies within a finite interval, that is, −M_c ≤ x_j ≤ M_c 
for every j (where M_c < ∞ and depends on c). 

To see the force of these assumptions, examine Case 5 in Fig. 14.1. Such a 
c(x) is ruled out, first by assumption (ii), since dc/dx is discontinuous at x = 1. 
But even if c(x) is modified so that dc/dx is continuous, keeping c(x) < 1 for all x, 
the case would still be ruled out by assumption (iii) because c(x) has no maximum. 
[Although c(x) < 1 for all x, there is no finite x such that c(x) = 1.] And what is 
more, even if r(x) is further modified to possess a finite maximum—for example, 
c = 3 at X = ^— the case would remain excluded by assumption (iv). To see why, 
suppose r = in (iv); then c{x) > -J- for all x > 2.5. Hence there does not exist 
an Me postulated in assumption (iv). 

► By employing mathematical terminology, we can state the four assumptions more 
succinctly. First, assumption (ii) can be reworded to read that c(x) must be continuously 
differentiable; this implies assumption (i) and that c(x) is continuous. Second, we stipulate 
that for any x′, the set of x such that c(x) ≥ c(x′) is compact (closed and bounded); this 
implies assumption (iv). Since a continuous function possesses a finite maximum over 
a compact set, the two succinct postulates imply assumption (iii). ◄ 

Applying differential calculus, we can state the following. 

Necessary Condition for a Maximum. Given assumptions (i) through (iii), the 
function c(x) has a maximum at \bar{x} only if \partial c(\bar{x})/\partial x_j = 0, for j = 1, 2, . . . , n. 

The validity of the result is easy to see. Suppose there is a variable x_j such that 
\partial c(\bar{x})/\partial x_j > 0. Then c(x) can be increased by increasing x_j by a small amount. 
Analogously, if \partial c(\bar{x})/\partial x_j < 0, then c(x) can be increased by decreasing x_j by a 
small amount. 

What are the computational implications of the necessary condition? You can 
conclude immediately that a specified set of values \bar{x}_1, \bar{x}_2, . . . , \bar{x}_n does not optimize 
c(x) unless all \partial c(\bar{x})/\partial x_j = 0. But, unfortunately, without imposing further 
restrictions on the shape of c(x), the necessary condition is not sufficient for a maximum: 
\bar{x} may not maximize c(x) when all \partial c(\bar{x})/\partial x_j = 0. The illustration in Fig. 14.2 
shows why. The derivative dc/dx = 0 at Points a, b, c, d, e, as well as at f, which 
gives the only global maximum. 

Suppose, for a moment, you can assert that the necessary condition is also 
sufficient for a specified function c(x). Would knowing that an x is optimal if and 
only if all \partial c(x)/\partial x_j = 0 make it a trivial matter to solve for \bar{x}? Probably not, 
because typically the conditions \partial c/\partial x_j = 0, for j = 1, 2, . . . , n, necessitate 
solving a system of n simultaneous nonlinear equations, which can be as difficult 
a computational task as maximizing c(x). [In fact, a frequently used procedure for 
solving a set of nonlinear equations a_i(x) = 0, for i = 1, 2, . . . , m, is to solve, 
instead, the associated optimization problem: minimize \sum_{i=1}^{m} [a_i(x)]^2.] Note, 
however, that when c(x) is a quadratic function, the conditions \partial c/\partial x_j = 0, for j = 
1, 2, . . . , n, give rise to a system of n simultaneous linear equations, which usually 
are easily solved. In the more general case, we find an optimal x by an iterative 
procedure of the type explained below. 
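
To illustrate the remark about quadratic functions, the sketch below takes a hypothetical objective, c(x_1, x_2) = −x_1^2 − x_1 x_2 − 2x_2^2 + 4x_1 + 6x_2, writes out the two linear equations obtained by setting its partial derivatives to zero, and solves them by Cramer's rule. The objective, the helper `solve_2x2`, and the function `gradient` are assumptions chosen only for illustration.

```python
def solve_2x2(a11, a12, a21, a22, b1, b2):
    """Solve a11*x1 + a12*x2 = b1, a21*x1 + a22*x2 = b2 by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("the linear system is singular")
    return (b1 * a22 - a12 * b2) / det, (a11 * b2 - a21 * b1) / det


def gradient(x1, x2):
    # Partial derivatives of c(x1, x2) = -x1^2 - x1*x2 - 2*x2^2 + 4*x1 + 6*x2.
    return (-2.0 * x1 - x2 + 4.0, -x1 - 4.0 * x2 + 6.0)


# Setting both partial derivatives to zero gives the linear system
#   2*x1 +   x2 = 4
#     x1 + 4*x2 = 6
x1, x2 = solve_2x2(2.0, 1.0, 1.0, 4.0, 4.0, 6.0)
print(x1, x2, gradient(x1, x2))
```

For this particular quadratic the stationary point is a maximum because the objective is concave; in general the necessary condition alone does not settle that question.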

Algorithmic method. Many of the computational techniques for maxi¬ 
mizing c{x) can be expressed in a standardized format: 

Step 1. Select an arbitrary initial trial point x^0. 

Step 2. Terminate the iterations if \partial c/\partial x_j = 0, for j = 1, 2, . . . , n, at the trial 
point x^k. Otherwise, determine values y_j^k, for j = 1, 2, . . . , n, and continue to 
Step 3. 

Step 3. Calculate a new trial point 

(1)    x_j^{k+1} = x_j^k + y_j^k    for j = 1, 2, . . . , n. 

Return to Step 2, where x^{k+1} replaces x^k. 

For most nonlinear objective functions, the iterative process never obtains an 
x^k such that all \partial c/\partial x_j = 0. Hence, the algorithm never terminates according to 
the test specified in Step 2, and must be stopped when x^k is nearly optimal. To 
ensure the calculations stop in a finite number of iterations, you can, for example, 
impose a limit on the number of iterations, or terminate when all the values of 
|\partial c/\partial x_j| are sufficiently small. 
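
The standardized format, together with the two practical stopping rules just mentioned, can be sketched as follows. The increments used here (a fixed multiple of the partial derivatives) are only one simple choice, made for illustration; the quadratic objective, the names `grad` and `iterate`, the multiplier `step`, and the tolerances are all assumptions.

```python
def grad(x1, x2):
    # Partial derivatives of the illustrative objective
    # c(x1, x2) = -(x1 - 3)^2 - 4*(x2 - 2)^2, whose maximum is at (3, 2).
    return (-2.0 * (x1 - 3.0), -8.0 * (x2 - 2.0))


def iterate(x0, step=0.1, grad_tol=1e-6, max_iter=1000):
    """Repeat x <- x + y until every partial derivative is nearly zero.

    Returns (final trial point, number of iterations performed).
    """
    x = list(x0)                                   # Step 1: initial trial point
    for k in range(max_iter):                      # limit on the iterations
        g = grad(*x)
        if max(abs(gj) for gj in g) < grad_tol:    # Step 2: practical stopping test
            return x, k
        y = [step * gj for gj in g]                # increments for this iteration
        x = [xj + yj for xj, yj in zip(x, y)]      # Step 3: new trial point
    return x, max_iter


point, iterations = iterate((0.0, 0.0))
print(point, iterations)
```

With too large a fixed multiplier the trial points would overshoot and oscillate, which is one reason practical methods choose the size of each move afresh at every iteration.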

Various algorithms differ in the selection of $y_j^k$ in Step 2. Many of these methods
proceed as follows. First, pick what is called a direction $d_j^k$, for each $j = 1, 2, \ldots, n$;
this selection is usually based on information about how $c(x)$ behaves near the
trial point $x^k$, that is, it is based on the local properties of $c(x)$. Second, choose a
step size $t^k$ based on information about how $c(x)$ behaves beyond the local vicinity
of $x^k$ when proceeding in the selected direction $d_1^k, \ldots, d_n^k$. Finally, combine
the direction and step size by letting

(2)    $y_j^k = t^k d_j^k$    for $j = 1, 2, \ldots, n$.

A visual representation of the process will help you remember the meaning of
these terms.

Suppose you want to maximize $c(x_1, x_2) = -(x_1 - 3)^2 - 4(x_2 - 2)^2$. You
can tell by inspecting $c(x)$ that the unique optimal solution is $x_1 = 3$, $x_2 = 2$,
yielding $c(x) = 0$. [Since $c(x)$ is a quadratic function, the optimal $x$ can be calculated
directly from the necessary conditions $\partial c/\partial x_j = 0$, for $j = 1, 2$. Nevertheless,
the quadratic example serves to exhibit the main ideas of the algorithm.] The
contour levels of $c(x_1, x_2)$ are ellipses, and are depicted in Fig. 14.6. Starting with
Step 1, let $x_1^0 = x_2^0 = 0$, and hence $c(x^0) = -25$. The partial derivatives are

(3)    $\dfrac{\partial c}{\partial x_1} = -2(x_1 - 3)$    and    $\dfrac{\partial c}{\partial x_2} = -8(x_2 - 2)$,

which are both strictly positive at $x^0$, so Step 2 indicates that a further improvement
is possible.

In preparation for choosing the directions $d_1^0$ and $d_2^0$, draw the tangent line to
the contour of $c(x)$ at $x^0$, given by the equation

(4)    $\dfrac{\partial c}{\partial x_1}(x_1 - x_1^0) + \dfrac{\partial c}{\partial x_2}(x_2 - x_2^0) = 0$,

with the partial derivatives evaluated at $x^0$. Verify that for $x_1^0 = x_2^0 = 0$, (4) yields

(5)    $6x_1 + 16x_2 = 0$    or    $x_2 = -\tfrac{3}{8}x_1$,

which is plotted in Fig. 14.6a. Any direction $(d_1, d_2)$ can be indicated graphically
by a vector pointing away from the origin. Several possibilities are drawn in Fig.
14.6b. The value of $c(x)$ increases beyond $c(x) = -25$ by moving along any direction
vector $(d_1, d_2)$ that lies above the tangent line at $x^0$, provided that the step size
in this direction is not too large. No matter which direction you choose, eventually
$c(x)$ decreases as the step size grows large.

If you were fortunate enough to have selected $d_1^0$ and $d_2^0$ proportional to 3 and
2 (for example, $d_1^0 = 1$ and $d_2^0 = \tfrac{2}{3}$), then you can obtain the optimal solution if
you pick the right step size. But in maximizing an arbitrary nonlinear function
$c(x)$, there generally is no way to ensure that the direction you pick leads to an
optimal solution in only one iteration. Suppose, in the example, that you selected
$d_1^0 = 2$ and $d_2^0 = 1$, as shown in Fig. 14.6c. If you want to make $c(x)$ as large as
possible along that direction, then you should extend the vector $(d_1^0 = 2, d_2^0 = 1)$
until it is tangent to a contour of $c(x_1, x_2)$. Explain why. This tangency is exhibited
in Fig. 14.6c, at point $(x_1 = 3\tfrac{1}{2}, x_2 = 1\tfrac{3}{4})$, so that (1) and (2) can be expressed as

(6)    $x_1^1 = 0 + (1\tfrac{3}{4})(2) = 3\tfrac{1}{2}$
       $x_2^1 = 0 + (1\tfrac{3}{4})(1) = 1\tfrac{3}{4}$,

where $t^0 = 1\tfrac{3}{4}$ is the optimal step size. At the new point in (6), $c(x^1) = -\tfrac{1}{2}$.

Figure 14.6. (a) Contours. (b) Directions. (c) Optimal Step Size.

Given a set of directions $d_j^k$, for $j = 1, 2, \ldots, n$, the problem of choosing the
associated optimal step size $t^k$ is merely a one-dimensional optimization
problem:

(7)    $\underset{t \ge 0}{\text{maximize}}\; c(x_1^k + t d_1^k,\, x_2^k + t d_2^k,\, \ldots,\, x_n^k + t d_n^k) = c(x_1^k + t^k d_1^k,\, \ldots,\, x_n^k + t^k d_n^k)$.

In other words, each argument of $c(x)$ is simply $x_j^k + t d_j^k$, or in abbreviated form
$c(x^k + t d^k)$, and hence depends on the single variable $t$. The optimal step size $t^k$
is that value of $t$ that maximizes $c(x^k + t d^k)$, which is expressed symbolically in
(7), and the new point $x^{k+1}$ is simply $x^k + t^k d^k$.

To illustrate in the example, let $x_1^0 = x_2^0 = 0$ and select $d_1^0 = 1$ and $d_2^0 = 2$.
Then according to (7), you want to maximize

(8)    $c(0 + t \cdot 1,\, 0 + t \cdot 2) = -(t - 3)^2 - 4(2t - 2)^2$.

Because the expression on the right of (8) is a concave quadratic function of $t$,
you can find the optimal $t$ analytically by setting the derivative to 0. Verify that
this procedure gives $t^0 = \tfrac{19}{17}$, since the derivative of (8) is $-34t + 38$. For more
complex functions $c(x)$, finding the optimal
step size may not be an easy task unless the shape of $c(x)$ permits use of a technique
explained in Sec. 14.3. We return to this consideration in the next section, where
we discuss the case of maximizing a concave function $c(x)$.
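The step-size calculation for the quadratic example can be checked with exact fractions. The sketch below is specific to $c(x) = -(x_1 - 3)^2 - 4(x_2 - 2)^2$: along any direction the function of $t$ is a concave parabola, so its derivative vanishes at $t = (g \cdot d)/(2d_1^2 + 8d_2^2)$, where $g$ is the gradient at the trial point. The helper name `optimal_step` and the choice of the three directions tried are illustrative assumptions.

```python
from fractions import Fraction


def c(x1, x2):
    return -(x1 - 3) ** 2 - 4 * (x2 - 2) ** 2


def optimal_step(x, d):
    """Value of t >= 0 maximizing c(x + t*d) for the quadratic c above."""
    g = (-2 * (x[0] - 3), -8 * (x[1] - 2))          # gradient at x
    slope = g[0] * d[0] + g[1] * d[1]               # derivative in t at t = 0
    curvature = 2 * d[0] ** 2 + 8 * d[1] ** 2       # minus the second derivative in t
    return max(Fraction(slope) / Fraction(curvature), Fraction(0))


x0 = (Fraction(0), Fraction(0))
results = {}
for d in [(2, 1), (1, 2), (3, 2)]:
    t = optimal_step(x0, d)
    x_new = (x0[0] + t * d[0], x0[1] + t * d[1])
    results[d] = (t, x_new, c(*x_new))
    print(d, t, x_new, c(*x_new))
```

The direction $(2, 1)$ reproduces the step size $1\tfrac{3}{4}$ and the point $(3\tfrac{1}{2}, 1\tfrac{3}{4})$; a direction proportional to 3 and 2 reaches the optimum in one step.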

You can now appreciate what is a distinguishing difference between linear and
nonlinear optimization. In the linear case, the objective function changes by $c_j$
for each additional unit of $x_j$, irrespective of the actual value of $x_j$ or of any other
$x_i$. In the nonlinear case, the change in the objective function for an added unit
of $x_j$ may depend on the actual values of $x_j$ and all other $x_i$. It is this dependency
that is the source of difficulty in choosing good directions for Step 2.

► Consider a procedure that at every iteration both picks a direction yielding a strict
improvement in $c(x)$ and selects the associated optimal step size. As the following example
shows, although such a procedure causes $c(x^k)$ to tend to a limit, the limit need not be
the maximum value for $c(x)$.

Let $c(x) = -(x_1)^2 - (x_2)^2$, which is a strictly concave function and assumes its
unique maximum value at $x_1 = x_2 = 0$. Let $x^0$ be a point on a circle centered at the
origin and having radius 2. Then for $k = 0, 1, 2, \ldots$, let the point $x^{k+1}$ be chosen such
that the line drawn from $x^k$ through $x^{k+1}$ is tangent at $x^{k+1}$ to the circle centered at the
origin and having radius $(k + 3)/(k + 2)$, so that every $x^k$ lies on the circle of radius
$(k + 2)/(k + 1)$. Check that $c(x)$ is improved at each trial, and that the procedure
employs the optimal step size, because of the tangency assumption. As $k$ becomes large,
the trial solutions $x^k$ spiral closer and closer to the circle centered at the origin and
having radius 1. Thus the successive values of $c(x^k)$ are strictly increasing but are
approaching $-1$ in the limit, which is the value of $c(x)$ everywhere on the circle of
radius 1, whereas the maximum value of $c(x)$ is 0 at the
origin. As this example demonstrates, the directions in Step 2 must be chosen with some
care, even when $c(x)$ is very well behaved. ◄

14.5 METHOD OF STEEPEST ASCENT 

As was mentioned above, most rules for selecting the directions $d_j^k$, for $j = 1,
2, \ldots, n$, are myopic in that they use information about $c(x)$ obtained at the
current trial value $x^k$. A well-known myopic procedure for performing Step 2 is
described below.

Steepest Ascent with Optimal Step Size. (a) Let each $d_j^k = \partial c/\partial x_j$, evaluated
at trial point $x^k$. (b) Find $t^k \ge 0$ that maximizes $c(x_1^k + t d_1^k, \ldots, x_n^k + t d_n^k)$.
(c) Then, in Step 2, let each $y_j^k = t^k d_j^k$.

(This approach was first suggested in 1847 by the mathematician Cauchy.)
The idea of using for directions the set of partial derivatives (usually called the
gradient of $c(x)$ at $x^k$ and abbreviated by the symbol $\nabla c(x^k)$) is intuitively
appealing. Loosely put, the gradient represents the direction of fastest increase
in $c(x)$ in the vicinity of $x^k$. Nevertheless, the approach cannot guarantee swift
convergence to an optimal solution, because, as with all myopically determined
directions, the function $c(x)$ need not increase so fast once the trial point has moved
along the gradient beyond the immediate vicinity of $x^k$.

► The so-called directional derivative of $c(x)$ at the point $x$ along the direction
$d_1, \ldots, d_n$ is defined as the limiting value of

(i)    $\dfrac{c(x_1 + h d_1, \ldots, x_n + h d_n) - c(x_1, \ldots, x_n)}{h \left( \sum_{j=1}^{n} d_j^2 \right)^{1/2}}$

as $h$ approaches 0 from positive values. Given the smoothness assumptions on $c(x)$ in
Sec. 14.4, this limit equals

(ii)    $\dfrac{\sum_{j=1}^{n} (\partial c/\partial x_j)\, d_j}{\left( \sum_{j=1}^{n} d_j^2 \right)^{1/2}}$.

If the $d_j$ are normalized so that the denominator of (ii) is 1, then the value of the
directional derivative is maximized for $d_j$ that are proportional to $\partial c/\partial x_j$. In other words,
the rate of increase in $c(x)$ per unit of Euclidean distance is largest for the gradient
direction. It is precisely in this sense that the gradient is said to be the direction of
steepest ascent. ◄

Examples. For the first example, consider again $c(x) = -(x_1 - 3)^2 -
4(x_2 - 2)^2$. The steepest ascent rule gives $d_1^0 = 6$ and $d_2^0 = 16$ at $x_1^0 = x_2^0 = 0$,
according to the partial derivatives in (3) of Sec. 14.4. Check that the optimal step size
must maximize $[-(6t - 3)^2 - 4(16t - 2)^2]$, yielding $t^0 = \tfrac{73}{530}$. As a result,

(1)    $x_1^1 = 0 + (\tfrac{73}{530})(6) = \tfrac{219}{265} = .83$    $x_2^1 = 0 + (\tfrac{73}{530})(16) = \tfrac{584}{265} = 2.20$,

giving $c(x^1) = -4.89$; the point is shown in Fig. 14.7. [Notice that using the
gradient did not increase $c(x)$ as much as using the directions yielding (6) in Sec.
14.4.] Continuing the calculations for one more iteration, the directions at $x^1$ are

(2)    $\dfrac{\partial c}{\partial x_1} = -2\left(\tfrac{219}{265} - 3\right) = \tfrac{1152}{265}$
       $\dfrac{\partial c}{\partial x_2} = -8\left(\tfrac{584}{265} - 2\right) = -\tfrac{432}{265}$.

Note in Fig. 14.7 that this direction is orthogonal, that is, perpendicular,
to the previous direction. The reason is that the old gradient is also the tangent
to the contour at $x^1$, since an optimal step size is used, and the new gradient, by
its very nature, is orthogonal to the tangent of the contour at $x^1$. This orthogonality
property holds for all pairs of successive gradients and can be expressed mathematically as

(3)    $\sum_{j=1}^{n} \left.\dfrac{\partial c}{\partial x_j}\right|_{x^k} \left.\dfrac{\partial c}{\partial x_j}\right|_{x^{k+1}} = 0$.

For the second example, consider maximizing

(4)    $c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \tfrac{1}{3}x_3^2$,

which has the partial derivatives

(5)    $\dfrac{\partial c}{\partial x_1} = 6(1 - x_1)$    $\dfrac{\partial c}{\partial x_2} = 4(1 - x_2)$    $\dfrac{\partial c}{\partial x_3} = 2(1 - \tfrac{1}{3}x_3)$.

The Necessary Condition for a Maximum in Sec. 14.4 yields $x_1 = 1$, $x_2 = 1$, and
$x_3 = 3$, which indeed is globally optimal. [This example is repeated in the sections
below, so copy (4), (5), and (7) on a sheet of paper for easy future reference.]
Given any set of directions $d_j$ and a step size $t$, the expression in (4) becomes a
quadratic function of $t$:

(6)    $6(x_1 + t d_1) + 4(x_2 + t d_2) + 2(x_3 + t d_3)$
       $- 3(x_1 + t d_1)^2 - 2(x_2 + t d_2)^2 - \tfrac{1}{3}(x_3 + t d_3)^2$.

Further, in this case it can be shown that expression (6) is a concave function of $t$,
and the optimal step size can be analytically found by setting the derivative equal
to zero and solving for $t$. Verify that this calculation yields

(7)    optimal $t = \dfrac{6(1 - x_1)d_1 + 4(1 - x_2)d_2 + 2(1 - \tfrac{1}{3}x_3)d_3}{6(d_1)^2 + 4(d_2)^2 + \tfrac{2}{3}(d_3)^2}$.

Given that $d_j = \partial c/\partial x_j$ in the method of steepest ascent, the optimal $t$ is never
negative.

The results of applying the method of steepest ascent are shown in Fig. 14.8,
where the initial point is each $x_j^0 = 0$.

          $c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \tfrac{1}{3}x_3^2$

Iteration k    c(x^k)    x_1^k    x_2^k    x_3^k    Step Size t^k
     0          0         0        0        0          .1981
     1          5.547     1.189    .792     .396       .4001
     2          6.544     .736     1.125    1.091      .2554
     5          7.646     1.088    .999     2.004      .3642
     6          7.778     .895     1.000    2.246      .2551
    10          7.965     .959     1.000    2.702      .2551
    11          7.978     1.022    .999     2.753      .3642
    18          7.999     .994     1.000    2.954      .2551
    19          7.999     1.003    1.000    2.961      .3642
    20          7.999     .996     1.000    2.971       -
 Optimal
 Values         8         1        1        3          0

Figure 14.8. Example of Steepest Ascent with Optimal Step Size.

Notice how the successive values of $x_1^k$ and
$x_2^k$ oscillate around their optimal values. Observe how rapidly the value of the
objective function rises for the first few iterations; by the fifth iteration, $c(x)$ is
within about 5% of its maximum. In contrast, more than twice as many iterations
are required to obtain all values of $x_j$ that are within 5% of their optimal levels.
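The calculations in both examples can be reproduced with a few lines of code. The sketch below is not a general-purpose routine: it assumes the objective has the separable quadratic form $c(x) = \text{const} + \sum_j (b_j x_j - a_j x_j^2)$ with every $a_j > 0$, which covers both examples, and for that form setting the derivative with respect to $t$ to zero gives the step size in closed form, in agreement with (7). The function name `steepest_ascent` and the record layout are illustrative choices.

```python
from fractions import Fraction


def steepest_ascent(b, a, x0, iterations, const=0):
    """Steepest ascent with optimal step size for the separable quadratic
    c(x) = const + sum_j (b[j]*x[j] - a[j]*x[j]**2), where every a[j] > 0.
    Returns one record (k, c(x^k), x^k, gradient at x^k, t^k) per iteration."""
    def c(x):
        return const + sum(bj * xj - aj * xj * xj for bj, aj, xj in zip(b, a, x))

    x, records = list(x0), []
    for k in range(iterations + 1):
        g = [bj - 2 * aj * xj for bj, aj, xj in zip(b, a, x)]   # d^k = gradient
        gg = sum(gj * gj for gj in g)
        # Setting d/dt c(x + t*g) = 0 gives t = (g.g) / (2 * sum_j a_j g_j^2).
        t = gg / (2 * sum(aj * gj * gj for aj, gj in zip(a, g))) if gg else 0
        records.append((k, c(x), list(x), g, t))
        x = [xj + t * gj for xj, gj in zip(x, g)]
    return records


# First example, in exact arithmetic: c(x) = -(x1 - 3)^2 - 4(x2 - 2)^2.
F = Fraction
first = steepest_ascent([F(6), F(16)], [F(1), F(4)], [F(0), F(0)], 1, const=-25)
g0, g1 = first[0][3], first[1][3]
print(first[1][2], float(first[1][1]), sum(u * v for u, v in zip(g0, g1)))

# Second example, in floating point, printing the iterations listed in Fig. 14.8.
second = steepest_ascent([6.0, 4.0, 2.0], [3.0, 2.0, 1.0 / 3.0], [0.0] * 3, 20)
for k, value, x, _, t in second:
    if k in (0, 1, 2, 5, 6, 10, 11, 18, 19, 20):
        print(k, round(value, 3), [round(xj, 3) for xj in x], round(t, 4))
```

The inner product of the two successive gradients in the first example comes out exactly zero, which is the orthogonality property. Printed values are rounded, so a last digit may differ slightly from the figure.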

The special nature of this example can be used to illustrate why moving in the
direction of the gradient does not always lead to swift convergence. Suppose you
used a "move one variable at a time" approach to selecting the directions, instead
of using the gradient. Since in this example there are no cross-product terms in
$c(x)$, such as $x_1 x_2$, you obtain the exact optimal solution in three iterations
(regardless of the order you select for considering the variables). The method of steepest
ascent converges only in the limit for this example (starting with all $x_j^0 = 0$).
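The "move one variable at a time" claim is easy to confirm for this example. In the sketch below, each iteration uses a direction with a single nonzero component and the optimal step size for that direction; the names `B`, `A`, and `one_variable_at_a_time` are illustrative, and the closed-form step relies on the absence of cross-product terms.

```python
from fractions import Fraction
from itertools import permutations

B = [Fraction(6), Fraction(4), Fraction(2)]          # linear coefficients
A = [Fraction(3), Fraction(2), Fraction(1, 3)]       # coefficients of -x_j^2


def one_variable_at_a_time(order):
    """Move along one coordinate direction per iteration, each time using the
    optimal step size for c(x) = sum_j (B[j]*x[j] - A[j]*x[j]**2)."""
    x = [Fraction(0)] * 3
    for j in order:
        partial_j = B[j] - 2 * A[j] * x[j]
        x[j] += partial_j / (2 * A[j])      # optimal step with d_j = 1, other d_i = 0
    return x


for order in permutations(range(3)):
    print(order, one_variable_at_a_time(order))
```

Every ordering of the three variables ends at $x_1 = 1$, $x_2 = 1$, $x_3 = 3$ after three iterations.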


*Modifications of steepest ascent. The preceding algorithm is
termed a first-order method, since it uses only the first-order partial derivatives.
Unless modified, the method of steepest ascent is likely to converge slowly after
the first few iterations and exhibit certain annoying characteristics. For example,
the orthogonality property in (3) gives rise to a zigzagging phenomenon,
illustrated in Fig. 14.7. There are several numerical techniques for modifying the
method of steepest ascent that reduce the approach's propensity to zigzag, and
thereby speed up convergence.

One such technique, named the Newton-Raphson Method after the
scientists who proposed it, employs a direction based on a quadratic fit of the
objective function. This method results in using the second-order partial derivatives
to form a weighted average of the first-order partial derivatives, and falls
in the category of so-called modified gradient techniques.

Specifically, the $d_j^k$ are found by solving the linear system

(8)    $\sum_{j=1}^{n} \dfrac{\partial^2 c}{\partial x_i\, \partial x_j}\, d_j^k = -\dfrac{\partial c}{\partial x_i}$    for $i = 1, 2, \ldots, n$,

where the partial derivatives are evaluated at $x^k$ and $c(x)$ is assumed to be such that
(8) always has a well-defined solution at
each trial point $x^k$. This second-order method applied to a quadratic function
$c(x)$ yields a stationary point in a single iteration for any $x^0$. And if you let each
$x_j^0 = 0$, then solving the system (8) is equivalent to solving the system $\partial c/\partial x_j = 0$
for $j = 1, 2, \ldots, n$.
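A small sketch shows the single-iteration property on the second example of this section. Two assumptions are made here and should be read as such: the right-hand side of the linear system is taken as minus the gradient, and the new trial point is taken as $x^0 + d^0$ (a step size of 1). The elimination routine `solve` is a generic helper written for this illustration.

```python
from fractions import Fraction


def solve(matrix, rhs):
    """Solve matrix * d = rhs by Gauss-Jordan elimination in exact arithmetic.
    Raises StopIteration if no nonzero pivot exists (singular system)."""
    n = len(rhs)
    aug = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(matrix, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def gradient(x):
    # Partial derivatives of c(x) = 6x1 + 4x2 + 2x3 - 3x1^2 - 2x2^2 - x3^2/3.
    return [6 * (1 - x[0]), 4 * (1 - x[1]), 2 * (1 - x[2] / 3)]


HESSIAN = [[-6, 0, 0], [0, -4, 0], [0, 0, Fraction(-2, 3)]]   # second-order partials

x0 = [Fraction(0)] * 3
d = solve(HESSIAN, [-g for g in gradient(x0)])      # the linear system, minus sign assumed
x1 = [xj + dj for xj, dj in zip(x0, d)]
print(d, x1, gradient(x1))
```

Starting instead from any other point, for instance $(5, -2, 7)$, the same two lines again land on $(1, 1, 3)$, because the objective is quadratic.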

Another acceleration approach, called subrelaxation, uses a smaller than
optimum step size. The variety of useful modifications to the method of steepest
ascent is too large and the subject itself is too specialized to be treated any further in
this text.

A notable property of gradient approaches is that their speed of convergence
can depend on the way a problem is formulated. For example, in the above
illustration with $c(x) = -(x_1 - 3)^2 - 4(x_2 - 2)^2$, let $z = 2x_2 - 2$ so that
$x_2 = .5(z + 2)$. Then verify that $c(x_1, z) = -(x_1 - 3)^2 - (z - 2)^2$ and $x_1 = 3$
and $z = 2$ are optimal. Now $\partial c/\partial x_1 = -2(x_1 - 3)$ and $\partial c/\partial z = -2(z - 2)$.
Letting $x_1^0 = z^0 = 0$ gives the steepest ascent directions $d_1^0 = 6$ and $d_2^0 = 4$;
hence $t^0 = \tfrac{1}{2}$ and $x_1^1$ and $z^1$ are optimal. Thus this transformation of variables
causes the method of steepest ascent to converge in one iteration.
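The effect of the transformation can be seen by taking one steepest ascent step in each formulation. The sketch below assumes an objective of the form $-\sum_j w_j (x_j - \text{target}_j)^2$, which fits both versions of the illustration; the function name and argument names are illustrative.

```python
from fractions import Fraction


def steepest_ascent_step(x, weights, target):
    """One steepest ascent step with optimal step size for
    c(x) = -sum_j weights[j] * (x[j] - target[j])**2 (gradient assumed nonzero)."""
    g = [-2 * w * (xj - tj) for w, xj, tj in zip(weights, x, target)]
    t = Fraction(sum(gj * gj for gj in g),
                 sum(2 * w * gj * gj for w, gj in zip(weights, g)))
    return [xj + t * gj for xj, gj in zip(x, g)], t


# Original variables (x1, x2): c = -(x1 - 3)^2 - 4(x2 - 2)^2.
original_point, original_t = steepest_ascent_step([0, 0], [1, 4], [3, 2])
# Transformed variables (x1, z) with z = 2*x2 - 2: c = -(x1 - 3)^2 - (z - 2)^2.
rescaled_point, rescaled_t = steepest_ascent_step([0, 0], [1, 1], [3, 2])
print(original_point, original_t)
print(rescaled_point, rescaled_t)
```

With equal weights the contours are circles, the gradient points straight at the optimum, and one step suffices; with unequal weights the first step stops short.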


Convergence of Steepest Ascent Algorithm. The foregoing
examples of $c(x)$ were general enough to exhibit the sequence of calculations in the
method of steepest ascent using an optimal step size. But the specific numerical
behavior exhibited may have been misleading, since the selected functions were
quadratic, and satisfied the Necessary Condition for a Maximum at a unique
point. You may wonder how this algorithm would behave on a function such as
that in Fig. 14.2. It can be shown that if $c(x)$ satisfies assumptions (i) through
(iv) in Sec. 14.4, then for an arbitrary initial trial value $x^0$:

(A) The entire sequence $c(x^k)$ always increases to a limit $c^*$.

(B) At least a subsequence of $x^k$ converges to a point $x^*$.

(C) At $x^*$, the objective function is $c(x^*) = c^*$, and the Necessary Condition
for a Maximum is satisfied, that is, each $\partial c/\partial x_j = 0$.

Hence, not only is the sequence of $c(x^k)$ strictly increasing, but for $k$ large enough,
$c(x^k)$ becomes arbitrarily close to a limiting value $c^*$. In general, however, there
is no finite $k$ such that $c(x^k) = c^*$ or $x^k = x^*$; roughly stated, the convergence
is infinite.

Regrettably, the three properties above do not mean that the entire sequence
of $x^k$ converges to a point $x$ at which $c(x)$ is maximum. First, $c^*$ need not be the
maximum value for $c(x)$. Second, $x^*$ may be a point in Fig. 14.2 like $a$, $c$ (if
$x^0 = c$), $d$, or $e$. (In a multidimensional example, $x^*$ may be a saddle point.)
Therefore, $x^*$ may only be what is called a stationary point, that is, a point
satisfying the Necessary Condition for a Maximum.

Property B can be clarified using two-dimensional terminology. It asserts there
is a point $x^*$ such that if you draw a circle centered at $x^*$ and let the radius be
arbitrarily small, then infinitely many $x^k$ lie within this circle. But the fact that
$x^{1000}$, say, is in the circle does not guarantee that the next point $x^{1001}$ is also in the
circle. Although the entire sequence $x^k$ need not converge to $x^*$, it typically does in
real applications.

Unless you impose further restrictions on the function $c(x)$, the above
convergence properties are the best you can obtain. If you are uncertain about the
shape of a particular $c(x)$, then ordinarily you would apply the algorithm several
times, each time using an $x^0$ that is distant from the previous initial points as well
as from the resultant $x^*$.

► Properties A and B are demonstrated by appealing to a well-known result in
mathematical analysis, namely, that every infinite sequence in a compact space has a convergent
subsequence. (This proposition is frequently referred to as the Bolzano-Weierstrass
Theorem.) Assumptions (i), (iii), and (iv) in Sec. 14.4 imply that the set of points
$[x, c(x)]$ in $(n + 1)$-dimensional space for which $c(x) \ge c(x^0)$ is compact; hence, there is a
subsequence that converges to $[x^*, c^*]$. Since $c(x^{k+1}) \ge c(x^k)$, the entire sequence $c(x^k)$
tends to $c^*$ in the
limit. Thus Properties A and B hold for any algorithm that improves the value of $c(x)$
at each iteration. The specific characteristics of the method of steepest ascent using
an optimal step size are employed in proving Property C.

Let $X^i$ denote a general term in the subsequence converging to $x^*$. Then, by the
continuity of $c(x)$,

(i)    $c(x^*) = \lim_{i \to \infty} c(X^i) = c^*$.

Abbreviating $c[X_1^i + t(\partial c/\partial x_1), \ldots, X_n^i + t(\partial c/\partial x_n)]$ by the symbol $c[X^i + t\nabla c(X^i)]$,
we have

(ii)    $c(X^{i+1}) \ge c[X^i + t^i \nabla c(X^i)] \ge c[X^i + t\nabla c(X^i)]$    for $t \ge 0$,

where the first inequality follows from the "steepest ascent" procedure, and the second
from the "optimal step size" procedure. Taking limits on both ends of (ii), employing
the continuity assumption on each $\partial c/\partial x_j$, and using (i) gives

(iii)    $c(x^*) \ge c[x^* + t\nabla c(x^*)]$    for all $t \ge 0$.

Now (iii) in turn implies $\nabla c(x^*) = 0$, that is, each $\partial c/\partial x_j = 0$ at $x^*$; for if not, there is
a positive value for $t$ such that the right side of (iii) is strictly greater than $c(x^*)$,
contradicting the inequality in (iii). Hence Property C is established. ◄

Concave objective functions. Suppose the function $c(x)$ is now
restricted further by adding an assumption about its shape. A several-variable
function $c(x)$ is defined as concave if for any two points $x$ and $y$, and for all
$0 \le p \le 1$,

(9)    $p\,c(x) + (1 - p)\,c(y) \le c(p x_1 + (1 - p) y_1, \ldots, p x_n + (1 - p) y_n)$    Concave,

and as strictly concave if there is a strict inequality ($<$) in (9) for $0 < p < 1$ and $x \ne y$.
Using geometric language, a function is said to be concave if the chord connecting
every pair of points on its graph lies on or below the graph, and is strictly concave
if it always lies strictly below (except at end points $x$ and $y$). Equivalently, a
function is concave if it lies on or below every plane that is tangent to its surface.
Case 2 in Fig. 14.1 shows a strictly concave function; if it were modified to include
a linear segment, then it would be only concave. The case represented by Fig.
14.6 is also an example of a strictly concave function. We can state the following
condition.

Sufficient Condition for a Maximum of a Concave Function. Given
assumptions (i) through (iv) in Sec. 14.4, and that $c(x)$ is concave, if each $\partial c/\partial x_j = 0$ at
a point $\bar{x}$, then $c(\bar{x})$ is the maximum value for $c(x)$. Further, if $c(x)$ is strictly concave,
then $\bar{x}$ is unique.

Therefore, a local maximum of a concave function is also a global maximum. As
an important consequence, if you apply the method of steepest ascent using an
optimal step size, the sequence $c(x^k)$ increases in the limit to the maximum value
of $c(x)$; and if the function is strictly concave, the entire sequence $x^k$ converges to
the unique optimal solution $\bar{x}$. A further significant computational implication
is that the optimal step size procedure of the algorithm can be achieved by
employing a search method explained in Sec. 14.3. The reason is that if $c(x)$ is concave,
then $c(x_1 + t d_1, \ldots, x_n + t d_n)$ is a concave function of $t$. [At each iteration, you
will have to supply an upper bound $\bar{t}$ on the optimal step size, and then search
for the best value of $t$ in the interval $[0, \bar{t}]$.]
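As a rough illustration of searching an interval for the best step size, the sketch below uses a simple interval-trisection (ternary) search. This particular search is an assumption made for the example and is not necessarily the technique of Sec. 14.3; it relies only on the one-variable function being concave on $[0, \bar{t}]$. The upper bound of 1 and the function names are likewise illustrative.

```python
def search_step_size(phi, t_upper, tol=1e-9):
    """Approximate the maximizer of a concave function phi on [0, t_upper]
    by repeatedly discarding the third of the interval that cannot contain it."""
    lo, hi = 0.0, t_upper
    while hi - lo > tol:
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if phi(m1) < phi(m2):
            lo = m1          # a maximizer lies to the right of m1
        else:
            hi = m2          # a maximizer lies to the left of m2
    return 0.5 * (lo + hi)


def c(x1, x2):
    return -(x1 - 3.0) ** 2 - 4.0 * (x2 - 2.0) ** 2


# Gradient direction (6, 16) from the trial point (0, 0); t_upper = 1 is assumed.
t_best = search_step_size(lambda t: c(6.0 * t, 16.0 * t), t_upper=1.0)
print(t_best, 73 / 530)
```

For this quadratic the search agrees, to within rounding, with the step size $\tfrac{73}{530}$ that results from setting the derivative to zero.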

► If you terminate the algorithm at the trial value $x^k$ and $c(x)$ is concave, then you
can obtain a bound on the maximum value for $c(x)$. The approach is based on the fact
that a tangent plane to $c(x)$ at $x^k$ overestimates $c(x)$ for any $x$. Specifically,

(i)    $c(x) \le c(x^k) + \sum_{j=1}^{n} \dfrac{\partial c}{\partial x_j}\,(x_j - x_j^k)$    Concave

for any $x$, where the partial derivatives are evaluated at $x^k$.

To illustrate the idea, suppose you can assert how far an optimal $\bar{x}$ might be from
$x^k$. Specifically, assume that $|\bar{x}_j - x_j^k| \le \epsilon$ for every $j$, where $\epsilon > 0$. Then the maximum
value of $c(x)$ cannot be larger than $c(x^k) + \epsilon \sum_{j=1}^{n} |\partial c/\partial x_j|$. The correction factor
is small when $\epsilon$ and the gradient at $x^k$ are nearly zero.
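The bound can be tried on the second example of this section, using the trial point reported after 20 iterations in Fig. 14.8. In the sketch below, the distance limit of 0.03 is an assumed value supplied by the user of the bound, not something the algorithm produces, and the function names are illustrative.

```python
def c(x):
    return 6 * x[0] + 4 * x[1] + 2 * x[2] - 3 * x[0] ** 2 - 2 * x[1] ** 2 - x[2] ** 2 / 3


def gradient(x):
    return [6 * (1 - x[0]), 4 * (1 - x[1]), 2 * (1 - x[2] / 3)]


def upper_bound(x_k, eps):
    """c(x^k) + eps * sum_j |dc/dx_j|, valid when c is concave and an optimal
    point differs from x^k by at most eps in every coordinate."""
    return c(x_k) + eps * sum(abs(gj) for gj in gradient(x_k))


x_20 = [0.996, 1.000, 2.971]            # trial point after 20 iterations (Fig. 14.8)
bound = upper_bound(x_20, eps=0.03)     # eps = 0.03 is an assumed distance limit
print(c(x_20), bound)
```

The computed bound lies slightly above the true maximum of 8, so stopping at this trial point sacrifices very little.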

In examining the method of steepest ascent as pictured in Figs. 14.6 and 14.7, you
may have concluded that the shape of the level contours ensured that a local optimum
was also a global optimum. Specifically, the set of points $x$ such that $c(x) \ge c$ was convex,
which, in turn, ensured that the contours $c(x) = c$ were nested as $c$ increased. There
were no "pockets" or "traps" for $x^k$ to be lured away from the optimum $\bar{x}$. It is easily
shown that concave functions always have this nested convexity property for the level
contours. But so do certain other functions. [For example, consider the function $c(x) =
-x^2/(1 + x^2)$, which is bell-shaped and hence not concave. The set of points $x$ such that
$c(x) \ge c$ is convex, namely, an interval centered at $x = 0$. As $c$ increases, the intervals
shrink and are nested.]

Suppose, then, you postulate only the nested convexity property about the level sets
of $c(x)$. A function with this property is termed quasi-concave and is
mathematically defined by the condition that for any two points $x$ and $y$, and for all $p$, $0 \le p \le 1$,

(ii)    minimum $[c(x), c(y)] \le c(p x_1 + (1 - p) y_1, \ldots, p x_n + (1 - p) y_n)$    Quasi-concave.

(A concave function is always quasi-concave.) This assumption is not quite enough to
ensure that the sequence $c(x^k)$ tends to its maximum value in the method of steepest
ascent. The quasi-concavity assumption (ii) does not rule out the possibility that the
limit of the subsequence of $x^k$, namely $x^*$, can be an inflection point. But if you also
assume that $c(x)$ has no inflection points, or if you can verify that $x^*$ is not an inflection
point [the value of $c(x)$ does not increase in any direction from $x^*$], then $c(x^*)$ is indeed
the maximum value.

Incidentally, if you assume quasi-concavity and that $c(x)$ has no inflection points,
then you can also use the single-dimension adaptive search techniques given in Sec. 14.3
for finding an optimal step size. ◄

14.6 QUADRATIC PROGRAMMING 

This section begins the discussion of techniques for optimizing a nonlinear
objective function when the variables are subject to constraints. We start with
what is perhaps the simplest nonlinear extension of a linear programming model.

A quadratic programming problem consists of an objective function
composed of linear and quadratic terms and a set of linear constraints. The specific
version considered below is

(1)    maximize $\sum_{j=1}^{n} c_j x_j + \sum_{j=1}^{n} \sum_{k=1}^{n} c_{jk} x_j x_k$

subject to

(2)    $\sum_{j=1}^{n} a_{ij} x_j + s_i = b_i$    for $i = 1, 2, \ldots, m$

       all $x_j \ge 0$ and all $s_i \ge 0$,

where each $s_i$ represents a slack variable. To avoid treating all the various cases
that can arise, assume each $b_i \ge 0$. Also assume that $c(x)$ reaches a finite maximum
$\bar{c}$ over all feasible solutions, where $c(x)$ denotes the objective function in (1). The
modifications required to handle $b_i$ of any sign (or problems without all the slacks
present) and unbounded optimal solutions are straightforward, but will not be
covered in this text.

Section 14.1 included several illustrations ("sales revenue," "safety-stock
inventory levels," "portfolio selection") that contained quadratic terms in the
model formulation. The practical importance of such situations is one justification 
for devoting this entire section to solving quadratic programming problems. But, 
in truth, these applications are not the most frequently occurring examples of non¬ 
linear problems. Hence, the main justification for discussing this special case is 
that the topic represents the next logical step in mastering the ideas and techniques 
of nonlinear programming. It is particularly important that you observe 

(i) How the imposition of constraints influences an optimal solution. 

(ii) How a gradient optimization procedure has to take account of all the 
constraints. 

(iii) How the quadratic assumption leads to linearized algorithms. And 
specifically, how the simplex method is modified to provide an algorithm 
that converges to an exact solution in a finite number of iterations. 

Quadratic programming techniques are also significant because they are some¬ 
times used as subroutines in more general nonlinear programming algorithms— 
at a trial point, the nonlinear objective function is approximated by a quadratic 
function which, in turn, is optimized to provide a direction of improvement. This 
device falls under the heading of an advanced numerical technique for accelerat¬ 
ing convergence, and so will not be treated in this text. 

Given our assumptions about $b_i$ and $c(x)$, three types of optimal solutions are
possible, as shown in Fig. 14.9. Case 1 demonstrates that the optimal solution
may be in the interior of the constraint region; for the particular example, the
unique optimal solution is the interior point $x_1 = 1$ and $x_2 = 2$. Case 2 exhibits
that an optimal solution may be on the boundary of the constraint region,
but not necessarily at a vertex or an extreme point. Case 3 corresponds to the
kind of solution that arises when an objective function is linear, namely, when
there is a basic optimal solution occurring at a vertex of the constraint region.
Since Case 3 is not the only possibility here, nonlinear programming algorithms
must permit consideration of nonbasic solutions, in sharp contrast to linear
programming techniques.

Figure 14.9. Constrained Optimal Solutions. $c(x) = -(x_1 - 1)^2 -
(x_2 - 2)^2 + 5 = 2x_1 + 4x_2 - x_1 x_1 - x_2 x_2$. Case 1: Optimum in Interior.
Case 2: Optimum at Boundary. Case 3: Optimum at Vertex.

Preliminaries. Before discussing the steps of the algorithm, it is helpful
to make a few remarks about the nature of a quadratic objective function. To
begin, we adopt a convenient way to display the function; the approach is easily
explained by an example. Suppose

(3)    $c(x) = 2x_1 + 5x_2 - \tfrac{1}{4}x_1^2 + 1x_1 x_2 - 3x_1 x_3 - 4x_2^2$.

Note that $c(x)$ has been written in (3) so that there is at most only one term
containing any given pair of variables. For example, if the quadratic function
originally contained the expressions $-5x_1 x_3 + 2x_3 x_1$, then these two terms, both
containing the pair $x_1$ and $x_3$, would be simplified to $-3x_1 x_3$. The convention is
to display $c(x)$ in the format:

(4)    $c(x) = (0 + 1x_1 + \tfrac{5}{2}x_2 + 0x_3) \cdot 1$
            $+ (1 - \tfrac{1}{4}x_1 + \tfrac{1}{2}x_2 - \tfrac{3}{2}x_3) \cdot x_1$
            $+ (\tfrac{5}{2} + \tfrac{1}{2}x_1 - 4x_2 + 0x_3) \cdot x_2$
            $+ (0 - \tfrac{3}{2}x_1 + 0x_2 + 0x_3) \cdot x_3$.

Verify that the expression in (4) agrees with (3) when the terms are multiplied
and combined. Note in (4) that the diagonal terms ($-\tfrac{1}{4}$, $-4$, and 0) are the
coefficients of the squared variables in (3), and the off-diagonal terms are one-half
of the corresponding coefficients in (3). Also observe that the array of coefficients
in (4) is symmetric: the term in the $i$th row and $j$th column is the same as
that in the $j$th row and $i$th column.
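The display convention amounts to storing the quadratic function as a symmetric array whose rows and columns correspond to $(1, x_1, x_2, x_3)$. The sketch below, with illustrative names, checks that the array reproduces the example function and that twice a row of the array gives the corresponding partial derivative. The fraction $\tfrac{1}{4}$ on $x_1^2$ follows the reconstruction of the example used here.

```python
from fractions import Fraction as F

# Symmetric array of the display format: rows and columns ordered (1, x1, x2, x3).
ARRAY = [
    [F(0),    F(1),     F(5, 2), F(0)],
    [F(1),    F(-1, 4), F(1, 2), F(-3, 2)],
    [F(5, 2), F(1, 2),  F(-4),   F(0)],
    [F(0),    F(-3, 2), F(0),    F(0)],
]


def c_from_array(x1, x2, x3):
    v = [1, x1, x2, x3]
    return sum(v[i] * ARRAY[i][j] * v[j] for i in range(4) for j in range(4))


def c_direct(x1, x2, x3):
    # The example function written out term by term.
    return 2 * x1 + 5 * x2 - F(1, 4) * x1 ** 2 + x1 * x2 - 3 * x1 * x3 - 4 * x2 ** 2


def partial(j, x1, x2, x3):
    # Twice the parenthetical expression in row j of the array, for j = 1, 2, 3.
    v = [1, x1, x2, x3]
    return 2 * sum(ARRAY[j][i] * v[i] for i in range(4))


point = (F(2), F(-1), F(3))
print(c_from_array(*point), c_direct(*point), [partial(j, *point) for j in (1, 2, 3)])
```

Because the partial derivatives are read directly from the rows, they are linear in the variables, which is what later permits a simplex-like algorithm.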

Next consider the partial derivatives of $c(x)$ in the example (3):

       $\dfrac{\partial c}{\partial x_1} = 2 - \tfrac{1}{2}x_1 + 1x_2 - 3x_3 = 2(1 - \tfrac{1}{4}x_1 + \tfrac{1}{2}x_2 - \tfrac{3}{2}x_3)$

(5)    $\dfrac{\partial c}{\partial x_2} = 5 + 1x_1 - 8x_2 = 2(\tfrac{5}{2} + \tfrac{1}{2}x_1 - 4x_2)$

       $\dfrac{\partial c}{\partial x_3} = -3x_1 = 2(-\tfrac{3}{2}x_1)$.

Hence, the partial derivative of $c(x)$ with respect to $x_j$ is twice the value of the
parenthetical expression multiplying $x_j$ in (4). A positive value for a partial derivative
indicates that the objective function is increased by increasing the associated
variable. Note that the partial derivatives are linear functions of the $x_j$. It is this
fact that leads to only a minor modification of the simplex algorithm presented
below.

In the course of this algorithm, you will need to introduce new variables related
to the parenthetical expressions in (4). For reasons that will become clear later on,
these are named free variables. To illustrate, suppose you want to introduce
a free variable (call it $u$) associated with the expression multiplying $x_1$ in (4).
Then construct the following definitional relation for $u$:

(6)    $\tfrac{1}{4}u = 1 - \tfrac{1}{4}x_1 + \tfrac{1}{2}x_2 - \tfrac{3}{2}x_3$.

Again by convention, the coefficient for $u$ is the same as minus the coefficient for
$x_1$ in (6). Rearranging terms in (6), you can express $x_1$ as

(7)    $x_1 = 4 - u + 2x_2 - 6x_3$.

► In applying the algorithm, you will also need to eliminate a variable $x_j$ from a
quadratic expression, and substitute, instead, a linear expression containing the other
variables. For example, you may need to eliminate $x_1$ in (4) and substitute the expression
on the right of (7) in its place. To do this systematically, first eliminate $x_1$ from the
expressions in parentheses:

(i)    $(4 - 1u + \tfrac{9}{2}x_2 - 6x_3) \cdot 1$
     $+ (0 + \tfrac{1}{4}u + 0x_2 + 0x_3) \cdot x_1$
     $+ (\tfrac{9}{2} - \tfrac{1}{2}u - 3x_2 - 3x_3) \cdot x_2$
     $+ (-6 + \tfrac{3}{2}u - 3x_2 + 9x_3) \cdot x_3$.

Second, substitute the expression (7) for $x_1$ in (i); then expand the resultant product,
term by term for each element in (7), and combine with the other expressions in the
parentheses in (i). For example, the third term in the expanded expression is
$(0 + \tfrac{1}{4}u + 0x_2 + 0x_3) \cdot 2x_2$, which is to be combined with
$(\tfrac{9}{2} - \tfrac{1}{2}u - 3x_2 - 3x_3) \cdot x_2$. The complete result is

(ii)   $(4 + 0u + \tfrac{9}{2}x_2 - 6x_3) \cdot 1$
     $+ (0 - \tfrac{1}{4}u + 0x_2 + 0x_3) \cdot u$
     $+ (\tfrac{9}{2} + 0u - 3x_2 - 3x_3) \cdot x_2$
     $+ (-6 + 0u - 3x_2 + 9x_3) \cdot x_3$,

which once again is symmetric. To confirm your understanding, perform the calculations
yielding (i) and (ii).

It is no accident that the coefficients in the row and column for $u$ are all zero (except
the diagonal term), and that the other coefficients are the same as in (i). This simplification
always arises when you substitute an expression based on a relation for a free
variable; hence, in such a case, you need not perform the arithmetic in the second step.
But if, for example, you substitute $(1 + 4x_2 + 4x_3)$ for $x_1$, then you do need to perform
the expansion indicated by the second step. ◄
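The substitution can be checked numerically without redoing the algebra. The sketch below assumes the example function $c(x) = 2x_1 + 5x_2 - \tfrac{1}{4}x_1^2 + x_1x_2 - 3x_1x_3 - 4x_2^2$ and the relation $x_1 = 4 - u + 2x_2 - 6x_3$, and compares the original function with the array obtained after eliminating $x_1$, over a grid of test values. The names and the grid are illustrative.

```python
from fractions import Fraction as F
from itertools import product


def c_original(x1, x2, x3):
    # The example function before any substitution.
    return 2 * x1 + 5 * x2 - F(1, 4) * x1 ** 2 + x1 * x2 - 3 * x1 * x3 - 4 * x2 ** 2


# Array after eliminating x1: rows and columns ordered (1, u, x2, x3).
ARRAY_II = [
    [F(4),    F(0),     F(9, 2), F(-6)],
    [F(0),    F(-1, 4), F(0),    F(0)],
    [F(9, 2), F(0),     F(-3),   F(-3)],
    [F(-6),   F(0),     F(-3),   F(9)],
]


def c_after_substitution(u, x2, x3):
    w = [1, u, x2, x3]
    return sum(w[i] * ARRAY_II[i][j] * w[j] for i in range(4) for j in range(4))


mismatches = []
for u, x2, x3 in product([F(-2), F(0), F(1), F(5, 3)], repeat=3):
    x1 = 4 - u + 2 * x2 - 6 * x3                    # the relation defining x1
    if c_original(x1, x2, x3) != c_after_substitution(u, x2, x3):
        mismatches.append((u, x2, x3))
print(len(mismatches))
```

An empty list of mismatches indicates that the two forms agree at every test point, which is what the symmetric result of the second step asserts.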

Quadratic Simplex Algorithm. The method, in outline, is:

Step 1. Let $s_i = b_i \ge 0$, for $i = 1, 2, \ldots, m$, be the initial feasible basic
solution.

Step 2. Determine directions of local improvement from the current solution.
Stop if there are none. Otherwise, find the optimal step size in the selected direction,
and continue to Step 3.

Step 3. Calculate the new solution accordingly. Return to Step 2.

The status of the algorithm at any iteration can be summarized as follows.
There is a system of linear constraints derived from (2) as well as some additional
relationships that have been imposed during the course of previous iterations.
At Step 1, only the $m$ constraints in (2) are present, but at later iterations there can
be up to $m + n$ constraints in all. Associated with this system is a set of basic
variables, comprised of the $x_j$ and the slack variables $s_i$. The nonbasic variables
consist of the remaining $x_j$, $s_i$, as well as "free variables" that have been introduced
at earlier iterations.

The detailed instruction for determining directions of improvement in Step 2 
is given by the following criterion. 

Quadratic Simplex Criterion I. (a) Select any free variable to enter the solution
if the corresponding partial derivative of the objective function is nonzero. (b) If all
such derivatives for the free variables equal zero, then select a variable (an $x_j$ or $s_i$) to
enter for which the corresponding partial derivative of the objective function is most
positive. (c) Terminate the iterations when the partial derivatives are zero for the free
variables and less than or equal to zero for the other nonbasic variables.

Aside from the changes in wording necessitated by the nature of the nonlinear 
problem, Quadratic Simplex Criterion I is in the same spirit as Simplex Criterion I.
In both cases, the direction of improvement is the result of entering a single 
nonbasic variable. 

The optimal step size calculation resembles Simplex Criterion II. In part, it tests
for the largest value of the selected nonbasic variable that will maintain feasibility
in the constraints. But it also finds the value for this variable beyond which the 
objective function decreases. The optimal step size, then, is the smaller of these 
two values. The rule is stated specifically in the criterion below. 

Quadratic Simplex Criterion II. Introduce the selected nonbasic variable at
a level that yields the most favorable improvement in the objective function, consistent
with the constraints.

The fine details of this criterion are shown fully in the example below. 

Example. Consider the problem

$$\text{maximize}\quad 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \tfrac{1}{3}x_3^2 \tag{8}$$

subject to

$$x_1 + 2x_2 + x_3 + s = 4 \qquad \text{Row 1;} \tag{9}$$

all $x_j \ge 0$ and $s \ge 0$. [You will find it helpful to copy (8) and (9) for future
reference.] Recall that this same $c(x)$ was analyzed in Sec. 14.5, where you saw
that the unique unconstrained optimum is $x_1 = 1$, $x_2 = 1$, and $x_3 = 3$. Verify
that this point is not feasible in (9). Let $s = 4$ be the initial feasible solution for
Step 1; since $s$ is a slack, it does not appear in (8). Using the suggested convention
for arraying the objective function, you can write (8) as

$$
\begin{aligned}
     &(3x_1 + 2x_2 + 1x_3)\cdot 1 \\
{}+{}&(3 - 3x_1)\cdot x_1 \\
{}+{}&(2 - 2x_2)\cdot x_2 \\
{}+{}&(1 - \tfrac{1}{3}x_3)\cdot x_3,
\end{aligned}
\tag{10}
$$

where the zero-valued terms have been suppressed in the formula. Recall that the
leading term in each parenthetical expression is one-half the value of the partial
derivative with respect to the corresponding nonbasic variable. Therefore part
(b) of Criterion I indicates that $x_1$ enters the solution. Explain why.

To find the value for $x_1$, first apply the approach used in Simplex Criterion II
to calculate the maximum value of $x_1$ consistent with feasibility in the linear
constraint. This involves taking the ratios of the current right-hand side to the
coefficients of the entering variable. In the example, the result is

$$x_1 \le \frac{4}{1} = 4 \quad\text{for a step size that satisfies the constraints.} \tag{11}$$

The value of $x_1$ giving the maximum improvement in the objective function,
holding all other nonbasic variables at their current levels, is the solution to

$$\frac{\partial c}{\partial x_1} = 0 \quad\text{or equivalently}\quad \frac{1}{2}\,\frac{\partial c}{\partial x_1} = 0, \tag{12}$$

where the partial derivative is calculated for the value of the current trial solution.
This equation yields $3 - 3x_1 = 0$, so that

$$x_1 \le 1 \quad\text{for a step size that improves the current value of the objective function.} \tag{13}$$

Since the inequality in (13) is more restrictive than that in (11), the Quadratic
Simplex Criterion II specifies that $x_1$ enters at the level 1. This completes Step 2,
and you now continue to Step 3.
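
The step-size rule of Criterion II can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the form in which the method would be coded in practice: the function name `step_size` and its argument layout are hypothetical, and the entering variable is assumed to have a positive half-derivative so that it is being increased.

```python
from fractions import Fraction

def step_size(half_deriv, diag_coef, rhs, column):
    """Return (level, limiting_test) for an entering variable (hypothetical helper).

    half_deriv : constant term in the variable's row of the objective array,
                 i.e. one-half the partial derivative at the trial solution (> 0)
    diag_coef  : coefficient of the variable itself in that row (-3 in 3 - 3*x1)
    rhs, column: right-hand sides and entering-variable coefficients, row by row
    """
    # Feasibility test: smallest ratio over rows with a positive coefficient.
    ratios = [Fraction(b) / a for b, a in zip(rhs, column) if a > 0]
    feas = min(ratios) if ratios else None
    # Objective test: level t at which half_deriv + diag_coef * t falls to zero.
    obj = -Fraction(half_deriv) / diag_coef if diag_coef < 0 else None
    if obj is not None and (feas is None or obj < feas):
        return obj, "objective"
    return feas, "feasibility"

# x1 entering in (10): the ratio test gives 4, the derivative test gives 1.
level, test = step_size(3, -3, [4], [1])
print(level, test)
```

When the objective test is the binding one, the algorithm adds a free-variable relation such as (14); when the feasibility test binds, an ordinary change of basis is made.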

You can achieve $x_1 = 1$ at the next trial solution by adding the definitional
equation for a free variable associated with $x_1$:

$$3u_1 = 3 - 3x_1 \quad\text{or equivalently}\quad x_1 + u_1 = 1. \tag{14}$$

At the current solution, $u_1 = 1$ since $x_1 = 0$. But by pivoting, so that $u_1$ becomes
nonbasic and falls to level 0, you force $x_1$ to the value 1. In summary, impose (14)
along with (9), pivot on $x_1$ in (14), and thereby obtain the set of constraints:

$$
\begin{aligned}
2x_2 + x_3 + s - u_1 &= 3 &&\text{Row 1} \\
x_1 + u_1 &= 1 &&\text{Row 2.}
\end{aligned}
\tag{15}
$$

Eliminate $x_1$ in the objective function (10) by substituting the relation in (14)
to give

$$
\begin{aligned}
     &(3 + 2x_2 + 1x_3)\cdot 1 \\
{}+{}&(-3u_1)\cdot u_1 \\
{}+{}&(2 - 2x_2)\cdot x_2 \\
{}+{}&(1 - \tfrac{1}{3}x_3)\cdot x_3.
\end{aligned}
\tag{16}
$$

Each parenthetical expression still represents one-half the value of the partial
derivative with respect to the corresponding nonbasic variable, but takes into
account that changing such a variable also affects $c(x)$ through the associated
changes in the basic variables. Sometimes this value is called a reduced partial
derivative. The first iteration is complete. The value 3 in the upper-left
corner of (16) equals the level of the objective function at the current trial solution.

Returning to Step 2, you see that the solution can be improved by introducing
$x_2$. The feasibility restriction, according to (15), is that $x_2 \le \tfrac{3}{2} = 1.5$. Verify that
the value of $x_2$ giving the best improvement in the objective function is $x_2 = 1$.
Hence to accomplish Step 3, you introduce another free variable:

$$2u_2 = 2 - 2x_2 \quad\text{or equivalently}\quad x_2 + u_2 = 1. \tag{17}$$

Combine (17) with (15), and pivot on $x_2$ in (17) to obtain

$$
\begin{aligned}
x_3 + s - u_1 - 2u_2 &= 1 &&\text{Row 1} \\
x_1 + u_1 &= 1 &&\text{Row 2} \\
x_2 + u_2 &= 1 &&\text{Row 3.}
\end{aligned}
\tag{18}
$$

Then use (17) to eliminate $x_2$ from (16), giving the reduced objective function:

$$
\begin{aligned}
     &(5 + 1x_3)\cdot 1 \\
{}+{}&(-3u_1)\cdot u_1 \\
{}+{}&(-2u_2)\cdot u_2 \\
{}+{}&(1 - \tfrac{1}{3}x_3)\cdot x_3.
\end{aligned}
\tag{19}
$$

Observe in both (16) and (19) that the first term in the parentheses associated with
the free variables $u_1$ and $u_2$ equals 0. Therefore the partial derivative of the
transformed objective functions (16) and (19) for each free variable equals 0 in
these trial solutions.

Now part (b) of Criterion I indicates that $x_3$ should enter. In performing the step
size computation, you will find that the feasibility calculation based on (18) is
restrictive. So in this case, you perform in Step 3 an ordinary change-of-basis
calculation: the variable $x_3$ enters the basis and $s$ leaves. Since $x_3$ appears only
in Row 1 of (18), no pivoting calculations are needed in the constraints. (If the
original problem had contained more constraints, then pivot calculations would
usually have occurred.) But $x_3$ must be eliminated from (19) by means of Row 1
in (18), giving

$$
\begin{aligned}
     &(\tfrac{20}{3} + \tfrac{2}{3}u_1 + \tfrac{4}{3}u_2 - \tfrac{2}{3}s)\cdot 1 \\
{}+{}&(\tfrac{2}{3} - \tfrac{10}{3}u_1 - \tfrac{2}{3}u_2 + \tfrac{1}{3}s)\cdot u_1 \\
{}+{}&(\tfrac{4}{3} - \tfrac{2}{3}u_1 - \tfrac{10}{3}u_2 + \tfrac{2}{3}s)\cdot u_2 \\
{}+{}&(-\tfrac{2}{3} + \tfrac{1}{3}u_1 + \tfrac{2}{3}u_2 - \tfrac{1}{3}s)\cdot s.
\end{aligned}
\tag{20}
$$

The current trial solution is

$$x_1 = 1, \qquad x_2 = 1, \qquad x_3 = 1, \tag{21}$$

so that all three variables are at positive levels. Is this solution optimal? Not
according to part (a) of Criterion I above, and here is why. The relations (14) and
(17) associated with $u_1$ and $u_2$ were imposed as artifices to ensure taking the
optimal step size in applying Criterion II. There is no reason why $u_1$ and $u_2$ have
to be zero. In fact, $u_1$ and $u_2$ are unconstrained in sign, which is the very reason they are
called free variables. Consequently, if the partial derivative of the objective function
for a free variable is nonzero, then the solution can be improved by moving
this variable in the direction indicated by the sign of the partial derivative. As you
can see in (20), the partial derivatives for $u_1$ and $u_2$ are nonzero. Therefore,
select $u_1$ to enter the solution.

Because the partial derivative for $u_1$ is positive, you should increase $u_1$. The
procedure for finding the new level for $u_1$ is exactly the same as that used at previous
iterations. Check that for feasibility, $u_1$ cannot exceed 1. [If the value of the partial
derivative were negative, then you would examine the constraints in (18) to
ascertain how negative $u_1$ could be without violating the constraints. This is
equivalent to testing how large $(-u_1)$ can be.] The value of $u_1$ that gives the
maximum improvement in the objective function, holding all other nonbasic variables
at their current levels, is $u_1 = \tfrac{1}{5}$, which is the solution to

$$\frac{1}{2}\,\frac{\partial c}{\partial u_1} = \tfrac{2}{3} - \tfrac{10}{3}u_1 = 0 \quad\text{at the current trial solution.} \tag{22}$$

Therefore, to ensure $u_1 = \tfrac{1}{5}$ at the next trial solution, you must introduce a new
free variable $u_3$:

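$$\tfrac{10}{3}u_3 = \tfrac{2}{3} - \tfrac{10}{3}u_1 - \tfrac{2}{3}u_2 + \tfrac{1}{3}s \tag{23}$$

or

$$-\tfrac{1}{10}s + 1u_1 + \tfrac{1}{5}u_2 + 1u_3 = \tfrac{1}{5}. \tag{24}$$

Then use (24) to eliminate $u_1$ from (18), yielding

$$
\begin{aligned}
x_3 + \tfrac{9}{10}s - \tfrac{9}{5}u_2 + u_3 &= \tfrac{6}{5} &&\text{Row 1} \\
x_1 + \tfrac{1}{10}s - \tfrac{1}{5}u_2 - u_3 &= \tfrac{4}{5} &&\text{Row 2} \\
x_2 + u_2 &= 1 &&\text{Row 3,}
\end{aligned}
\tag{25}
$$

and also eliminate $u_1$ from (20), giving

$$
\begin{aligned}
     &(6\tfrac{4}{5} + \tfrac{6}{5}u_2 - \tfrac{3}{5}s)\cdot 1 \\
{}+{}&(-\tfrac{10}{3}u_3)\cdot u_3 \\
{}+{}&(\tfrac{6}{5} - \tfrac{16}{5}u_2 + \tfrac{3}{5}s)\cdot u_2 \\
{}+{}&(-\tfrac{3}{5} + \tfrac{3}{5}u_2 - \tfrac{3}{10}s)\cdot s.
\end{aligned}
\tag{26}
$$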


Since $u_1$ has become a basic variable in (24) and is unrestricted in sign, it can
never again be made nonbasic. Thus, (24) can be dropped in subsequent iterations,
and the set of constraints is merely (25).

In general, as soon as a free variable enters the solution, the corresponding
relation can be dropped; that is why the total number of constraints never exceeds
m + n. (For in the worst case, where all $x_j$ and $s_i$ are basic variables, so that
there are m + n constraints, every nonbasic variable must be a free variable.
After selecting one of these to enter the basis, one of two cases occurs. Either an
$x_j$ or $s_i$ becomes nonbasic, and you then drop the row in which the selected free
variable has become basic. In this case, the total number of constraints is reduced
by one. Or the free variable associated with a newly imposed definitional constraint
becomes nonbasic, and you then drop this relation after pivoting on the
selected free variable. In this case, the total number of constraints is unchanged.)

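Returning again to Step 2, you must introduce $u_2$ since it has a positive partial
derivative in (26). Verify that the step size computations indicate that you must
add another free variable $u_4$:

$$\tfrac{16}{5}u_4 = \tfrac{6}{5} - \tfrac{16}{5}u_2 + \tfrac{3}{5}s. \tag{27}$$

Check that eliminating $u_2$ from (25) yields

$$
\begin{aligned}
x_3 + \tfrac{9}{16}s + 1u_3 + \tfrac{9}{5}u_4 &= \tfrac{15}{8} &&\text{Row 1} \\
x_1 + \tfrac{1}{16}s - 1u_3 + \tfrac{1}{5}u_4 &= \tfrac{7}{8} &&\text{Row 2} \\
x_2 + \tfrac{3}{16}s - 1u_4 &= \tfrac{5}{8} &&\text{Row 3,}
\end{aligned}
\tag{28}
$$

and eliminating $u_2$ from (26) gives

$$
\begin{aligned}
     &(7\tfrac{1}{4} - \tfrac{3}{8}s)\cdot 1 \\
{}+{}&(-\tfrac{10}{3}u_3)\cdot u_3 \\
{}+{}&(-\tfrac{16}{5}u_4)\cdot u_4 \\
{}+{}&(-\tfrac{3}{8} - \tfrac{3}{16}s)\cdot s.
\end{aligned}
\tag{29}
$$

This time, you find in Step 2 that the iterations can terminate. Explain why. The
current solution

$$x_1 = \tfrac{7}{8}, \qquad x_2 = \tfrac{5}{8}, \qquad x_3 = \tfrac{15}{8} \tag{30}$$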


is globally optimal, and the value of the objective function is 7.25. 
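
As an independent check on the arithmetic (not part of the algorithm itself), the terminal solution can be tested against the standard first-order condition for a concave maximization with one binding linear constraint and all variables positive: the gradient of the objective must be a common nonnegative multiple of the constraint coefficients. The sketch below uses exact fractions; the helper names are assumptions made for this illustration.

```python
from fractions import Fraction as F

def c(x1, x2, x3):
    """Objective function (8)."""
    return 6*x1 + 4*x2 + 2*x3 - 3*x1**2 - 2*x2**2 - F(1, 3)*x3**2

def gradient(x1, x2, x3):
    """Partial derivatives of (8) with respect to x1, x2, x3."""
    return (6 - 6*x1, 4 - 4*x2, 2 - F(2, 3)*x3)

x = (F(7, 8), F(5, 8), F(15, 8))    # terminal solution (30)
a = (1, 2, 1)                        # coefficients of constraint (9)

value = c(*x)
slack = 4 - sum(ai * xi for ai, xi in zip(a, x))
# Ratio of each partial derivative to its constraint coefficient.
ratios = [g / ai for g, ai in zip(gradient(*x), a)]

print(value)                         # objective value as an exact fraction
print(slack)                         # 0 means the constraint is binding
print([str(r) for r in ratios])      # equal entries = one common multiplier
```

The common ratio equals 3/4, which is the negative of the full reduced partial derivative for $s$ implied by (29), that is, twice the leading term $-\tfrac{3}{8}$.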


Review. Look back over the approach to see the main ideas. In particular, 
note these properties of the algorithm: 

A. As in the simplex method, the direction of improvement is characterized by 
changing only a single nonbasic variable at each iteration. 

B. As in the simplex method, this new variable cannot exceed a level at which 
a current basic variable falls to zero. 

C. Additional relations are added to the constraints (2) whenever the objective 
function reaches a maximum in the selected direction before any current 
basic variable falls to zero. 

D. Each additional constraint is proportional to the partial derivative of the 
objective function with respect to the entering variable, and hence is linear 
because the objective function is quadratic. 

E. Such constraints are added as artifices to facilitate taking the optimal step 
size in the selected direction. Further, they thereby enlarge the basis, and 
so enable an optimal solution to contain more than m variables at positive 
levels. 

We caution that the preceding discussion did not present the method in a form 
best suited for coding into a computer program; such details are inappropriate for 
this introductory text. The practical necessity of using an electronic calculator for
finding a solution is underscored by the preceding example; even though the 
model consisted of a single linear constraint plus the nonnegativity restrictions, a 
considerable number of calculations were required to find an optimal solution. 
(Thus you would only apply this algorithm manually when solving a toy example.) 

Convergence properties. A rather simple argument (not given here) 
can be constructed to establish that the method terminates in a finite number of 
iterations for any quadratic objective function and set of linear constraints. 
(Actually, the algorithm must be amended to take the same precaution against 
cycling that was mentioned for the simplex method in Sec. 4.7.) But so far, no 
pronouncement has been made on whether the method ensures that the final 
solution is always optimal. We treat this subject next. 

As you must expect from the discussion in Secs. 14.4 and 14.5, unless you make
additional assumptions about the shape of the objective function, the most you
can assert is that the final solution is a relative stationary point. By the same token,
the Sufficient Condition for a Maximum in Sec. 14.5 generalizes to the constrained
case: if the quadratic function is concave, then the terminal solution is globally optimal
among all feasible solutions. (Of course, this sufficient condition for global
optimality is not necessary; that is, the quadratic function may not be concave,
yet the final solution may turn out to be truly optimal.) The quadratic function in
(1) is concave if and only if

$$\sum_{j=1}^{n}\sum_{k=1}^{n} c_{jk}x_j x_k \le 0 \quad\text{for all } x_i, \text{ where } i = 1, 2, \ldots, n. \tag{31}$$

In mathematical terminology, (31) states that the quadratic form is negative
semidefinite.

Most textbooks on linear algebra and matrix theory contain straightforward
computational tests to determine whether a quadratic form satisfies the condition
in (31). Such formulas are easy to verify if n is small, and general procedures can
be programmed for an electronic calculator to make the tests when n is large. In
practice, the very formulation of the model often provides the answer. To illustrate,
in the portfolio selection example in Sec. 14.1, the objective is to minimize
a quadratic form [expression (11)] in which the coefficients are covariances. You
can find a proof in most advanced statistics texts that such a form is positive
semidefinite, that is, the inequality sign in (31) is reversed. Hence on multiplying
the form by (−1) and changing the sense of optimization to “maximize,”
concavity is automatically established.
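
One such computational test can be sketched as follows. It checks the stricter property of negative definiteness by the signs of the leading principal minors (the k-th minor must have the sign of $(-1)^k$), which is sufficient for concavity but does not cover every semidefinite case in (31). The function names are assumptions for this illustration, and exact rational arithmetic is used to avoid rounding questions.

```python
from fractions import Fraction

def det(matrix):
    """Determinant by Gaussian elimination in exact rational arithmetic."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n, sign, result = len(a), 1, Fraction(1)
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero entry in this column.
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for k in range(col, n):
                a[r][k] -= factor * a[col][k]
    return sign * result

def is_negative_definite(q):
    """True if the k-th leading principal minor has the sign of (-1)**k for all k."""
    for k in range(1, len(q) + 1):
        minor = det([row[:k] for row in q[:k]])
        if minor * (-1) ** k <= 0:
            return False
    return True

# Quadratic coefficients of the example in this section:
# -3*x1**2 - 2*x2**2 - (1/3)*x3**2
q_example = [[-3, 0, 0], [0, -2, 0], [0, 0, Fraction(-1, 3)]]
print(is_negative_definite(q_example))           # True
print(is_negative_definite([[1, 0], [0, -1]]))   # False: the form is indefinite
```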

Other algorithms. There are many other algorithms for solving quadratic
programming problems. Of particular importance is one based on using dual
variables; this technique is given in Sec. 15.8, and likewise converges in a finite
number of iterations. The methods in the remainder of this and the next chapter
also can be employed, but, in sharp contrast to the algorithm above, they do not
always converge to a stationary point in a finite number of iterations. The numerical
example in this section will be used to illustrate this fact.

14.7 SEPARABLE PROGRAMMING 

You saw in the preceding section how only a minor modification of the simplex
method is required to solve quadratic programming problems. The approach
exploited the fact that the partial derivatives of a quadratic objective function are
linear. Here we turn to an approach for solving nonlinear problems that involves
a different modification of the simplex method. The technique applies to problems
in which all the nonlinear functions are separable, a property that is described
below. The idea is to construct a constrained optimization model that linearly
approximates the original problem. The approximations enlarge the size of the
model, but since a version of the simplex method can be applied as a solution
technique, the method has considerable practical significance. Since the approach
can be used equally well to approximate a nonlinear objective function and
nonlinear constraints, both cases are treated below.

Separable objective functions. Suppose the objective function can
be written as the sum of n functions, one for each variable:

$$c(x) = \sum_{j=1}^{n} c_j(x_j); \tag{1}$$

such a function is termed separable. For the moment, we leave unspecified
whether $c(x)$ is to be maximized or minimized. To keep the exposition simple,
the technique is outlined only for the variable $x_1$ and the associated function
$c_1(x_1)$; a directly analogous construction applies to each of the other variables.
Also, since only variable $x_1$ is modified below, it is convenient to suppress the
subscript in the notation, and refer merely to $x$.

Assume, again for simplicity, that you know $x$ must lie in the interval $0 \le x \le X$.
To apply the linearizing technique, choose a grid of values for $x$, designated as
$0 = X_1 < X_2 < \cdots < X_{p-1} < X_p = X$, and express any value for $x$ in terms of
a weighted average of the grid values:

$$x = X_1 w_1 + X_2 w_2 + \cdots + X_p w_p, \tag{2}$$

where the weights $w_k$ must satisfy

$$\sum_{k=1}^{p} w_k = 1 \quad\text{and}\quad w_k \ge 0 \quad\text{for } k = 1, 2, \ldots, p. \tag{3}$$

In selecting the grid, always observe the convention that if $c_1(x_1)$ is linear over
a specified interval of values for $x_1$, then no more than two grid points are located
in this interval.

To construct the “approximate” model, substitute the right-hand side of (2)
wherever $x$ appears in the constraints, add the restrictions in (3), and substitute the
piecewise-linear, or polygonal, approximation

$$\sum_{k=1}^{p} c_1(X_k)\,w_k \tag{4}$$

for $c_1(x_1)$ in the objective function.

To illustrate the construction, consider the convex function $c_1(x_1) = x_1^2$ and
$0 \le x_1 \le 10$, as shown in Fig. 14.10.



For the example, let the grid points be 0, 1, 4, 9, 10. Then (2) becomes

$$x = 0w_1 + 1w_2 + 4w_3 + 9w_4 + 10w_5 \tag{5}$$

and (4) becomes

$$0^2 w_1 + 1^2 w_2 + 4^2 w_3 + 9^2 w_4 + 10^2 w_5. \tag{6}$$

Suppose $x = 6$. Then the weights corresponding to representation (5) are $w_3 = \tfrac{3}{5}$,
$w_4 = \tfrac{2}{5}$, and all other $w_k = 0$. This gives the approximate value in (6) of
$4^2(\tfrac{3}{5}) + 9^2(\tfrac{2}{5}) = 42$ for the true value $c_1(6) = 36$. Suppose, further, that
$x_1 = x$ is constrained by

$$5x - 2x_2 \le 1. \tag{7}$$

Then after substituting (5) into (7), the constraint is written as

$$5w_2 + 20w_3 + 45w_4 + 50w_5 - 2x_2 \le 1. \tag{8}$$

The analysis of the numerical example has assumed that the weights used to
give $x = 6$ are $w_3$ and $w_4$, since 6 lies between the grid points $X_3 = 4$ and $X_4 = 9$.
But the weights $w_1 = .4$ and $w_5 = .6$ also yield the value 6 in (5), only in this
case the approximate value for $c_1(6)$ in (6) is $10^2(.6) = 60$. Therefore, in order
that (4) really results in the intended piecewise-linear approximation to $c_1(x_1)$,
it is essential to ensure that a solution employs at positive values either only one
$w_k$, or at most, two adjacent weights $w_k$ and $w_{k+1}$. We can easily establish the
following property.

Property of Adjacent Weights. Given that all the constraints are linear, if
$c(x_1, x_2, \ldots, x_n)$ is to be maximized, and $c_1(x_1)$ is a concave function, then every optimal
solution to the approximate model based on (2), (3), and (4) contains only one $w_k$,
or at most, two adjacent weights $w_k$ and $w_{k+1}$ at positive levels. The same solution property
holds if $c(x)$ is to be minimized and $c_1(x_1)$ is convex.
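
The numerical comparison made just before the Property (42 with adjacent weights versus 60 with nonadjacent ones) can be reproduced with a short sketch. The helper names `adjacent_weights` and `approximate` are assumptions introduced for this illustration; the grid and function are those of the example.

```python
from bisect import bisect_right
from fractions import Fraction

def adjacent_weights(grid, x):
    """Weights on the two grid points that bracket x; all other weights are 0."""
    k = min(bisect_right(grid, x), len(grid) - 1)   # index of right-hand point
    k = max(k, 1)
    left, right = grid[k - 1], grid[k]
    w_right = Fraction(x - left, right - left)
    weights = [Fraction(0)] * len(grid)
    weights[k - 1], weights[k] = 1 - w_right, w_right
    return weights

def approximate(grid, weights, func):
    """Value of the weighted sum (4) for the given weights."""
    return sum(w * func(X) for X, w in zip(grid, weights))

grid = [0, 1, 4, 9, 10]
square = lambda t: t * t

w = adjacent_weights(grid, 6)                   # weight 3/5 on 4 and 2/5 on 9
print(approximate(grid, w, square))             # 42
nonadjacent = [Fraction(2, 5), 0, 0, 0, Fraction(3, 5)]
print(approximate(grid, nonadjacent, square))   # 60
```

Both weight vectors represent $x = 6$ in (5), but only the adjacent pair lies on the polygonal approximation to the curve.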

An important implication of this property is that if the hypotheses are satisfied
for all $c_j(x_j)$, and given that all the constraints on $x_j$ are linear, the transformed
model is a linear programming problem and can be solved by applying the
ordinary simplex method of Chap. 4. In this case, if the original model has m
linear inequality constraints, the enlarged model has m + n constraints, the
additional n restrictions being of the form (3). Hence, it is possible that an associated
optimal basic solution implies that all n of the $x_j$ (as well as all m of the
slack variables) appear at strictly positive levels. Further, the resultant solution
is an approximation to a globally optimal solution of the original problem.

But if the hypotheses are not satisfied (for example, $c(x)$ is to be maximized
and $c_1(x)$ is convex), the solution to the approximate model usually does not satisfy
the Property of Adjacent Weights. This difficulty can be rectified by modifying
the details of Simplex Criterion I so as to force the Property of Adjacent Weights to
hold. Specifically, add the stipulation that at any iteration, if one weight $w_k$ is in
the current basis, then only the adjacent weights $w_{k-1}$ and $w_{k+1}$ are to be considered
for entry into the solution. And if two adjacent weights $w_k$ and $w_{k+1}$ are in
the current basis, then no other weights are considered for entry at the present
iteration. This modification is referred to as a restricted-basis-entry
rule. When the hypotheses are not satisfied and the restricted-basis-entry rule
is used, the resultant solution is only an approximation to a locally optimal solution of the
original problem. (An alternative to the restricted-basis-entry rule is to employ an
integer programming model, described at the end of Sec. 13.2. Although this
approach approximates a globally optimal solution of the original problem, it
is usually less attractive than the restricted-basis-entry technique because it
involves greater computational difficulties.)

Actually, it is possible to construct pathological examples demonstrating that
the restricted-basis-entry rule can cause the simplex method to terminate at other
than a local optimum of the approximate problem. Such a situation occurs when
two adjacent weights $w_k$ and $w_{k+1}$ are in the current basis, and yet, if you introduced
$w_{k+2}$ you would cause $w_k$ to drop from the next basis, and the value of the
objective function would improve. This bizarre event rarely happens.


In place of a weighted average technique, which requires that (3) be added to the
set of constraints, you can employ another approach that leads to adding a set of nonnegative,
upper-bounded variables. The idea can be easily explained in terms of the preceding
example. Rather than (2), express a value for $x$ as a sum:

$$x = y_1 + y_2 + \cdots + y_{p-1}, \tag{i}$$

where $0 \le y_k \le X_{k+1} - X_k$ for $k = 1, 2, \ldots, p - 1$. Therefore in the example,
instead of (5), you have

$$x = y_1 + y_2 + y_3 + y_4, \tag{ii}$$

where the nonnegative and upper-bound constraints are

$$0 \le y_1 \le 1 - 0 = 1 \qquad 0 \le y_2 \le 4 - 1 = 3 \qquad 0 \le y_3 \le 5 \qquad 0 \le y_4 \le 1. \tag{iii}$$

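Then, instead of (4), you use

$$\sum_{k=1}^{p-1} c_k y_k, \tag{iv}$$

where

$$c_k = \frac{c_1(X_{k+1}) - c_1(X_k)}{X_{k+1} - X_k}. \tag{v}$$

Thus $c_k$ equals the slope of the linear approximation to $c_1(x_1)$ between the points $X_k$
and $X_{k+1}$. In the example, (iv) becomes

$$\frac{(1^2 - 0^2)}{1 - 0}\,y_1 + \frac{(4^2 - 1^2)}{4 - 1}\,y_2 + \frac{(9^2 - 4^2)}{9 - 4}\,y_3 + \frac{(10^2 - 9^2)}{10 - 9}\,y_4 = 1y_1 + 5y_2 + 13y_3 + 19y_4. \tag{vi}$$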

If $x = 6$, a set of values for $y_k$ corresponding to representation (ii) is $y_1 = 1$, $y_2 = 3$,
$y_3 = 2$, and $y_4 = 0$, which in (vi) yields the value 42 as an approximation to $c_1(6) = 36$.
Using (ii), the constraint (7) is

$$5y_1 + 5y_2 + 5y_3 + 5y_4 - 2x_2 \le 1. \tag{vii}$$

If all the constraints in the model are linear, you can use the Bounded-Variables
Algorithm explained in Sec. 5.10.

In this formulation, the analogous wording of the Property of Adjacent Weights is that
$y_{k+1}$ is positive only if $y_k$ is at its upper bound $X_{k+1} - X_k$. To see why the property also
holds in this version under the same hypotheses as before, consider the numerical
example. Notice that all the variables $y_k$ in (vii) have the same coefficient, so that they
have an identical effect on the constraints. But the convexity of $c_1(x_1)$ causes the coefficients
in (vi) to be monotone increasing. Hence, if the objective is to minimize $c(x)$,
the variable $y_k$ is always used to its fullest extent before $y_{k+1}$ enters a trial solution.

For this version of the problem, when all the constraints are linear, the Bounded-
Variables Algorithm can also be modified by a restricted-basis-entry rule forcing the
Property of Adjacent Weights. At any iteration, allow $y_{k+1}$ to enter the solution only if
$y_k$ is out of the basis and at its upper bound $X_{k+1} - X_k$. And allow $y_k$ to leave the solution
only if $y_{k+1}$ is out of the basis and at its lower bound 0.
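
The incremental representation can be illustrated with the same grid. The sketch below fills the variables $y_1, y_2, \ldots$ in order, each to its upper bound before the next is used, and evaluates the slope-weighted sum; the function names are hypothetical and chosen only for this example.

```python
def increments(grid, x):
    """Fill y_1, y_2, ... in order, each up to its upper bound X_{k+1} - X_k."""
    ys, remaining = [], x - grid[0]
    for lo, hi in zip(grid, grid[1:]):
        y = min(max(remaining, 0), hi - lo)
        ys.append(y)
        remaining -= y
    return ys

def slopes(grid, func):
    """Slope of the chord between each pair of neighboring grid points."""
    return [(func(hi) - func(lo)) / (hi - lo) for lo, hi in zip(grid, grid[1:])]

grid = [0, 1, 4, 9, 10]
ys = increments(grid, 6)
cs = slopes(grid, lambda t: t * t)

print(ys)                                    # [1, 3, 2, 0]
print(cs)                                    # [1.0, 5.0, 13.0, 19.0]
print(sum(c * y for c, y in zip(cs, ys)))    # 42.0
```

The result agrees with the value obtained from the weighted-average form, as it should, since both describe the same polygonal approximation.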


Separable constraint functions. Suppose the optimization problem
contains a separable-type constraint

$$\sum_{j=1}^{n} a_j(x_j) \le b. \tag{9}$$

Then the linearization techniques explained above may be applied here too.
Again, consider only the variable $x_1$ and the associated function $a_1(x_1)$, and
assume the identification in (2) and (3) is employed. The expression

$$\sum_{k=1}^{p} a_1(X_k)\,w_k \tag{10}$$

is substituted for $a_1(x_1)$ in (9), and represents a polygonal approximation.











If all the constraints are separable, as in (9), and each component is a convex 
function, then using a transformation such as (10) leads to an optimal solution
that satisfies the Property of Adjacent Weights. Such a solution approximates 
a globally optimal solution to the original problem. If one of the component 
functions is not convex, then you may have to force the Property by adding the 
restricted-basis-entry rule to Simplex Criterion I in the manner already described. 
The result may only approximate a locally, but not globally, optimal solution to 
the original problem. 

*Examples. Consider the quadratic programming illustration that was
treated in Sec. 14.6:

$$\text{maximize}\quad (6x_1 - 3x_1^2) + (4x_2 - 2x_2^2) + (2x_3 - \tfrac{1}{3}x_3^2) \tag{11}$$

subject to

$$x_1 + 2x_2 + x_3 \le 4 \quad\text{and all } x_j \ge 0. \tag{12}$$

The terms in $c(x)$ have been grouped in (11) to display the function’s separable
character. Verify that each component function in parentheses is concave. Let the
grid for $x_1$ and $x_2$ be (0, .4, .7, 1), and for $x_3$ be (0, 1, 1.5, 2, 3); let

$$
\begin{aligned}
x_1 &= 0u_1 + .4u_2 + .7u_3 + 1u_4 \\
x_2 &= 0v_1 + .4v_2 + .7v_3 + 1v_4 \\
x_3 &= 0w_1 + 1w_2 + 1.5w_3 + 2w_4 + 3w_5.
\end{aligned}
\tag{13}
$$

Then the constraints for the separable model are

$$
\begin{gathered}
.4u_2 + .7u_3 + 1u_4 + .8v_2 + 1.4v_3 + 2v_4 + 1w_2 + 1.5w_3 + 2w_4 + 3w_5 \le 4 \\
\sum_{k=1}^{4} u_k = 1 \qquad \sum_{k=1}^{4} v_k = 1 \qquad \sum_{k=1}^{5} w_k = 1 \\
\text{all } u_k,\ v_k,\ w_k \ge 0.
\end{gathered}
\tag{14}
$$

Check that the objective function for the separable model is

$$\text{maximize}\quad 1.92u_2 + \cdots + 1.28v_2 + \cdots + 1.6667w_2 + \cdots + 3w_5. \tag{15}$$

The optimal solution is $u_4 = 1$, $v_3 = 1$, $w_3 = .8$, $w_4 = .2$ with an objective-function
value 7.15. This corresponds to $x_1 = 1$, $x_2 = .7$, $x_3 = .8(1.5) + .2(2) = 1.6$
with a value 7.167 in (11). The exact optimal solution is $x_1 = .875$, $x_2 = .625$,
$x_3 = 1.875$ with a value 7.25 in (11).
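
The figures quoted in this example can be checked with a few lines of arithmetic. The sketch below does not solve the linear program; it only computes the objective coefficients of the weights and evaluates the solution reported in the text. All names are assumptions made for the illustration.

```python
def c1(x):
    return 6*x - 3*x**2

def c2(x):
    return 4*x - 2*x**2

def c3(x):
    return 2*x - x**2 / 3

grid12 = [0, .4, .7, 1]        # grid for x1 and x2
grid3 = [0, 1, 1.5, 2, 3]      # grid for x3

# Objective coefficients of the weights in the approximate model.
coef_u = [c1(X) for X in grid12]
coef_v = [c2(X) for X in grid12]
coef_w = [c3(X) for X in grid3]

# Weights of the reported solution: u4 = 1, v3 = 1, w3 = .8, w4 = .2.
u = [0, 0, 0, 1]
v = [0, 0, 1, 0]
w = [0, 0, .8, .2, 0]

dot = lambda a, b: sum(p * q for p, q in zip(a, b))
x1, x2, x3 = dot(grid12, u), dot(grid12, v), dot(grid3, w)
approx_value = dot(coef_u, u) + dot(coef_v, v) + dot(coef_w, w)
true_value = c1(x1) + c2(x2) + c3(x3)

print(round(coef_u[1], 4), round(coef_v[1], 4), round(coef_w[1], 4))
print(round(x1 + 2*x2 + x3, 6))                       # left-hand side of (12)
print(round(approx_value, 3), round(true_value, 3))   # 7.153 7.167
```

The constraint is satisfied with equality at this point, and the gap between 7.153 and 7.167 is the error of the polygonal approximation for $x_3$ between the grid points 1.5 and 2.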

Consider the “reciprocal” problem of maximizing

$$-x_1 - 2x_2 - x_3 \tag{16}$$

subject to

$$-c(x) \le -7.25 \quad\text{and all } x_j \ge 0. \tag{17}$$

The optimal point $x$ is the same as that for (11) and (12). Explain why. Using the
same grid and substitution of variables (13), the optimal solution to the separable
approximation is $u_4 = 1$, $v_3 = 1$, $w_3 = .568$, $w_4 = .432$. This corresponds to
$x_1 = 1$, $x_2 = .7$, $x_3 = 1.716$ with a value of −4.116 in (16) and yields the value
7.270 for $c(x)$.

Obtaining separability. To summarize, suppose the entire optimization
problem can be stated as

$$\text{maximize}\quad \sum_{j=1}^{n} c_j(x_j) \tag{18}$$

subject to

$$\sum_{j=1}^{n} a_{ij}(x_j) \le b_i \quad\text{for } i = 1, 2, \ldots, m, \tag{19}$$

where (19) may also include nonnegativity constraints on the variables. Then
the techniques explained above permit you to convert the model into a problem
in which all the nonlinear functions have been approximated by piecewise-linear
functions. If every $c_j(x_j)$ is concave, and if every $a_{ij}(x_j)$ is convex, then a solution
obtained by the simplex method will be globally optimal for the approximate
model and, hopefully, a good approximation to a globally optimal solution for the
original problem. If these conditions are not satisfied, then by modifying the
simplex method, you can obtain at least a local optimum for the approximate
model.

The importance of the approach hinges on two factors. One is the ability to 
formulate a nonlinear problem so that the objective function and constraints 
contain only separable functions; this aspect is discussed next. The other is the 
severity of the computational burden imposed by the technique; this aspect is 
discussed afterwards. 

Surprisingly many nonlinear expressions can be put into separable form by 
introducing additional variables and definitional constraints. To some extent, the 
techniques are ad hoc—they might even be called “tricks”—and, hence, do not 
lend themselves to systematic cataloguing. But a couple of illustrations will suggest 
the sort of approaches that work. 

Suppose the model contains in the objective function or the constraints (or both)
the product of two expressions:

$$f(x_1, \ldots, x_n)\cdot g(x_1, \ldots, x_n). \tag{20}$$

A simple example would be $x_1 \cdot x_2$. As the first step in obtaining separability,
substitute the single variable $w$ wherever (20) appears. Second, introduce two
more variables $y$ and $z$, and relate them by adding the pair of definitional constraints

$$f(x_1, \ldots, x_n) - (y + z) = 0 \quad\text{and}\quad g(x_1, \ldots, x_n) - (y - z) = 0. \tag{21}$$

If $f(x_1, \ldots, x_n)$ and $g(x_1, \ldots, x_n)$ are themselves separable, then the restrictions
in (21) are separable in all the variables; if not, you will have to apply further
transformations to convert $f(x_1, \ldots, x_n)$ and $g(x_1, \ldots, x_n)$ into separable functions.
Third, to complete the procedure, impose the separable-type constraint
relating $w$ to $y$ and $z$:

$$w - (y^2 - z^2) = 0. \tag{22}$$

This ensures that $w = f(x_1, \ldots, x_n)\cdot g(x_1, \ldots, x_n)$. Explain why. Note that the
function $-y^2$ in (22) is not convex. In short, the expression $f \cdot g$ in (20) is eliminated
everywhere by substituting $w$ in its place, and the constraints (21) and (22) are
added to the model. Verify that if (20) is simply $x_1 \cdot x_2$, then the transformation
requires three additional constraints and three additional variables.
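
The identity behind this transformation, $(y + z)(y - z) = y^2 - z^2$, is easy to confirm numerically. The sketch below solves (21) for $y$ and $z$ given values of $f$ and $g$, and then forms $w$ from (22); the function name is an assumption for this illustration.

```python
def product_via_separable_terms(f, g):
    """Recover f*g from the definitional constraints (21) and (22)."""
    y = (f + g) / 2      # from f = y + z and g = y - z
    z = (f - g) / 2
    return y**2 - z**2   # the value w must take to satisfy (22)

for f, g in [(3, 5), (-2, 7), (1.5, 0)]:
    w = product_via_separable_terms(f, g)
    print(f, g, w, f * g)    # the last two columns agree
```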

As a second illustration, suppose the model contains the expression

$$[h(x_1, \ldots, x_n)]^{f(x_1, \ldots, x_n)}, \tag{23}$$

where $h(x_1, \ldots, x_n) > 0$ for all possible $x_j$. Here the function $e^w$ is substituted for
(23), and the definitional constraint

$$w - f(x_1, \ldots, x_n)\cdot \ln h(x_1, \ldots, x_n) = 0 \tag{24}$$

is added to the model. Although (24) is itself not separable, it can be treated by
the approach used to separate the expression in (20) (where $g = \ln h$).

Computational aspects. As you have seen, converting a nonlinear 
programming model into an approximate version with separable functions
increases the size of the model in two ways: first, the separability transformations
introduce new constraints and variables. Second, the subsequent linearization 
expands the number of constraints and variables even further. If the original 
problem contains only a few nonlinearities, this approach is quite workable. But 
if the model is highly nonlinear, then the technique may not be as efficient as the 
algorithms presented in the next chapter. 

If the objective and constraint functions imply the Property of Adjacent 
Weights, then you can employ the ordinary simplex algorithm, which is available 
for all electronic computers. The modified method that forces this Property is 
not so widely available, but, nevertheless, is obtainable for most large computers. 

In applying the method to an actual problem, you must, of course, consider 
the selection of the grid for each approximation. Broadly stated, the goal is to 
select the grids that will yield a nearly optimal solution for the original problem. 
Although this goal suggests a grid of finely spaced points, the countervailing 
factor is the resultant effect on the size of the approximate problem. Any further 
discussion of this aspect goes beyond the level of this text, but we point out that 
an alternative algorithm for exploiting a separable structure is explained in Sec.
15.10. This technique does not require that you preselect the grids; the method does,
however, involve more computations at each iteration.

* 14.8 DIRECT LINEARIZATION 

Occasionally, a nonlinear objective function $c(x_1, x_2, \ldots, x_n)$ can be optimized
by converting the problem into an equivalent one having a linear objective function
with linear constraints, and then solving it by the simplex method. Of course,
the transformed problem remains a linear programming model if the variables are
further restricted by other linear constraints. Three important examples of direct
linearization are given below.

Minimum sum of absolute deviations. Consider the following 
nonlinear objective function to be minimized: 

(1) $c(x) = \sum_{k=1}^{p} \left| \sum_{j=1}^{n} c_{kj} x_j - f_k \right|.$ 

For the conversion, impose the p linear constraints 

(2) $\sum_{j=1}^{n} c_{kj} x_j - f_k = y_k - z_k$ for $k = 1, 2, \ldots, p,$ 

or, on rewriting, 

(3) $\sum_{j=1}^{n} c_{kj} x_j - y_k + z_k = f_k$ for $k = 1, 2, \ldots, p,$ 

where 

(4) $y_k \ge 0$ and $z_k \ge 0$ for $k = 1, 2, \ldots, p.$ 

The appropriate linear objective function to be minimized in the transformed 
model is 

(5) minimize $\sum_{k=1}^{p} (y_k + z_k).$ 

The idea of the transformation is that if the expression on the left of (2) is positive, 
then $y_k$ equals this value and $z_k = 0$. Similarly, if the expression is negative, then 
$(-z_k)$ equals this value and $y_k = 0$. The optimization in (5) ensures that indeed 
either $y_k$ or $z_k$, or both, will be zero in an optimal solution. The linear programming 
model, then, consists of (5), the constraints (3) and (4), plus any other constraints 
that must be satisfied by the $x_j$ (such as nonnegativity). 
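The role of the split variables in (2) through (5) can be checked numerically. The sketch below is only illustrative: the data `c` and `f`, the trial point `x`, and the helper names `deviations` and `split_deviations` are assumptions made for this example, not part of the text. For a fixed x it builds the $y_k$ and $z_k$ described above and confirms that the linear objective (5) reproduces the nonlinear objective (1).

```python
def deviations(c, f, x):
    """Left-hand expressions of (2): sum_j c[k][j]*x[j] - f[k] for each k."""
    return [sum(ckj * xj for ckj, xj in zip(row, x)) - fk
            for row, fk in zip(c, f)]


def split_deviations(c, f, x):
    """Return (y, z) satisfying (2) and (4), with at most one of y[k], z[k] positive."""
    d = deviations(c, f, x)
    y = [max(dk, 0) for dk in d]    # positive part of each deviation
    z = [max(-dk, 0) for dk in d]   # negative part of each deviation
    return y, z


# Hypothetical data: p = 3 observations, n = 2 coefficients.
c = [[1, 1], [2, 1], [3, 1]]
f = [2, 3, 7]
x = [3, -2]   # an arbitrary trial point, not claimed to be optimal

y, z = split_deviations(c, f, x)
objective_1 = sum(abs(dk) for dk in deviations(c, f, x))   # nonlinear form (1)
objective_5 = sum(yk + zk for yk, zk in zip(y, z))         # linear form (5)
print(y, z, objective_1, objective_5)
```

Adding the same positive amount to both $y_k$ and $z_k$ would still satisfy (3) and (4) but would raise (5), which is why a minimizing solution never needs both to be positive.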

Minimax objective. Next consider as the nonlinear objective function 
to be minimized: 

(6) $c(x) = \text{maximum}\left( \sum_{j=1}^{n} c_{1j} x_j - f_1, \; \ldots, \; \sum_{j=1}^{n} c_{pj} x_j - f_p \right).$ 

To make the transformation, impose the p linear constraints 

(7) $\sum_{j=1}^{n} c_{kj} x_j - f_k \le y$ for $k = 1, 2, \ldots, p,$ 

or, on rewriting, 

(8) $\sum_{j=1}^{n} c_{kj} x_j - y \le f_k$ for $k = 1, 2, \ldots, p.$ 

Here y is a variable unconstrained in sign. The linear objective function for the 
transformed model is 

(9) minimize $y$. 

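Suppose (6) is modified to 

(10) $c(x) = \text{maximum}\left( \left| \sum_{j=1}^{n} c_{1j} x_j - f_1 \right|, \; \ldots, \; \left| \sum_{j=1}^{n} c_{pj} x_j - f_p \right| \right).$ 

Note that 

$\left| \sum_{j=1}^{n} c_{kj} x_j - f_k \right| = \text{maximum}\left( \sum_{j=1}^{n} c_{kj} x_j - f_k, \; f_k - \sum_{j=1}^{n} c_{kj} x_j \right).$ 

Hence, in addition to (8) and (9), the model for the objective (10) includes the 
constraints 

(11) $-\sum_{j=1}^{n} c_{kj} x_j - y \le -f_k$ for $k = 1, 2, \ldots, p,$ 

and $y \ge 0$. The objective function (10) is sometimes called a Chebyshev 
criterion (it minimizes the maximum absolute deviation). 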
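As a small illustration of the Chebyshev model (8), (9), and (11), the sketch below treats the special case of a single coefficient x (n = 1), so the linear program has only the two variables x and y. Instead of the simplex method, it simply enumerates the intersection points of pairs of constraint lines and keeps the feasible one with the smallest y; this brute-force substitute is practical only for tiny instances. The function name `chebyshev_fit_1d` and the data are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def chebyshev_fit_1d(c, f):
    """Minimize y subject to c_k*x - y <= f_k and -c_k*x - y <= -f_k (one coefficient x)."""
    # Every constraint is stored as (a, b), meaning a*x - y <= b.
    cons = []
    for ck, fk in zip(c, f):
        cons.append((Fraction(ck), Fraction(fk)))      # constraint (8)
        cons.append((Fraction(-ck), Fraction(-fk)))    # constraint (11)
    best = None
    for (a1, b1), (a2, b2) in combinations(cons, 2):
        if a1 == a2:
            continue                                   # parallel lines never meet
        x = (b1 - b2) / (a1 - a2)                      # both constraints hold as equalities
        y = a1 * x - b1
        if all(a * x - y <= b for a, b in cons):       # keep feasible vertices only
            if best is None or y < best[1]:
                best = (x, y)
    return best


# Hypothetical data: minimize the maximum of |x - 2|, |2x - 3|, |3x - 7|.
x_opt, y_opt = chebyshev_fit_1d([1, 2, 3], [2, 3, 7])
print(x_opt, y_opt)
```

In this instance the two binding constraints at the optimum come from the second and third deviations, which are equal in absolute value there.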

Both (6) and (10), as well as the objective function (1), which minimizes the 
sum of the absolute deviations, occur in curve-fitting and regression problems. 
The coefficients $c_{kj}$ are calculated for p sets of data on the n independent variables, 
where p > n, and the $f_k$ are the associated values of the dependent variable. The 
$x_j$ are the regression coefficients to be chosen for the curve-fitting expression. 


Ratio of linear forms. Suppose the objective function to be maximized is 

(12) $c(x) = \dfrac{c_0 + \sum_{j=1}^{n} c_j x_j}{f_0 + \sum_{j=1}^{n} f_j x_j}.$ 

(Sometimes this model is referred to as fractional programming, and 
occasionally as hyperbolic programming.) To avoid having to consider a 
host of possibilities in the exposition, assume that the $x_j$ are so constrained that the 
denominator in (12) is strictly positive for all feasible values of $x_j$, and that the 
maximum of $c(x)$ is finite. 

To convert the problem to a linear model, define the variable r as 

(13) $r = \left( f_0 + \sum_{j=1}^{n} f_j x_j \right)^{-1},$ 

and hence (12) can be written as 

(14) $c(x) = c_0 r + \sum_{j=1}^{n} c_j x_j r.$ 

By assumption, r > 0 for all feasible values of $x_j$. Next make a change of variables: 

(15) $y_j = r x_j.$ 

Then the transformed model becomes 

(16) maximize $c_0 r + \sum_{j=1}^{n} c_j y_j,$ 

where, according to (13), the r and $y_j$ must satisfy the linear constraint 

(17) $f_0 r + \sum_{j=1}^{n} f_j y_j = 1,$ 

and $r \ge 0$. Note that the change of variables (15) must also be performed in any 
other constraints on $x_j$. For example, if any additional conditions are linear 

(18) $\sum_{j=1}^{n} a_{ij} x_j = b_i$ for $i = 1, 2, \ldots, m,$ 

the transformed constraints are found by multiplying each relation in (18) by 
r, yielding 

(19) $\sum_{j=1}^{n} a_{ij} y_j - b_i r = 0$ for $i = 1, 2, \ldots, m.$ 

Applications of (12) occur for certain kinds of production scheduling models 
in which the manufacturing processes generate scrap material. To illustrate, the 
denominator represents the total amount of raw material utilized by the production 
schedule, and the numerator the amount that is usable (nonscrap); thus, the 
ratio to be maximized by the schedule is the fraction of usable raw material. 
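The change of variables (13) and (15) can be verified on a small case. The sketch below uses a hypothetical one-variable instance, maximize (1 + 2x)/(3 + x) for 0 ≤ x ≤ 2; the helper names are assumptions introduced for illustration. It checks that every feasible x maps to a pair (r, y) satisfying (17) with the linear objective (16) equal to the ratio (12), and then compares the two endpoints of the transformed feasible segment, where a linear objective must attain its maximum.

```python
from fractions import Fraction


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def ratio_objective(x, c0, c, f0, f):
    """Objective (12)."""
    return (c0 + dot(c, x)) / (f0 + dot(f, x))


def change_of_variables(x, f0, f):
    """Return (r, y) as defined by (13) and (15)."""
    r = 1 / (f0 + dot(f, x))
    y = [r * xj for xj in x]
    return r, y


def linear_objective(r, y, c0, c):
    """Objective (16)."""
    return c0 * r + dot(c, y)


# Hypothetical instance: maximize (1 + 2x)/(3 + x) subject to 0 <= x <= 2.
c0, c = Fraction(1), [Fraction(2)]
f0, f = Fraction(3), [Fraction(1)]

for x in ([Fraction(0)], [Fraction(1, 2)], [Fraction(2)]):
    r, y = change_of_variables(x, f0, f)
    assert f0 * r + dot(f, y) == 1                                   # constraint (17)
    assert linear_objective(r, y, c0, c) == ratio_objective(x, c0, c, f0, f)
    assert [yj / r for yj in y] == x                                 # recover x from (15)

# The transformed feasible set is a line segment whose endpoints are the
# images of x = 0 and x = 2, so it suffices to compare those two points.
endpoints = [[Fraction(0)], [Fraction(2)]]
best_x = max(endpoints,
             key=lambda x: linear_objective(*change_of_variables(x, f0, f), c0, c))
best_value = ratio_objective(best_x, c0, c, f0, f)
print(best_x, best_value)
```

Dividing each optimal $y_j$ by r recovers the original variables, which is why r must be strictly positive at the solution.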

* 14.9 MAXIMIZING A CONVEX OBJECTIVE 
FUNCTION 

Consider the case when all the constraints are linear and the objective function 
to be maximized, $c(x)$, is convex. Such an objective function is sometimes said to 
represent a situation with economies of scale. We continue to assume that $c(x)$ 
is well defined and possesses a finite maximum over the feasible values of x. We 
need not assume, however, that each partial derivative $\partial c/\partial x_j$ is defined and 
continuous for all values of x satisfying the constraints. The convexity assumption 
itself ensures that $c(x)$ is continuous in the interior of the constraint region. The 
function $c(x)$ can have discontinuities at the boundary of the constraint set. For 
example, $c(x)$ can include a setup cost component that is charged only if a variable 
$x_j$ is strictly positive. We can prove the following proposition. 

Form of an Optimal Solution. Given that all the constraints are linear and 
possess a basic feasible solution, if $c(x)$ is convex, well defined, and possesses a finite 
maximum over the feasible values of x, then there exists a basic solution that maximizes 
$c(x)$. 

This result can be exploited to compute an optimal solution if the number of 
linear constraints is small, or if the model has a special structure. A few illustrations 
are given below. 

Suppose the model is a Distribution of Effort Problem, discussed in Sec. 10.2; 
specifically, let the objective be to 

(1) maximize $R(y_1, y_2, \ldots, y_s)$ 

subject to a single linear constraint 

(2) $\sum_{j=1}^{s} H_j y_j = N,$ 

where each $y_j \ge 0$ and is a continuous variable. Assume that $R(y)$ is convex and 
each $H_j > 0$. Then the proposition implies that there is an optimal policy containing 
only one $y_j$ at a positive level. The optimal $y_j$ can be found by comparing 
the values of $R(0, 0, \ldots, N/H_j, \ldots, 0)$ for $j = 1, 2, \ldots, s$. If there are m 
linear constraints, then an optimal solution can be found by comparing the 
objective function for each possible basic solution, of which there are no more than 
$\binom{n}{m}$. When m is small, or n is small relative to m, this enumeration is feasible on 
a large-scale computer. For example, if m = 2, then only $n(n - 1)/2$ possibilities 
have to be considered. 
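The single-constraint comparison just described can be sketched in a few lines. The return function `R`, the coefficients `H`, and the total `N` below are hypothetical values chosen only so that R is convex; `best_single_activity` is an illustrative helper name. Note that the code indexes activities from 0, whereas the text numbers them from 1.

```python
from fractions import Fraction
from math import comb


def best_single_activity(R, H, N):
    """Compare R(0, ..., N/H_j, ..., 0) over j; return (value, j, y) for the best j."""
    best = None
    for j, Hj in enumerate(H):
        y = [Fraction(0)] * len(H)
        y[j] = Fraction(N, Hj)          # put the entire budget N into activity j
        value = R(y)
        if best is None or value > best[0]:
            best = (value, j, y)
    return best


def R(y):
    # Hypothetical convex return function (a sum of convex quadratics).
    return y[0] ** 2 + 2 * y[1] ** 2 + Fraction(1, 2) * y[2] ** 2


value, j, y = best_single_activity(R, H=[1, 2, 1], N=4)
print(value, j, y)

# With m = 2 constraints and n = 6 variables there are at most C(6, 2) bases.
print(comb(6, 2))
```

Only s function evaluations are needed here, one per activity, in contrast to the $\binom{n}{m}$ candidates that arise with m constraints.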

A second illustration is the inventory model with deterministic demand and 
concave costs treated in Sec. 9.5. If you multiply the cost function by (—1), and 
thereby change the sense of the optimization, the resultant model meets the 
assumptions of maximizing a convex objective function subject to linear constraints. 

The special structure of the linear constraints in this deterministic inventory 
model can be exploited to find all the basic feasible solutions. To illustrate, suppose 
demand in every period is positive and initial inventory is zero. Then there 
are only $2^{N-1}$ basic feasible schedules for an N-period horizon, which is smaller 
than $\binom{2N}{N}$. Each of these schedules corresponds to a pattern of periods in which 
the production variable is strictly positive (production in Period 1 must be positive), 
and satisfies the Form of an Optimal Policy given in Sec. 9.5. This complete 
enumeration is appropriate if total cost is a general concave function of all the 
production and inventory variables; the task can easily be done on a high-speed 
computer, provided that N < 12. But if the objective function is also separable 
in the cost functions for each period, then the algorithm in Sec. 9.6 can be applied, 
and the computations reduce to considering only $.5N^2$ possibilities. 
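The enumeration of production patterns can be sketched as follows. The sketch assumes, as one reading of the Form of an Optimal Policy in Sec. 9.5, that each production lot exactly covers demand up to the next period with positive production. The cost function used here (a setup charge plus linear production and holding costs) and all numbers are hypothetical stand-ins for a general concave cost; `enumerate_schedules` and `total_cost` are illustrative names.

```python
from itertools import product


def enumerate_schedules(demand):
    """Yield the 2**(N-1) schedules in which each lot covers demand until the next lot."""
    n = len(demand)
    for flags in product([False, True], repeat=n - 1):
        produce = (True,) + flags        # production in Period 1 must be positive
        x = [0] * n
        last = 0
        for t in range(n):
            if produce[t]:
                last = t                 # a new lot starts in period t
            x[last] += demand[t]         # demand in t is met from the latest lot
        yield x


def total_cost(x, demand, setup, unit, hold):
    """Hypothetical concave cost: setup charge, linear production and holding costs."""
    cost, inventory = 0, 0
    for xt, dt in zip(x, demand):
        inventory += xt - dt
        cost += (setup if xt > 0 else 0) + unit * xt + hold * inventory
    return cost


demand = [3, 2, 4]                       # hypothetical three-period demand
schedules = list(enumerate_schedules(demand))
best = min(schedules, key=lambda x: total_cost(x, demand, setup=10, unit=1, hold=1))
best_cost = total_cost(best, demand, setup=10, unit=1, hold=1)
print(len(schedules), best, best_cost)
```

Because the stand-in cost is separable by period, the algorithm of Sec. 9.6 would also apply to it; the brute-force enumeration is shown because it works for any concave total cost.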

► The Form of an Optimal Solution stated above holds for the case when $c(x)$ is linear, 
and hence convex. Since a linear function is also concave, a basic solution that is locally 
optimal must be globally optimal. These comments provide an alternative justification 
for the conclusion that when the simplex method terminates, the final basic solution is 
globally optimal. 


REVIEW EXERCISES 

1 Suggest several managerial decision-making applications of mathematical programming 
models in which you would expect to find nonlinearities in the objective function 
or constraints. Be as specific as you can in describing the nature of the 
nonlinearities. State whether the relationships would be empirically observed or structurally 
derived (or a combination of both). Comment on whether you could use a 
linear approximation. Discuss what difficulties, if any, might arise in obtaining data 
to give a numerical representation of the nonlinearities. 




2 Discuss why it is usually harder to solve a nonlinear programming problem containing 
n variables and m constraints than it is to solve a linear programming problem of the 
same dimensions. 

3 Show how to modify (6) through (8) in Sec. 14.1 if there is a third constraint, i = 3, 
with a random and 

(a) The joint probability that every constraint is satisfied is at least 

(b) The joint probability that constraints i = 1 and i = 2 are satisfied is that 
i = 2 and i = 3 are satisfied is $\beta_2$, and that i = 3 and i = 1 are satisfied is 

(c) Show how (8) can be transformed into an equivalent constraint containing the 
sum of two nonlinear functions (be sure to state any additional assumptions you 
require to write the transformation). 

4 Discuss the possible decision-making implications of the following situation: you have 
applied an iterative algorithm to solve a nonlinear problem such as that given by 
(4) and (5) in Sec. 14.1, and have obtained a feasible trial solution $x_1^*, x_2^*, \ldots, x_n^*$ 
that yields a value $c(x_1^*, \ldots, x_n^*)$, which is close to being maximal (you conclude this 
from an available upper bound on the maximum value). You have good reasons to 
know, however, that the values of several $x_j^*$ are far from optimal. 

5 Assumptions (i) and (ii) in Sec. 14.3 imply that a function $c(x)$ of a single variable x 
attains its maximum for a value of x in the interval I. In each part below, determine 
whether $c(x)$ attains its maximum in the stated interval, and if not, whether (i) or 
(ii), or both, are not satisfied. 

(a) $c(x) = 7x$ for $3 \le x < 8$. 

(b) c{x) = 

(c) c{x) = 

*/j\ / \ fO if A is rational ^ ^ ^ , 

o,h=™« 

6 In each part below, state the values (if any) of the parameters a and b for which the 
function $c(x)$ is convex for $0 < x \le 1$. Is concave. Is monotonic. Is unimodal. 

(a) $c(x) = ax + b$. (d) $c(x) = a \log bx$. 

(b) c{x) = GA*. (e) Do the same for — 1 ^ a ^ 0. 

(c) c{x) = (f) Do the same for — 1 ^ a < 1. 

*7 In each part below, either prove the statement is true, or give a counterexample 
showing it is false. (You may assume the functions are of a single variable x.) 

(a) The sum of two convex functions is convex. 

(b) The sum of two concave functions is concave. 

(c) The sum of two monotonic functions is monotonic. 

(d) The sum of two unimodal functions is unimodal. 

(e) The sum of a convex and a concave function is either convex or concave. 

(f) The sum of a unimodal function and a convex function is either unimodal or 
convex. 


ix for 0 < A < ^ 

\a — 1 for i A ^ 1. 

I for 1 ^ ^ 3. 
(g) The difference between two convex functions can be convex, or concave, or 
neither, or both. 

(h) If $c(x)$ possesses a well-defined second derivative, then the function is convex if 
$d^2c/dx^2 > 0$ for all x. 

(i) The function $c(x)$ is concave if and only if $-c(x)$ is convex. 

8 In each part below, apply the Method of Golden Sections to the example in Fig. 14.4, 
where 

(a) $5 \le x \le 10$. 

(b) $2 \le x \le 8$. 

(c) $c(x) = 18 - 2x$ for $6 < x \le 12$, and the initial search interval is [0, 10]. 

9 In each part below, apply the Method of Golden Sections to search for a maximum; 
stop after n = 3. 

(a) Case 2 in Fig. 14.1 over the interval $1 \le x \le 4$. 

(b) Case 2 in Fig. 14.1 over the interval $0 \le x \le 3$. 

(c) Case 3 in Fig. 14.1 over the interval $2 \le x \le 4$. 

(d) Case 3 in Fig. 14.1 over the interval $0 \le x \le 2$. 

*(e) Apply the upper bound (5) in Sec. 14.3 for parts (a) and (b). Demonstrate why 
(5) is inappropriate for parts (c) and (d). 

*(f) Apply the Method of Interval Bisection to parts (a) and (b). 

*10 After the description of the Method of Golden Sections in Sec. 14.3, the text states 
three properties of the algorithm, which are based on the definition of r in (4). Prove 
that these properties hold. 

*11 Derive the bound (5) in the special material at the end of Sec. 14.3. 

12 Consider the general algorithm for finding an unconstrained maximum, as described 
in Sec. 14.4. 

(a) Explain what difficulties can arise if you use a fixed step size, instead of 
an optimal step size. 

(b) Use the example in Fig. 14.6 to illustrate the following approach for selecting 
directions. At the first iteration, let the direction be parallel to the coordinate 
axis for $x_1$, that is, let $d_1 = 1$ and all other $d_j = 0$. At the next iteration, let the 
direction be parallel to the coordinate axis for $x_2$, that is, let $d_2 = 1$ and all other 
$d_j = 0$. Continue in this way, eventually returning to $d_1 = 1$ again, and so 
on. (This is known as coordinatewise ascent.) Employ an optimal step size throughout. 
In how many iterations does the method converge? Rotate the axes of the ellipses 
in Fig. 14.6 by 45°. Again illustrate the method and determine the convergence 
behavior. 

(c) Explain in a two-dimensional diagram, like Fig. 14.6, why an optimal step size 
is indicated by a tangency condition on a contour. Try to draw the corresponding 
three-dimensional representation, where the third axis refers to values of the 
objective function. 

13 (a) Verify that if $c(x_1, x_2, \ldots, x_n)$ is a quadratic function, then the expression that is 
maximized to find an optimal step size is also a quadratic function of the step 
size. 

(b) Assume you are given a quadratic function c(x) of a single variable that is strictly 
concave for all x. Derive a formula for the unconstrained maximum of c{x) and 
justify your derivation. 

14 Consider the numerical example (4) and (5) in Sec. 14.5. 

(a) Verify the formula in (6). 

(b) Given (6), perform the intermediate algebra showing the optimal step size is that 
displayed in (7). 

(c) Verify the results for k = 0, 1, and 2 in Fig. 14.8. 

15 Consider the numerical example (4) and (5) in Sec. 14.5. Apply the method of steepest 
ascent with optimal step size for each of the starting points below. Terminate after 
completing the calculations for k = 3. 

(a) $x_1^0 = x_2^0 = 0$ and $x_3^0 = 3$. 

(b) $x_1^0 = 0$, $x_2^0 = 1$, and $x_3^0 = 3$. 

(c) aJ = aJ = 0, and aJ = —f. 

(d) $x_1^0 = 2$, $x_2^0 = 2$, and $x_3^0 = 6$. 

*16 Consider the Newton-Raphson Method of selecting directions, namely, (8) in the 
advanced material of Sec. 14.5. Demonstrate that letting each $x_j^k = 0$ and solving 
the system (8) is equivalent to solving the system $\partial c/\partial x_j = 0$ for every j when $c(x)$ is 
quadratic. 

*17 Explain what happens when the step size is, say, .9 times the optimal step size and 
you employ steepest ascent to determine directions. Use contours like those in Fig. 14.5 
for illustration. Give a plausible explanation of why this so-called subrelaxation approach 
might converge faster than the approach using the optimal step size. 

18 (a) Show that if $c(x)$ is strictly concave for all $x = (x_1, x_2, \ldots, x_n)$, where $c(x^*)$ is the 
maximum value, then there is no other $x \ne x^*$ such that $c(x) = c(x^*)$ [that is, 
there is a unique point at which $c(x)$ reaches its maximum]. 

(b) Show that if $c(x)$ is a concave function, then $c(x_1 + td_1, \ldots, x_n + td_n)$ is a concave 
function of t. 

19 Consider a function $c(x)$ of a single variable and suppose it satisfies conditions (i), (ii), 
and (iii) in Sec. 14.3. 

(a) Is the condition $dc/dx = 0$ at a point $x^*$ necessary or sufficient, or both, for $c(x^*)$ 
to be the unconstrained maximum value? 

(b) Devise a condition for $x^*$ that is both necessary and sufficient for $c(x^*)$ to be 
maximal. (Be careful in your answer, because $dc/dx = 0$ may hold over an interval 
of values for x.) 

20 In each part below, write the function $c(x)$ in a format analogous to (4) in Sec. 14.6. 
Calculate all the first-order partial derivatives. Show a definitional relation, analogous 
to (7), for a free variable $u_j$ associated with $x_j$. 
(a) 3^1 - 2x0 + + 7;cJ -f 3 x 1^2 - 8 x 2^1 + 20^. 

(b) 5^1 -f 9;f3 4- 4xiX2 — 2 x 1 X 3 + + ^ 2^3 — 3^ + ^ 3^1 + 6 x 3 ^ 2 - 

(c) $3(-4x_1 + 2x_2 + 3x_3 - 5)^2$. 

(d) $(2x_1 + 3x_2 + 1)^2 + 2(4x_1 - x_3 + 2)^2 + 3(-5x_2 + 5x_3)^2$. 

*(e) In each of the above expressions, eliminate the variable $x_2$ by substituting in the 
relation for the free variable u. 

*(f) In each of the above expressions, eliminate the variable $x_2$ by substituting in the 
relation $x_2 = 1 - x_1 - x_3$. 

21 Consider the function c(x) in (3) of Sec. 14.6, and the partial derivatives in (5). 
Assume the trial values for each Xj are 0. 

(a) Does the objective function improve if you enter $x_1$? If you enter $x_2$? If you 
enter $x_3$? Describe the behavior of the objective function as you let $x_1$ increase, 
and leave the other $x_j$ equal to 0. Do the same for $x_2$. 

(b) In applying the Quadratic Simplex Criterion II, it is possible that the selected nonbasic 
variable $x_j$ can be made arbitrarily large without violating the constraints. It is 
also possible that $\partial c/\partial x_j > 0$ for all nonnegative values of $x_j$ (assuming the other 
nonbasic variables remain equal to 0). What would happen if these two conditions 
arose simultaneously? What assumption made at the beginning of Sec. 14.6 
rules out these two conditions arising simultaneously? 

22 Consider the example (8) and (9) in Sec. 14.6 solved by the Quadratic Simplex 
Algorithm. 

(a) Show all the intermediate calculations and verify the results in (10) through (30). 

(b) When you reach (20), enter « 2 > instead of ?'|, into the solutl>:.i and continue the 
iterations until you obtain an optimal solution. 

23 Consider the example (8) and (9) in Sec. 14.6. Apply the Quadratic Simplex Algo¬ 
rithm, where the problem is modified as indicated below. 

(a) Let the right-hand-side constant in (9) be 8, instead of 4. 

(b) Let the right-hand-side constant in (9) be 3, instead of 4. 

(c) Let the right-hand-side constant in (9) be 2, instead of 4. 

(d) Add the constraint Xj 4- ^2 :< 1. 

(e) Add the constraint X 2 4- J'fs ^ 2. 

(f) Add the constraint — 2xi 4 - 2 x 3 ^ 1. 

(g) Let the objective function be 6 xi 4- 4 x 2 + 2 x 3 — — 3x|. 

(h) Let the objective function be 2xi -f 6 x 2 4- 4 x 3 — 3xf — 2 a| —- ^xf. 

(i) Add to the objective function the terms —X 1 X 2 — 2 x 2 X 3 . 

24 Apply the Quadratic Simplex Algorithm in Sec. 14.6 to the example in Fig. 14.9, 
where the linear constraints are as indicated in the three cases below. Plot the trial 
solutions on a diagram like Fig. 14.9 to show the progress at each iteration. 

(a) Case 1: Xj -f- 2x2 ^ ^ 

8 x 1 4- 6 x 2 23 

6 x 1 -f- x, ^ 12 . 

(b) Case' 2: Xj 4- 2x2 ^ 5 

16xi 4- 12x2 37 

6xi 4- X 2 ^ 12. 
(c) Case 3: 2^:2 ^ 5 

16a:, -f 12.V2 37 

24.vi 4- 4a*2 ^ 27. 

25 Consider the example of a separable function $c_1(x_1) = x_1^2$ treated in Sec. 14.7. Modify 
the analysis in (5) through (8) when $0 \le x_1 \le 20$, and the grid points are 0, 5, 8, 10, 
15, 20. What weight values in the approximation correspond to $x_1 = 6$ and how 
close is the approximation value? Answer these questions for $x_1 = 9$. For $x_1 = 12$. 
For $x_1 = 17$. 

26 Suppose $c_1(x_1) = \sqrt{x_1}$. Explain how to construct a piecewise-linear approximation 
for $0 \le x_1 \le 10$, using the grid points 0, 1, 4, 9, 10. Show how the linear constraint 
$3x_1 + 4x_2 \ge 12$ appears after substituting the approximation. What weight values 
in your approximation correspond to $x_1 = 6$? 

27 (a) Give a plausible argument establishing the Property of Adjacent Weights in 
Sec. 14.7. (Suggestion: use the example (5) through (8) to illustrate your reasoning; 
remember that $x_1^2$ is a convex function.) 

(b) Suppose you want to maximize $c(x)$ in (1), where $c_1(x_1) = x_1^2$ and $c_j(x_j) = c_j x_j$ 
for all other j, and where you use the approximation in (5) and (6). Assume 
$x_1 = 8$ in a truly optimal solution. What difficulty can arise if you do not use 
a restricted-basis-entry rule? 

(c) Give a plausible argument establishing that if all the constraints are separable, 
as in (9), and each component is a convex function, then using a transformation 
such as (10) leads to an optimal solution that satisfies the Property of Adjacent 
Weights. 

*28 Consider the alternative technique for representing a separable function, as described 
and illustrated in (i) through (vii) of the special material in Sec. 14.7. 

(a) Modify the analysis in (vi) and (vii) when 0 < a, < 20, and the grid points are 
0, 5, 8 , 10, 20. What values for thc^^ correspond to a, = 6 and how close is the 
approximation value? Answer these questions for a, = 9. For a, = 12. For 
A, = 17. 

(b) Suppose ri(A,) = Explain how to apply the method for 0 ^ a, ^ 10, 

using the grid points 0, 1, 9, 10. Show how the linear constraint 3 ai 4- 4 a 2 > 12 

appears after substituting the approximation. What values for the correspond 
to A, = 6 ? 

*29 Consider the example (11) and (12) in Sec. 14.7. 

(a) Write the objective function (15) in full detail. Verify that the stated solution is 
indeed optimal. 

(b) Indicate how to revise (13) through (15) if the grid for both $x_1$ and $x_2$ is (0, .3, .6, 
.8, 1), and for $x_3$ is (0, 1.2, 1.4, 1.6, 1.8, 3). 

(c) Obtain an optimal solution for the approximate problem in part (b). (Hint: 
use the information you have above for the truly optimal solution.) Indicate 
what this answer corresponds to in the original problem. 

(d) Consider the “reciprocal” problem (16) and (17). Find an optimal solution to 
a separable approximation, using the grids in part (b). Indicate what this answer 
corresponds to in the original problem. 

30 Convert each problem below into a separable form. 

(a) maximize X 1 X 2 + XoX^ + subject to 

xj -i- Xj — 2X1X2X3 < 0 

24 + 3 x 2 + 6x2X3 ^ 12 
^3 + ^^1^3 ^ 5 
every X) > 0 . 

(b) maximize (X| + 2 x 0 + 8 x 3 ) (xi + X 3 ) subject to 

(x, - xo + 4 x 3)2 + 3(2 + x,X 2 )^*«*>-^^«> ^ 52 

XiXo > 1 

every xj > 0 . 

(c) maximize 5(2 + XjXg X 3 ) (x 3 /x 4 )^*’”**^ subject to 

X1X2X3X4 < 16 

xi ^ 0 xo > 0 X3 ^ 2 X4 > 2 

(d) maximize 4 x,X 2 X 3 subject to 



X, > 0 Xo > 0 X3 > 1. 


*(e) Suppose after obtaining separability in each part, you then approximate every 
nonlinear separable function by a piecewise-linear function using 10 grid points 
(for each function). Calculate the size of the corresponding linear programming 
problem. Show your analysis in detail. 

*31 Consider the three objective functions (1), (6), and (10) in Sec. 14.8. Suppose the 
context of the problem is fitting a linear regression, where $c_{k1}$ refers to the kth value of 
the independent variable, $c_{k2}$ the “coefficient” of the intercept, and $f_k$ the observed 
value associated with $c_{k1}$. Thus, $x_1$ is the slope of the fitted regression line and $x_2$ is 
the intercept. Specifically, assume $c_{k1} = k$ and $c_{k2} = 1$ for $k = 1, 2, \ldots, 10$, and 

$f_1 = 87$, $f_2 = 100$, $f_3 = 107$, $f_4 = 98$, $f_5 = 99$, 
$f_6 = 108$, $f_7 = 104$, $f_8 = 113$, $f_9 = 116$, $f_{10} = 107$. 

(a) Assuming there are no restrictions on $x_1$ and $x_2$, write in full detail an equivalent 
linear programming model for minimizing $c(x)$ given by (1). By (6). By (10). 

(b) Indicate how your formulations in part (a) change if you also impose the con¬ 
straints i Xj < 2 and Xj > 2 . 

*(c) Write the dual linear programming problems associated with your answers in 
part (a). 

(d) Show how to formulate an optimization model for minimizing the sum of the 
squares of the differences between the linear fit and $f_k$. First assume $x_1$ and $x_2$ are 
unconstrained. Then impose the restrictions in part (b). 

*32 Consider the problem 

minimize [maximum {x — 2, — ix — 

(a) Plot each of the linear functions, show the piecewise-linear curve of the maximum 
of the three functions, and indicate where the minimum occurs. 

(b) Formulate the problem using approach (8) and (9) in Sec. 14.8. Then draw the 
solution space representation for x and y, and show the optimal solution. 

(c) Answer parts (a) and (b), also using (11), for the objective function 

minimize [maximum {\x — 2|, |—a-|, \ — ix — ^j)]. 

If your answers differ, explain why. 


*33 In each part below, assume that $0 \le x \le 1$ and the function $c(x)$ to be maximized is 
as specified. Plot the function $c(x)$ for x in the feasible region. Formulate an equivalent 
linear problem using the approach in (13) through (19) of Sec. 14.8. Draw the solution 
space representation for this equivalent problem and indicate the optimal solution. 


(a) 

(b) 


2 4- 3x 

4-x ’ 

2 - 3x 
4-x' 


(c) 

(d) 


2 -2Ar 

4 4- A ’ 
2 4- 2x 
4 4- A 


*34 Convert the following problem into an equivalent linear model: 

maximize $\dfrac{-3 + 2x_1 + 4x_2 - 5x_3}{6 + 3x_1 - x_2}$ 

subject to 

$x_1 - x_2 \ge 0$ 

$7x_1 + 9x_2 + 10x_3 \le 30$ 

$x_1 \ge 0 \quad x_2 \ge 1 \quad x_3 \ge 0.$ 


*35 Devise an argument establishing the Form of an Optimal Solution in Sec. 14.9. 
[Suggestion: illustrate your reasoning by using Fig. 14.9 where you maximize $-c(x)$.] 

♦36 Find an optimal solution to 

maximize A- 2 yz A- 

subject to 

3 yi + 4^2 + 2^3 = 12 
every ^ 0. 

Justify your answer. 


♦37 Section 14.9 discusses the computational implications of knowing that an optimum 
occurs at a basic or extreme-point solution. Parts (a) and (b) refer to a numerical 
evaluation mentioned in the text. 

(a) Calculate the maximum number of basic feasible solutions to a problem having m 
linear constraints and n unknowns, for m = 2, 3, 4, and n = 6, 7, 8 (altogether 
nine pairs of m and n). 

(b) Calculate $2^{N-1}$ and $\binom{2N}{N}$ for N = 1, 2, 3, 4, 5, 6. 

38 Explain your understanding of the following terms: 


nonlinear programming 
end-point (or extreme-point) solution 
interior point 
boundary point 
unconstrained maximization 
(minimization) 

*supremum 
discontinuity 
continuous function 
bounded function 
convex function 
concave function 
strictly convex (concave) function 
*quasi-convex (-concave) function 
unimodal function 
adaptive search 
local maximum (minimum) 
global maximum (minimum) 
inflection point 
Method of Golden Sections 
*Method of Interval Bisection 
necessary and/or sufficient condition 
direction 

step size (optimal step size) 
contours 

method of steepest ascent 


myopic rule 
gradient 

*directional derivative 
infinite convergence 
convergent sequence (subsequence) 
orthogonal 
*zigzagging 

*Newton-Raphson Method 
*modified gradient 
*subrelaxation 
stationary point 

quadratic programming problem 
Quadratic Simplex Algorithm 
free variable 

reduced partial derivative 
quadratic form 

negative (positive) semidefinite 
separable function 
piecewise-linear (or polygonal) 
approximation 

Property of Adjacent Weights 
restricted-basis-entry rule 
*direct linearization 
*Chebyshev criterion 

*fractional (or hyperbolic) programming. 


COMPUTATIONAL AND FORMULATION EXERCISES 

39 Consider the Quadratic Simplex Method described in Sec. 14.6. 

(a) Suggest a way to apply the algorithm where not every $b_i \ge 0$ and not every $s_i$ is 
present in the constraints (2). 

(b) Discuss how the algorithm operates if there are no linear and nonnegativity constraints. 
Illustrate your discussion with the objective function (8). 

(c) Explain the way the algorithm proceeds in the terminology of Sec. 14.4, namely, 
in terms of selecting a set of directions and choosing a step size. Be sure you show, 
explicitly, what directions are taken at each iteration. Also comment on whether 
the step size is optimal. [Suggestion: illustrate your answer by using the example 
(8) and (9) in Sec. 14.6.] 

40 Consider the objective function: maximize $6x_1 - 2x_1^2 + 2x_1x_2 - 2x_2^2$. 

(a) Solve by the method of steepest ascent, starting at $x_1^0 = x_2^0 = 0$. Illustrate the 
progress of the iterations on a two-dimensional diagram. Stop after k = 5 iterations. 

(b) Rework part (a), starting at $x_1^0 = x_2^0 = 10$. 

(c) Impose the constraints Xi + ^2 ^ 2, X| ^ 0, and X 2 ^ 0. Solve by the Quadratic 

Simplex Method. Illustrate the progress of the iterations on a solution space 
diagram. 

(d) Rework part (c), letting x, -}- X 2 3. 

(e) Rework part (c), letting X| -f X 2 < 7 . 

(f) Rework part (c), letting the linear constraint be 2x, + Xj < 2. 

(g) Rework part (c), letting the linear constraint be Xi -f- 2 x 2 ^ 2. 

*(h) Solve the unconstrained problem using the Quadratic Simplex Method. 

(i) Rework part (c), adding the constraint Xj < 1. 

(j) Rework part (c), adding the constraint x, ^ li^. 

(k) RewoTk part (c), adding the constraint X 2 < i, 

(l) Rework part (c), adding the constraint X 2 < i^. 

41 Consider the objective function: maximize $36x_1 + 36x_2 - 5x_1^2 - 8x_1x_2 - 5x_2^2$. 

(a) Solve by the method of steepest ascent, starting at $x_1^0 = x_2^0 = 0$. Illustrate the 
progress of the iterations on a two-dimensional diagram. Stop after k = 5 iterations. 

(b) Rework part (a), starting at $x_1^0 = x_2^0 = 3$. 

(c) Impose the constraints Xi -f X 2 < 2, Xj > 0 and Xo > 0. Solve by the Quadratic 

Simplex Method. Illustrate the progress of the iterations on a solution space 
diagram. 

(d) Rework part (c), but let the linear constraint be X| -f 3 x 2 ^ 2. 

(e) Rework part (c), adding the constraint Xj < 

(f) Impose the constraints Xj — X 2 < 3, —x, + .vo < 3, x, > 0, and Xg ^ 0. Solve by 
the Quadratic Simplex Method. Illustrate the progress of the iterations on a 
solution space diagram. 

42 Consider the objective function: 

maximize 16 xi -f IGxj 4- 40 x 3 “ H' 6x1X2 — 6x1X3 — 1 Ixg 4- 8x3X3 — 1 Ixj. 

(a) Solve by the method of steepest ascent, starting at xj = xj = xj = 0. Stop after 
k = 5. 

(b) Rework part (a), starting at xj = atJ = aJ = 3. 

(c) Impose the constraints Xi 4“ X 2 -f- X 3 < 4, and every Xj > 0. Solve by the 

Quadratic Simplex Method. 

♦(d) Solve the unconstrained problem using the Quadratic Simplex Method. 

43 Consider the objective function: 

maximize 40x| 10 x 2 4- 30x3 — OxJ — 8 x 1 X 3 — 4 x 1 X 3 — 1 lx$ 4" 14 x 3 X 3 — 9x^. 

(a) Solve by the method of steepest ascent, starting at x^ = xj = xj = 0. Stop after 
it = 5. 

(b) Rework part (a), starting at xj = xj = xj = 5. 

(c) Rework part (a), starting at xj ^ 2, x? = 3, xj = 1. 

(d) Impose the constraints Xi + x^^ 3 and every xj ^ 0. Solve by the 

Quadratic Simplex Method. 

♦(e) Solve the unconstrained problem using the Quadratic Simplex Method. 
44 Consider a mathematical optimization model consisting of n nonnegative variables, 
a linear objective function, m — 1 linear constraints, and one quadratic constraint, 

2 + 22 ^ 

j~\ it = l 

where the left-hand function is concave. Suggest a computational scheme for finding 
a numerical solution to a particular problem that employs linear and quadratic 
programming algorithms. 

45 Consider the problem 

maximize 2 ^ + Ar; 

subject to 

4 - ^2 ^ 2 X, > 0 and Afo ^ 0. 

(a) Solve by inspection, 

(b) Apply the Quadratic Simplex Algorithm. First, let the initial basic solution be 
“all slack,” Next, let the initial basic solution be 1. Discuss the behavior of 
the algorithm on this problem. 

(c) Transform the model into an approximate problem using the separable programming 
technique in Sec. 14.7. Let the grid for $x_1$ be (0, j, 1), and for be 
(0, 1,1, 2). Apply the simplex method to find a solution. Discuss your result. 
(Do you have to use the restricted-basis-entry rule?) 

46 Consider the problem 

maximize 6 .Vi -f 8 x 4 — ^ — ^2 

subject to 

4x5 4- xl ^ 16 

3x| 4“ 5x4 15 

X, ^ 0 and Xj > 0. 

(a) Formulate the model into an approximate problem using the separable program¬ 
ming technique in Sec. 14.7. Let the grid for x, be (0, 1, |, 2), and for Xj be 

(0,1, 2, 3, 4). 

(b) Find an optimal solution to the approximation in part (a). Draw a solution 
space diagram and indicate the progress of the iterations as well as the truly 
optimal solution. 

*(c) Change the signs of all the coefficients in the objective function and reverse the
inequality sign in the first constraint. Alter your formulation in part (a) accordingly.
Apply the simplex method to find a solution. (Do you have to use the
restricted-basis-entry rule?) Draw a solution space diagram and indicate the
progress of the iterations as well as the truly optimal solution.

47 One-Potato, Two-Potato Problem (Sec. 1.6). Recall in this example that there are two 
sources of supply, each source yielding different fractions of the products French 
fries, hash browns, and flakes. Suppose that it is possible, at differing costs, to alter 
these yields somewhat. Let $f_1$, $f_2$, and $f_3$ be the fractional yield per unit of weight of
Source 1 potatoes made into the three products; similarly, let $g_1$, $g_2$, and $g_3$ be the
yields for Source 2. Suppose that each $f_i$ and $g_i$ can vary within plus or minus 10%
of the yields shown in Fig. 1.1. Let $r_1(f_1, f_2, f_3)$ and $r_2(g_1, g_2, g_3)$ be the expense associated
with obtaining the corresponding yields.

(a) Formulate the appropriate optimization model. 

(b) Apply the techniques in Sec. 14.7 to convert the problem into separable programming
form. Discuss whether you would apply a restricted-basis-entry rule to
obtain a numerical solution.

48 The Tooke Gas Company desires a scientific approach to setting equitable salaries.
The company plans to base its pay structure on the following point system. The
Personnel Department rates each type of job according to three different characteristics
(such as skill required, responsibility exercised, seniority demanded). Suppose
that the first factor is evaluated for five levels of achievement, designated by A1,
A2, A3, A4, or A5, where A1 is the lowest and A5 the highest level of attainment.
Similarly, the second factor is evaluated B1, B2, B3, or B4, where B1 is the lowest
and B4 the highest level. And the third factor is scored as C1, C2, C3, C4, or C5, where
C1 is the lowest and C5 the highest level. The company wants to assign a dollar value to
each level within each characteristic, and thereby to determine the average salary for
a type of job according to the sum of the assigned dollar values. For example, suppose
a job type is rated A2, B3, and C5. If A2 has been assigned a value of $300, B3 a
value of $400, and C5 a value of $250, then the resultant average wage for the job is
($300 + $400 + $250 = $950).

Assume the company has evaluated 5 job types, and thinks that a 10% increase in
the present average salary for each job is fair. Let a_p, b_p, and c_p be the ratings of the
three categories for the pth job, and let S_p be 110% of the present average salary. The
company wants to assign values for A1, A2, ..., A5, B1, ..., B4, C1, ..., C5 so as to
minimize the sum of squares between the resulting new average wage and S_p, for
p = 1, 2, ..., 5. Show how to formulate this problem as a nonlinear programming
model. Assume each assigned value must be at least $20 greater than the next lower
level (that is, the value of A2 must be at least $20 greater than the assigned value of
A1), and that the assigned values for A1, B1, and C1 must each be at least $25.
Illustrate the constraints for the jobs using the data that Job 1 has ratings A3, B1, and
C4, and S_1 = $10,000; and Job 2 has ratings A2, B3, and C5, and S_2 = $12,000.

49 Consider the regression model $y = f(x) + \text{error}$, where $y$ is the dependent variable
and $x$ is the independent variable. Assume you have available $p$ paired observations
of the independent and dependent variables, denoted by $x_k$ and $y_k$ for $k = 1, 2, \ldots, p$,
and wish to estimate the function $f(x)$ from these data. Suppose you assume no specific
form for $f(x)$, such as a linear or quadratic function. For simplicity, assume that the
values of $x_k$ are distinct, that is, $x_i \ne x_j$ for $i \ne j$, and the $x_k$ are ordered such that
$x_1 < x_2 < \cdots < x_p$. Let $f_k$ represent the value of $f(x_k)$ in the estimated regression
function. (Note that $f_k$ may have any sign.)

(a) Write a set of equations, one for each $k$, determining the error $e_k$ between the
estimated value $f_k$ and the observed value $y_k$ of the dependent variable. (Note
that $e_k$ may have any sign.)

(b) If there are no further constraints on the $f_k$, state what their optimal values would
be so as to minimize the sum of $e_k^2$ for $k = 1, 2, \ldots, p$.
(c) Explain how to constrain the $f_k$ such that the estimated function for $f(x)$ is monotonic
increasing. Is monotonic decreasing. Is increasing only for x ^

(d) Explain how to constrain the $f_k$ such that the estimated function for $f(x)$ is convex.
Is concave.

(e) Explain how to constrain the $f_k$ such that estimated values of $f(x_k)$ and $f(x_{k+1})$ differ
by no more than $d > 0$. Differ by at least $d$.

(f) Indicate how the above formulations can be altered if there are several $x_k$ that
are equal.

*(g) Discuss the type and size of model you obtain if the objective is to minimize the
sum of absolute deviations $|e_k|$. If the objective is to minimize the maximum
absolute deviation.

*50 (a) Consider the problem of “minimizing the sum of absolute deviations” given by
(1) in Sec. 14.8. (Assume that each $x_j$ is unrestricted in sign.) Write the dual
linear programming problem. Show how the dual problem can be transformed
to a problem having n linear constraints and p nonnegative upper-bounded
variables.

(b) Consider the problem of “minimizing the maximum absolute deviation” given
by (10) in Sec. 14.8. (Assume that each $x_j$ is unrestricted in sign.) Write the dual
linear programming problem.

51 Trim Problem. Consider the model described in exercise 25 of Chap. 2. For simplicity,
alter the data as follows. Assume the jumbo reel is 44 inches wide, the customer
orders are for four reels of 13-inch width, and four reels of 14-inch width. Suppose the
company considers using only two cutting combinations, namely, (two reels each 13
inches wide, one reel of 14-inch width) and (one reel of 13-inch width, two reels each 14
inches wide).

(a) Formulate the associated linear programming model where the objective is to
minimize total trim. Draw a solution space diagram and indicate the optimal
solution.

(b) Suppose, instead, that the objective is to minimize the fraction of trim waste, that
is, the ratio of total trim to total liner board used, inclusive of trim waste. Assume
that an optimal solution is at an extreme point of the feasible region and find
the best schedule. Comment on the relative merit of the solutions in parts (a)
and (b).

(c) Suppose, instead, that the objective is to minimize the ratio of total trim to total
liner board shipped to the customers (that is, total liner board used exclusive of
trim waste). Assume that an optimal solution is at an extreme point of the feasible
region and find the best schedule. Comment on the relative merit of the solutions
in parts (a) and (c).

*(d) Utilize the formulation (12) through (19) in Sec. 14.8 in parts (b) and (c) to
establish equivalent linear programs. Verify that the solutions you found are
optimal in these equivalent linear programs.
MIND-EXPANDING EXERCISES

52 (a) Consider the function $g(c)$ which is defined as

$g(c_1, c_2, \ldots, c_n) = \text{maximize } \sum_{j=1}^{n} c_j x_j$

subject to

$\sum_{j=1}^{n} a_{ij} x_j \le b_i$ for $i = 1, 2, \ldots, m$

every $x_j \ge 0$.

Thus $g(c)$ represents the optimal value for a linear programming problem having
objective function coefficients $c = (c_1, c_2, \ldots, c_n)$. Determine if $g(c)$ is a convex
or a concave function.

(b) Consider the function $h(b)$ which is defined as

$h(b_1, b_2, \ldots, b_m) = \text{maximize } \sum_{j=1}^{n} c_j x_j$

where the $x_j$ are subject to the constraints in part (a). Thus $h(b)$ represents the
optimal value for a linear programming problem having right-hand-side constants
$b = (b_1, b_2, \ldots, b_m)$. Determine if $h(b)$ is a convex or a concave function.
Comment on whether your result means that the optimal value for the ith dual
variable overstates or understates the benefit of having an additional unit of the
scarce resource for the ith constraint.

53 Prove the Sufficient Condition for a Maximum of a Concave Function that is stated 
in Sec. 14.5. 

54 Suppose $c(x_1, x_2, \ldots, x_n)$ is a concave function. Consider two points
$w = (w_1, w_2, \ldots, w_n)$ and $y = (y_1, y_2, \ldots, y_n)$. Show that the function
$C(t) = c(tw + (1 - t)y)$ is concave in $t$, where $0 \le t \le 1$. What is the
corresponding result if $c(x_1, x_2, \ldots, x_n)$ is convex?

55 Section 14.4 states a Necessary Condition for a Maximum of $c(x_1, x_2, \ldots, x_n)$ in terms
of the first-order partial derivatives. Assume that $c(x)$ possesses continuous second-order
partial derivatives. Show that, in addition, if $c(\bar{x})$ is maximal, then the quadratic
form

$\sum_{j=1}^{n} \sum_{k=1}^{n} \frac{\partial^2 c(\bar{x})}{\partial x_j\, \partial x_k}\, y_j y_k$

must be negative semidefinite. [In linear algebra books, this condition is sometimes
stated as “the Hessian of the function $c(x)$ evaluated at $\bar{x}$ must be negative
semidefinite.”]

56 Consider the separable programming model in Sec. 14.7. Suppose the objective function,
which is to be minimized, contains $c_1(x_1) = x_1^2$, where $0 \le x_1 \le 10$.
(a) Explain how the value of $c_1(x_1)$ can be approximated by a value $y$, where $y$ is
constrained by

$y \ge \text{maximum}\,(p_1 x_1 + q_1,\ p_2 x_1 + q_2,\ \ldots,\ p_K x_1 + q_K)$.

Using the illustration in Fig. 14.10, let $p_1 = 1$ and $q_1 = 0$, corresponding to the
first linear segment. What are the other values for $p_i$ and $q_i$ in Fig. 14.10? Comment
on the merits of this approximation as compared to the approximation in (5)
through (18).

(b) Discuss what shape characteristics a function $c(x)$ of a single variable $x$ must have
in order for the approximation approach in part (a) to be valid.

(c) Assume that the sense of optimization is maximization. Will the approach in
part (a) work? Explain. Give a formulation analogous to that in part (a) that
will work if $c(x)$ is a concave function.

*(d) Suggest how the approach in part (a) can be generalized to handle convex
functions $c(x_1, x_2, \ldots, x_n)$ of several variables.

57 Consider the function $c(x)$ of a single variable $x$, where $0 \le x \le U$. Suppose you
want to approximate $c(x)$ by a continuous piecewise-linear function having $n$ segments.
Assume that the vertices of the approximation lie on $c(x)$ [that is, the approximation
has the same value as $c(x)$ at each vertex], and that the vertices can occur only at
values $x = 0, 1, 2, \ldots, U$, where $U$ is integer-valued and much larger than $n$.

(a) Devise a dynamic programming formulation to find an approximation that
minimizes the maximum absolute deviation between the approximation and $c(x)$
for $x = 0, 1, 2, \ldots, U$.

(b) Show how to alter your formulation in part (a) to minimize the sum of the
absolute deviations between the approximation and $c(x)$ at $x = 0, 1, 2, \ldots, U$.

*58 Consider the fractional programming model and its equivalent linear programming 
formulation (12) through (19) in Sec. 14.9. 

(a) Suppose you have found an optimal solution to the equivalent linear program 
using the simplex method. Assume xj^ is a nonbasic variable in the optimal solution. 
Explain how you can find an interval of values for c* over which the solution 
remains optimal. Do the same forDo the same for an \Suggestion: let Wi 
be the dual variable corresponding to equation (19) and be the dual variable 
corresponding to equation (17).] 

(b) Assume that > 0. Show how you can use (17) to eliminate the variable r so 
that the equivalent linear program contains only m constraints. Show how to 
modify your answer if^i = 0. 

(c) Devise an algorithmic technique that maintains the m constraints in their original
form, uses Criterion II of the simplex method to select the basic variable to remove,
employs the usual pivoting procedure, but that calculates $\partial c(x)/\partial x_j$, evaluated at
the current basic solution, for each nonbasic variable and enters that variable
having the most positive value for its partial derivative. Explain how this approach
differs, if at all, from applying the simplex method to the equivalent linear
program (16), (17), and (19).
CHAPTER 15


Advanced Techniques in
Nonlinear Programming †


15.1 LARGE-STEP APPROACHES


In the previous chapter, you saw how to solve several important cases of nonlinear
programming problems. Each technique exploited some special form of
the objective function (quadratic or separable) as well as a special form of the
constraints (linear or separable). Now you will study the solution of more general
nonlinear programming problems.

The techniques presented below do not permit the objective function $c(x)$
to be completely arbitrary. It must possess at least the smoothness properties
assumed in the case of unconstrained maximization treated in Sec. 14.4. In the
initial four sections of this chapter, we assume that all the constraints are linear.
Then, starting in Sec. 15.5, we treat models containing nonlinear constraints.
These restrictions must also satisfy certain smoothness and shape properties.

To keep matters clear, we repeat the optimization model to be solved along with
the postulates on $c(x)$. In Secs. 15.2 through 15.4, we consider the problem


(1)  maximize $c(x_1, x_2, \ldots, x_n)$

subject to

(2)  $\sum_{j=1}^{n} a_{ij} x_j + s_i = b_i$ for $i = 1, 2, \ldots, m$,

all $x_j \ge 0$ and all $s_i \ge 0$.

† This chapter requires a knowledge of differential calculus.

To avoid an elaborate discussion that would be necessary to handle “all cases,”
assume that the constraints (2) satisfy the stipulations:

(i) All $b_i \ge 0$, and hence there exists a feasible basic solution to the constraints,
namely, all $x_j = 0$ and each $s_i = b_i$.

(ii) The constraints (2) are sufficiently restrictive so that the maximum of
$c_1 x_1 + c_2 x_2 + \cdots + c_n x_n$ is finite over all feasible $x$, for arbitrarily chosen
constants $c_j$.

Assumption (i) ensures that the model has a feasible solution, and trivializes
the task of finding such a solution. If assumption (i) is dropped, then the algorithms
must be expanded to show how to find a feasible solution when one exists.

Assumption (ii) is sometimes termed a regularization postulate.
It serves two purposes in explaining the algorithms below. First, it eliminates
having to deal with unbounded solutions arising during the intermediate steps of
the methods, when $c(x)$ is approximated by a linear function. Second, when coupled
with the smoothness assumptions on $c(x)$, it rules out an unbounded optimal
solution to the original problem (1) and (2). [One way to ensure assumption (ii)
is to include in (2) a constraint that the sum of all the variables cannot exceed a
large, but finite, upper bound.] If assumption (ii) is eliminated, then the steps
of the algorithms have to be amended in order to handle the possibility of an
unbounded solution at an intermediate iteration.

Assume that $c(x)$ also satisfies the following smoothness and shape conditions:

(iii) $c(x)$ is single-valued and finite for each $x$ satisfying the constraints (2).

(iv) Every partial derivative $\partial c(x)/\partial x_j$ is single-valued, finite, and continuous,
and hence $c(x)$ is continuous at each $x$ satisfying the constraints (2).

(v) $c(x)$ possesses a finite maximum $\bar{c}$ over all values of $x$ satisfying the
constraints (2).

If you compare (iii) through (v) with assumptions (i) through (iii) made in Sec.
14.4, you will find (iii) through (v) apply to only those $x$ that are feasible in the
constraints. The condition that $c(x)$ reaches its maximum for finite values of $x_j$ is
implied by assumptions (i) through (v). Assumptions (iii) through (v) can be
weakened, at the expense of increasing the number of cases that have to be
treated in the steps of the algorithms.

Preliminary observations. Note that formulation (1) and (2) includes,
as a special case, the quadratic programming problem considered in Sec. 14.6.
Therefore, the same kinds of optimal solutions that arise there can also occur
here. Specifically, an optimal solution may be in the interior of the constraint
region; or on the boundary but not at an extreme point; or at an extreme point.
Since an optimal solution need not be at a vertex of the constraint set, more than
$m$ of the $x_j$ variables may be strictly positive in an optimal solution. Further, all
the slack variables $s_i$ may be positive, which occurs when an optimal solution is
in the interior of the constraint region. Be sure to see how each algorithm below
allows for the possibility that all $n$ of the $x_j$ and all $m$ of the $s_i$ may be strictly
positive in an optimal solution.

As in the case of unconstrained maximization, the algorithms may not always
converge to a globally optimal solution unless $c(x)$ is specially shaped, for example,
concave. And even when $c(x)$ approaches a limiting value, convergence
usually does not occur in a finite number of iterations.

Many of the algorithms in the chapter can be conveniently explained by means
of a format similar to that employed in Secs. 14.4 and 14.5 for the “steepest ascent
with optimal step size” algorithm; you should review that material before reading
further. The large-step procedure in general is

Step 1. Select an arbitrary initial feasible point $x^0$.

Step 2. (a) At iteration $k$, determine the directions $d_j^k$ for $j = 1, 2, \ldots, n$.
(b) Find a step size $t^k$ such that $c(x)$ is maximized over all feasible points lying
in the selected direction from $x^k$. Terminate the calculations if $t^k = 0$. Otherwise,
continue to Step 3.

Step 3. Compute the new trial point

(3)  $x_j^{k+1} = x_j^k + t^k d_j^k$ for $j = 1, 2, \ldots, n$.

Return to Step 2, where $x^{k+1}$ replaces $x^k$.
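The three steps can be sketched as a short loop. The following is only an illustrative outline, not one of the chapter's algorithms: the names `large_step`, `direction`, and `step_size` are hypothetical, and the demonstration problem (maximize $-(x_1 - 1)^2 - (x_2 - 2)^2$ subject to $x_1 + x_2 \le 10$ and nonnegative variables) is an assumed toy example whose optimum is interior, so the gradient happens to serve as a workable direction.

```python
def large_step(x0, direction, step_size, max_iter=100, tol=1e-12):
    """Steps 1-3: repeat x <- x + t*d until the best feasible step size is zero."""
    x = list(x0)                                   # Step 1: initial feasible point
    for k in range(max_iter):
        d = direction(x)                           # Step 2(a): directions d_j
        t = step_size(x, d)                        # Step 2(b): optimal feasible step
        if t <= tol:                               # terminate when t = 0
            return x, k
        x = [xj + t * dj for xj, dj in zip(x, d)]  # Step 3, formula (3)
    return x, max_iter


def grad(x):
    # Partial derivatives of the toy objective -(x1 - 1)^2 - (x2 - 2)^2.
    return [-2.0 * (x[0] - 1.0), -2.0 * (x[1] - 2.0)]


def step(x, d):
    # Exact line search for this quadratic, then cut back to remain feasible.
    gd = sum(g * dj for g, dj in zip(grad(x), d))
    dd = sum(dj * dj for dj in d)
    if dd == 0.0 or gd <= 0.0:
        return 0.0
    t = gd / (2.0 * dd)
    if d[0] + d[1] > 0.0:                          # constraint x1 + x2 <= 10
        t = min(t, (10.0 - x[0] - x[1]) / (d[0] + d[1]))
    for xj, dj in zip(x, d):                       # nonnegativity of each x_j
        if dj < 0.0:
            t = min(t, xj / -dj)
    return max(t, 0.0)


x_final, iterations = large_step([0.0, 0.0], grad, step)
print(x_final, iterations)
```

Here the loop stops after one move because the second pass finds a zero step size. When the optimum lies on the boundary, the gradient is generally not an adequate direction, which is the point of the discussion that follows.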


The term large-step approach refers to the fact that the optimal step
size is used in Step 2. If you look back to the “steepest ascent with optimal step
size” algorithm in Secs. 14.4 and 14.5 (pp. 530 and 534), you will notice three
important differences in the wording of Step 2. First, the condition that every
$\partial c/\partial x_j = 0$ is no longer used as a criterion for termination. The reason is that the
partial derivatives do not all have to equal zero at a stationary point of a constrained
problem. Second, the optimal step size cannot be determined with reference to
$c(x)$ alone; $t^k$ must be chosen so that the point $x^{k+1}$ satisfies the constraints. Given
the regularization assumption (ii), the feasibility stipulation means that $t^k$
is always finite. The resultant value of $t^k$ might be termed the optimal feasible
step size. Third, each direction $d_j^k$ in Step 2 is no longer specified as
$\partial c/\partial x_j$, evaluated at the current trial point. The reason that values of the partial
derivatives are not adequate for the directions is easily illustrated by an example.

Consider the quadratic problem previously analyzed in Sec. 14.6:

Problem P:

(4)  maximize $c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \tfrac{1}{3}x_3^2$

subject to

(5)  $x_1 + 2x_2 + x_3 + s = 4$,

where all $x_j \ge 0$ and $s \ge 0$. Let the trial point be $x_j^k = 1$ for all $j$, so that (5)
is satisfied with $s = 0$, and $c(x^k) = 6.67$. Suppose, as in the method of steepest
ascent, you let each $d_j^k = \partial c/\partial x_j$. Then verify that $d_1^k = d_2^k = 0$ and $d_3^k = \tfrac{4}{3}$.

Unfortunately, no feasible points lie in this direction from $x^k$. The trial point,
nevertheless, is not locally optimal. Consider the direction ($d_1^k = -1$, $d_2^k = 0$,
$d_3^k = 1$), which is along the constraint (5). You can check that for step size $t = .1$,
the objective function improves: $c(x^{k+1}) = 6.76$.
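The claims in this example are easy to check numerically. The snippet below is a small sketch that assumes Problem P has the objective $6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \tfrac{1}{3}x_3^2$ and the constraint $x_1 + 2x_2 + x_3 \le 4$; the function names `c`, `grad`, and `slack` are illustrative only.

```python
def c(x):
    x1, x2, x3 = x
    return 6*x1 + 4*x2 + 2*x3 - 3*x1**2 - 2*x2**2 - x3**2 / 3


def grad(x):
    x1, x2, x3 = x
    return [6 - 6*x1, 4 - 4*x2, 2 - 2*x3 / 3]


def slack(x):
    # Value of s in constraint (5); a negative value means x is infeasible.
    return 4 - (x[0] + 2*x[1] + x[2])


xk = [1.0, 1.0, 1.0]
t = 0.1
g = grad(xk)                                   # the steepest-ascent direction
along_gradient = [xj + t*gj for xj, gj in zip(xk, g)]
along_constraint = [xj + t*dj for xj, dj in zip(xk, [-1.0, 0.0, 1.0])]

print("gradient at trial point:", g)
print("c at trial point:", round(c(xk), 4))
print("slack after gradient step:", round(slack(along_gradient), 4))
print("slack after step along (5):", round(slack(along_constraint), 4))
print("c after step along (5):", round(c(along_constraint), 4))
```

Any positive step along the gradient drives the slack negative, whereas the step along the constraint keeps $s = 0$ and raises the objective to about 6.767, the figure quoted in the text as 6.76.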
To summarize, the test for termination, the choice of step size, and the selection
of direction must take account of what happens when the trial point either
reaches or lies on a boundary of the constraint set.

The next three sections discuss several large-step approaches that exploit the
linearity of the constraints. Be sure to note how each algorithm specifies the details
for Step 2.

As you will quickly realize, the instructions in nonlinear algorithms have numerous
details, and even small illustrative examples require a fair amount of calculation.
To help you read this material, we will suggest here and there that you record
certain equations or other information. If you heed these suggestions, you can
avoid flipping back and forth between pages.

Problem P in (4) and (5) will be used repeatedly to illustrate the algorithms in
this chapter. Consequently, copy (4) and (5) on a piece of paper for ready reference.
The unique optimal solution is

(6)  $\bar{x}_1 = \tfrac{7}{8} = .875 \quad \bar{x}_2 = \tfrac{5}{8} = .625 \quad \bar{x}_3 = 1\tfrac{7}{8} = 1.875$ with $\bar{c} = 7.25$;

jot down this solution too.
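One way to confirm this solution, sketched here under the assumptions that constraint (5) is binding ($s = 0$) and that every $x_j$ is positive at the optimum, is to equate each partial derivative of $c(x)$ to a multiple of the corresponding constraint coefficient. The multiplier $\lambda$ is notation introduced only for this check.

```latex
\begin{aligned}
6 - 6x_1 &= \lambda, \qquad 4 - 4x_2 = 2\lambda, \qquad 2 - \tfrac{2}{3}x_3 = \lambda \\
\Rightarrow\quad x_1 &= 1 - \tfrac{\lambda}{6}, \qquad x_2 = 1 - \tfrac{\lambda}{2}, \qquad x_3 = 3 - \tfrac{3\lambda}{2} \\
x_1 + 2x_2 + x_3 &= 6 - \tfrac{8}{3}\lambda = 4 \quad\Rightarrow\quad \lambda = \tfrac{3}{4} \\
\bar{x}_1 &= \tfrac{7}{8}, \qquad \bar{x}_2 = \tfrac{5}{8}, \qquad \bar{x}_3 = \tfrac{15}{8} \\
c(\bar{x}) &= 5.25 + 2.5 + 3.75 - 2.296875 - 0.78125 - 1.171875 = 7.25
\end{aligned}
```

Because $c(x)$ is concave and the constraint is linear, a point satisfying these conditions is a global maximum, in agreement with (6).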


15.2 METHOD OF CONVEX COMBINATIONS 


The idea of this algorithm is to determine directions in Step 2 by means of a
linear programming optimization, using a linear approximation to $c(x)$ and the
problem’s linear constraints. Consider a feasible trial point $x^k$. Given assumption
(iv) in Sec. 15.1 on the partial derivatives of $c(x)$, the value of $c(x)$ at another point
$z$ in the neighborhood of $x^k$ can be approximated by a first-order Taylor’s expansion:

(1)  $c(z) \approx c(x^k) + \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\,(z_j - x_j^k)$.

Suppose, then, you find a feasible point $z$ that maximizes the term

(2)  $\sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\, z_j$,

subject to the linear constraints on the variables. Let $\hat{z}$ be this point, and suppose

(3)  $\sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\, \hat{z}_j > \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\, x_j^k$.

Then the summation on the right of (1) is strictly positive. In all likelihood, $\hat{z}$
is not in the immediate neighborhood of $x^k$, and so the approximation (1) is poor.
In fact, $c(\hat{z})$ may be smaller than $c(x^k)$. But because of the continuity of $c(x)$ and
its partial derivatives, there is a convex combination, or weighted
average, of $x^k$ and $\hat{z}$ that yields an increase in $c(x)$ above the value $c(x^k)$. In other
words, there is a $t$, where $0 < t \le 1$, such that by letting

(4)  $x_j^{k+1} = (1 - t)\,x_j^k + t\,\hat{z}_j$ for $j = 1, 2, \ldots, n$,

you obtain

(5)  $c(x^{k+1}) > c(x^k)$.
Since all the constraints are linear, a convex combination of $x^k$ and $\hat{z}$ is feasible
for any $t$ in the interval $0 \le t \le 1$.

Let each direction be defined as

(6)  $d_j^k = \hat{z}_j - x_j^k$ for $j = 1, 2, \ldots, n$;

then (4) can be written in the standard form

(7)  $x_j^{k+1} = x_j^k + t\,d_j^k$ for $j = 1, 2, \ldots, n$  ($0 \le t \le 1$),

as you should verify. [Rearrange the summations in (3) so as to express the inequality
in terms of the directions.]
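The rearrangement suggested in the brackets takes one line. Writing $\hat{z}$ for the linear programming optimum and $d_j^k = \hat{z}_j - x_j^k$ as in (6), a sketch of the step is:

```latex
\sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\,\hat{z}_j
\;-\; \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\,x_j^k
\;=\; \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\,d_j^k \;>\; 0 .
```

In words, inequality (3) says that the directional derivative of $c(x)$ along $d^k$ is positive at $x^k$, which is why a sufficiently small step toward $\hat{z}$ must improve the objective.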

To recapitulate, the detailed procedure for Step 2 is:

(i) Evaluate all the partial derivatives $\partial c/\partial x_j$ at the trial point $x^k$.

(ii) Solve the linear programming problem consisting of the objective function
in (2) and the linear constraints, where the symbol $z_j$ is substituted for $x_j$
in all the constraints. Define the directions $d_j^k$ according to (6).

(iii) Terminate the calculations unless the inequality in (3) holds.

(iv) Otherwise, find a step size $t^k$ such that $c(x^k + t\,d^k)$ is maximized, where
$0 \le t \le 1$.

If an optimal linear programming solution $\hat{z}$ yields an equality between the
two summations in (3) at iteration $k$, the current trial $x^k$ is a constrained stationary
point. This frequently means that there is no feasible point in the immediate
neighborhood of $x^k$ at which $c(x)$ is greater than $c(x^k)$. (But it can happen that $x^k$
is only a point of “inflection.”) Usually, however, there is no iteration $k$ where an
equality occurs between the two summations in (3). Nevertheless, for an arbitrary
initial trial point $x^0$, the method always converges in the limit, in the following
sense:

(A) The entire sequence $c(x^k)$ monotonically increases to a limit.

(B) At least a subsequence of the $x^k$ converges to a feasible point $\bar{x}$.

(C) The point $\bar{x}$ is a constrained stationary point; that is, either all the partial
derivatives equal zero at $\bar{x}$, or some partial derivatives do not equal zero
but there is no possible improvement in $c(x)$ in the feasible neighborhood
of $\bar{x}$.

Consider the important case when $c(x)$ is concave. Recall the definition: a
several-variable function $c(x)$ is concave if, for any two points $x$ and $y$, and for
all $p$, $0 \le p \le 1$,

$p\,c(x) + (1 - p)\,c(y) \le c\bigl(p x_1 + (1 - p) y_1, \ldots, p x_n + (1 - p) y_n\bigr)$    Concave,

and is strictly concave if the inequality is strict (<) for $0 < p < 1$.

When $c(x)$ is concave, the point $\bar{x}$ is always a global optimum, and $c(\bar{x})$ truly is
the maximum value. Further, in this case, the optimal step size can be found
by using a single-variable search technique given in Sec. 14.3.
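Problem P of Sec. 15.1 falls under this case. One quick way to see it, using second derivatives rather than the definition above (a standard test, assumed here rather than taken from this section), is to note that the objective has no cross-product terms, so its matrix of second partial derivatives is diagonal:

```latex
\left[\frac{\partial^2 c}{\partial x_j\,\partial x_k}\right]
= \begin{pmatrix} -6 & 0 & 0 \\ 0 & -4 & 0 \\ 0 & 0 & -\tfrac{2}{3} \end{pmatrix},
\qquad
\sum_{j}\sum_{k} \frac{\partial^2 c}{\partial x_j\,\partial x_k}\,y_j y_k
= -6y_1^2 - 4y_2^2 - \tfrac{2}{3}y_3^2 < 0 \quad \text{for } y \neq 0 .
```

The quadratic form is negative for every nonzero $y$, so $c(x)$ is strictly concave, and any constrained stationary point the method reaches for Problem P is the global maximum.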

The Method of Convex Combinations has several characteristics worth noting:

(i) Only by happenstance would each direction in (6) equal the corresponding
partial derivatives of $c(x)$ at $x^k$. In general, each $d_j^k$ does not equal the
direction of steepest ascent at the trial point.

(ii) Since $\hat{z}$ is an optimal solution to a linear programming problem with $m$
constraints, it has at most $m$ strictly positive components. Nevertheless,
if $\hat{z}_j \ne x_j^k$, then $x_j^{k+1}$ will differ from $x_j^k$ according to (4). In other words,
the trial value of every $x_j$ may change at an iteration.

(iii) If the linear constraints have a special structure (for example, if they
represent a network model), then you may exploit this fact in the linear
programming optimization of Step 2.

(iv) If the objective function is only slightly nonlinear, so that most of the $x_j$
appear only linearly with coefficient $c_j$, then the coefficients in (2) are also
$c_j$ for these variables. This simplifies the computations at Step 2.

(v) If $c(x)$ is concave, then at each iteration you may calculate an upper bound
on the optimal value of $c(x)$. This property is especially helpful, since the
bound can be employed to terminate the iterations when a sufficiently
good solution is obtained. The calculation is based on the fact that a
first-order Taylor’s expansion of $c(x)$, as in (1), yields an overstatement if
$c(x)$ is concave. Specifically, let $\bar{x}$ denote an optimal solution; then

(8)  $c(\bar{x}) \le c(x^k) + \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\,(\bar{x}_j - x_j^k) \le c(x^k) + \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j}\,(\hat{z}_j - x_j^k)$,

where the first inequality follows from the concavity assumption, and the
second from the optimization with objective function (2). The upper
bound, then, is the expression on the right of (8). Since this bound does
not always decrease at every iteration, you should keep track of the lowest
bound found in all the previous iterations.
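The bound on the right of (8) is cheap to evaluate once the linear program has been solved. The sketch below computes it for Problem P of Sec. 15.1, assuming that the linear program can be solved by simply comparing the four basic feasible solutions of the single constraint $z_1 + 2z_2 + z_3 \le 4$; the function name `upper_bound` is illustrative.

```python
def c(x):
    x1, x2, x3 = x
    return 6*x1 + 4*x2 + 2*x3 - 3*x1**2 - 2*x2**2 - x3**2 / 3


def grad(x):
    x1, x2, x3 = x
    return [6 - 6*x1, 4 - 4*x2, 2 - 2*x3 / 3]


# Basic feasible solutions of z1 + 2*z2 + z3 <= 4 with every z_j >= 0.
EXTREME_POINTS = [(4.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.0, 0.0, 4.0), (0.0, 0.0, 0.0)]


def upper_bound(x):
    """Right-hand side of (8): c(x) + max over z of grad.z, minus grad.x."""
    g = grad(x)
    lp_value = max(sum(gj * zj for gj, zj in zip(g, z)) for z in EXTREME_POINTS)
    return c(x) + lp_value - sum(gj * xj for gj, xj in zip(g, x))


for trial in ([0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.52, 0.0, 1.92]):
    print(trial, round(upper_bound(trial), 2))
```

For these three trial points the bounds are 24, 11, and 13.56, each above the optimal value 7.25. The third bound is higher than the second, which illustrates why the lowest bound found so far should be retained.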

► Convergence of the algorithm can be accelerated by using a second-order Taylor’s
expansion of $c(x)$, thereby leading to a quadratic, instead of a linear, programming
problem in Step 2. The inequality (3) is modified accordingly by adding the terms
involving second-order partial derivatives. ◄


Example. To illustrate the technique, consider Problem P, stated at the
end of the preceding section. The corresponding linearized objective function
(2) is

(9)  $[6(1 - x_1^k)]\,z_1 + [4(1 - x_2^k)]\,z_2 + [2(1 - \tfrac{1}{3}x_3^k)]\,z_3$.

Given the single linear constraint, the only basic feasible solutions of the linear
programming problem are

(10)
      Point 1:   $z_1 = 4$   $z_2 = 0$   $z_3 = 0$
      Point 2:   $z_1 = 0$   $z_2 = 2$   $z_3 = 0$
      Point 3:   $z_1 = 0$   $z_2 = 0$   $z_3 = 4$
      Point 4:   $z_1 = 0$   $z_2 = 0$   $z_3 = 0$.
To test your understanding so far, devise a simple rule that indicates which of
these points is optimal, given values for $x_j^k$ in (9).

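The step size formula (7) in Sec. 14.5 (p. 543) can also be utilized here, with
a slight modification:

(11)  optimal $t$ = minimum $\left[\,1,\ \dfrac{6(1 - x_1)d_1 + 4(1 - x_2)d_2 + 2(1 - \tfrac{1}{3}x_3)d_3}{6d_1^2 + 4d_2^2 + \tfrac{2}{3}d_3^2}\,\right]$,

where, for brevity, the superscript $k$ has been omitted in the notation.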
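The ratio inside (11) is what an exact one-variable maximization gives for Problem P, and the following is a sketch of that reasoning. Because $c(x)$ is quadratic with second derivatives $-6$, $-4$, and $-\tfrac{2}{3}$, its value along the ray $x + td$ is a concave parabola in $t$:

```latex
\begin{aligned}
c(x + td) &= c(x) + t \sum_{j=1}^{3} \frac{\partial c(x)}{\partial x_j}\,d_j
             - \frac{t^2}{2}\left(6d_1^2 + 4d_2^2 + \tfrac{2}{3}d_3^2\right), \\
\frac{d}{dt}\,c(x + td) = 0
&\quad\Longrightarrow\quad
t = \frac{\sum_{j} \bigl(\partial c(x)/\partial x_j\bigr)\,d_j}{6d_1^2 + 4d_2^2 + \tfrac{2}{3}d_3^2}.
\end{aligned}
```

The cap at 1 keeps the new point between $x^k$ and the extreme point, and hence feasible. As a check, starting from the origin with $d = (4, 0, 0)$ the ratio is $24/96 = .25$.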
Recall that the unique optimal solution is

(12)  $\bar{x}_1 = \tfrac{7}{8} = .875 \quad \bar{x}_2 = \tfrac{5}{8} = .625 \quad \bar{x}_3 = 1\tfrac{7}{8} = 1.875 \quad c(\bar{x}) = 7.25$.

Verify that if you let the trial $x^k$ be $\bar{x}$, then the objective function (9) becomes
$\tfrac{3}{4}z_1 + \tfrac{3}{2}z_2 + \tfrac{3}{4}z_3$; as a result, each of the first three points in (10) is optimal for
the associated linear programming calculation, and the two quantities shown for
the test in (3) have equal values, indicating that $\bar{x}$ is optimal.
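This verification can be carried out with exact fractions. The sketch below assumes the Problem P data from Sec. 15.1 and the four basic feasible solutions listed in (10); the variable names are illustrative.

```python
from fractions import Fraction as F

x_bar = (F(7, 8), F(5, 8), F(15, 8))

# Coefficients of the linearized objective (9), evaluated at x_bar.
coeffs = (6 * (1 - x_bar[0]), 4 * (1 - x_bar[1]), 2 * (1 - x_bar[2] / 3))

points = {1: (4, 0, 0), 2: (0, 2, 0), 3: (0, 0, 4), 4: (0, 0, 0)}
values = {name: sum(cj * zj for cj, zj in zip(coeffs, z)) for name, z in points.items()}

# Right-hand quantity in test (3): the same coefficients applied to x_bar itself.
at_trial = sum(cj * xj for cj, xj in zip(coeffs, x_bar))

print("coefficients:", [str(cj) for cj in coeffs])
print("LP objective at Points 1-4:", {k: str(v) for k, v in values.items()})
print("value at the trial point:", at_trial)
```

Points 1, 2, and 3 each give the value 3, which equals the value at $\bar{x}$ itself, so the inequality in (3) fails and the method terminates.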

Suppose you initiate the Method of Convex Combinations for this example by
letting each $x_j^0 = 0$. Check that the linearized objective function (9) yields Point 1
in (10) as optimal. As a result, $d_1^0 = 4 - 0 = 4$ and $d_2^0 = d_3^0 = 0$; according to
(11), the associated optimal $t^0$ is .25. Then the formula (7) for the next trial point
yields $x_1^1 = 0 + (\tfrac{1}{4})4 = 1$ and $x_2^1 = x_3^1 = 0$. The subsequent iterations are
summarized in Fig. 15.1.


$c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \tfrac{1}{3}x_3^2$
$x_1 + 2x_2 + x_3 \le 4$

Iteration                                        Step    Upper    Extreme
    k        c(x^k)    x_1^k    x_2^k    x_3^k   Size    Bound    Point

    0        0         0        0        0       .25     24         1
    1        3         1        0        0       .48     11         3
    2        4.92      .52      0        1.92    .12     13.56      1
    3        5.417     .920     0        1.699   .26     11.50      2
    4        6.222     .677     .529     1.250   .06     10.21      1
    5        6.338     .871     .498     1.177   .16      8.09      3
    6        6.480     .730     .418     1.633   .04      9.32      1
   10        6.717     .768     .484     1.670   .03      8.73      1
   11        6.748     .868     .469     1.618   .07      7.82      2
   18        6.907     .806     .532     1.725   .02      8.16      1
   19        6.919     .868     .521     1.691   .04      7.59      2
   20        6.934     .830     .587     1.617

Optimal
Values       7.25      .875     .625     1.875

Figure 15.1. Method of Convex Combinations.
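The early rows of Fig. 15.1 can be reproduced with a few lines of code. The following is a minimal sketch, not the procedure used to prepare the figure. It assumes that the linear program is solved by enumerating the four basic feasible solutions in (10), that the step size is the capped ratio of formula (11), and that ties between extreme points go to the higher-numbered point (a rule chosen only because it matches the choice of Point 3 at iteration 1).

```python
def c(x):
    x1, x2, x3 = x
    return 6*x1 + 4*x2 + 2*x3 - 3*x1**2 - 2*x2**2 - x3**2 / 3


def grad(x):
    x1, x2, x3 = x
    return [6 - 6*x1, 4 - 4*x2, 2 - 2*x3 / 3]


EXTREME_POINTS = [(4.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.0, 0.0, 4.0), (0.0, 0.0, 0.0)]
CURVATURE = (6.0, 4.0, 2.0 / 3.0)      # negatives of the second derivatives of c


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def convex_combinations(x0, iterations=20):
    """Return one row (k, c(x^k), x^k, step size, extreme point) per iteration, plus the final point."""
    x = list(x0)
    rows = []
    for k in range(iterations):
        g = grad(x)                                   # Step 2(i)
        best = 0                                      # Step 2(ii): LP by enumeration
        for i, z in enumerate(EXTREME_POINTS):
            if dot(g, z) >= dot(g, EXTREME_POINTS[best]):
                best = i                              # assumed tie rule: later point wins
        d = [zj - xj for zj, xj in zip(EXTREME_POINTS[best], x)]
        improvement = dot(g, d)                       # left side of (3) minus right side
        if improvement <= 1e-12:                      # Step 2(iii): stationary point
            break
        t = min(1.0, improvement / dot(CURVATURE, [dj * dj for dj in d]))   # Step 2(iv)
        rows.append((k, c(x), list(x), t, best + 1))
        x = [xj + t * dj for xj, dj in zip(x, d)]     # Step 3
    return rows, x


rows, x20 = convex_combinations([0.0, 0.0, 0.0])
for k, value, point, t, extreme in rows[:5]:
    print(k, round(value, 3), [round(v, 3) for v in point], round(t, 2), extreme)
print(20, round(c(x20), 3), [round(v, 3) for v in x20])
```

Each pass needs only the gradient, four inner products, and one division, which is why the method is attractive when the linear program itself is easy to solve.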
Notice that the successive values of $c(x^k)$ rise rapidly at the start, and then increase
quite slowly after the fifth iteration. This is typical behavior for a first-order
method. Observe how the linear programming solution is Point 1 in (10) on every
even-numbered iteration, and alternates between Points 2 and 3 on the
odd-numbered iterations. This alternation causes a dampening oscillation in the $x_j^k$.
Finally, note that the trial $x^k$ are always interior to the linear inequality constraint,
although the optimal solution satisfies the constraint exactly.

► For the sake of contrast, consider how the algorithm proceeds when $x^0$ is one of the
other extreme points in (10), or the point where each $x_j^0 = 1$, which also exactly satisfies
the linear inequality constraint. The speed of convergence of the $c(x^k)$ is shown in Fig.
15.2. Notice how much faster $c(x^k)$ approaches its maximum for these initial points.
By the tenth iteration, not only is $c(x^k)$ nearly equal to its optimal value, but the
corresponding $x_j^k$ differ from the optimal $\bar{x}_j$ by less than .03.


Iteration    Point 4                   Points
    k        x_j^0 = 0     Point 2     1 and 3     x_j^0 = 1

    1         3.000         4.571       6.000       7.167
    2         4.920         7.020       7.143       7.219
    3         5.417         7.187       7.228       7.236
    4         6.222         7.224       7.244       7.244
    5         6.338         7.239       7.248       7.247
   10         6.717         7.249       7.249       7.249
   20         6.934         7.249       7.249       7.249

Optimal Value of c(x) = 7.25

Figure 15.2. Speed of Convergence of c(x).


As another point of comparison, consider what happens when the linear constraint is
relaxed by letting the right-hand side be 6 or 8. In these two cases, the constraint is not
really binding, in that the unconstrained optimal solution ($x_1 = 1$, $x_2 = 1$, $x_3 = 3$) is
feasible; it has an objective function value of 8. When the right-hand side is 6, and
starting with each $x_j^0 = 0$, the trial solution at $k = 20$ is ($x_1^{20} = .9162$, $x_2^{20} = .8307$,
$x_3^{20} = 2.3585$), giving $c(x^{20}) = 7.7844$. Again, convergence is very slow after the fifth
iteration. When the right-hand side is 8, the result at $k = 20$ is much better ($x_1^{20} = .9851$,
$x_2^{20} = .9485$, $x_3^{20} = 2.7668$), yielding $c(x^{20}) = 7.9759$. ◄


* 15.3 CONCAVE SIMPLEX METHOD

As the name suggests, this algorithm obtains an optimal solution if $c(x)$ is concave,
and is based on merely a few modifications of the ordinary simplex method.
Actually, it combines the ideas used in the simplex method for upper-bounded
variables (Sec. 5.10) and in the Quadratic Simplex Method (Sec. 14.6). Like the
upper-bounded variables algorithm, the method employs a set of only $m$ constraints.
It classifies the original variables into the categories of basic and nonbasic,
and allows some of the nonbasic variables to be in the solution at a positive
level. Like the Quadratic Simplex Method, it considers the effect of changing
one nonbasic variable at a time. If the value of a selected nonbasic variable can
be altered to increase the objective function, then it is so changed to maintain
feasibility and bring about the maximum improvement in the objective function.
The approach differs most importantly from the preceding Method of Convex
Combinations in that here no more than $(m + 1)$ of the variables change their
levels at any iteration, these being the current basic variables and the selected
nonbasic variable. If $c(x)$ is not concave, the method yields a constrained stationary
point.

Let $x^k$ be the trial feasible point at iteration $k$. For ease of exposition, suppose
that the variables are numbered such that $x_1, x_2, \ldots, x_m$ are basic, and
$x_{m+1}, \ldots, x_N$ are nonbasic at iteration $k$. Note we define $N = m + n$ so that the
slack variables are also designated by the same letter, $x$. Suppose, at iteration $k$, the
system of linear constraints is displayed as in the application of the simplex method:

(1)  $x_1 + t_{1,m+1} x_{m+1} + \cdots + t_{1N} x_N = b_1$   Row 1
     $x_2 + t_{2,m+1} x_{m+1} + \cdots + t_{2N} x_N = b_2$   Row 2
     $\vdots$
     $x_m + t_{m,m+1} x_{m+1} + \cdots + t_{mN} x_N = b_m$   Row $m$,

where the superscript $k$ has been dropped for simplicity of notation. Since in this
algorithm some of the nonbasic variables may be strictly positive, a $b_i$ may be
negative. But the appropriate nonnegativity assumption can be written as

(2)  $x_i = b_i - \sum_{j=m+1}^{N} t_{ij} x_j \ge 0$  for $i = 1, 2, \ldots, m$

at the point $x^k$.

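The rationale establishing the relevance of the objective function in the preceding
Method of Convex Combinations continues to apply here. Specifically, consider
the function

(3)  $x_0 = \sum_{j=1}^{N} \frac{\partial c(x^k)}{\partial x_j} \cdot x_j$

or, expressed in the fashion of the simplex method,

(4)  $x_0 - \frac{\partial c(x^k)}{\partial x_1} \cdot x_1 - \cdots - \frac{\partial c(x^k)}{\partial x_m} \cdot x_m - \frac{\partial c(x^k)}{\partial x_{m+1}} \cdot x_{m+1} - \cdots - \frac{\partial c(x^k)}{\partial x_N} \cdot x_N = 0$   Row 0.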


If you were applying the simplex method, Row 0 would have a coefficient of
zero for every variable in the current basis. To achieve this condition here, you
can employ the relations in (1) to eliminate $x_1, \ldots, x_m$ from (4). Specifically, for
$i = 1, 2, \ldots, m$, multiply Row $i$ by $\partial c(x^k)/\partial x_i$ and add the result to (4),
thereby obtaining

(5)  $x_0 + 0x_1 + \cdots + 0x_m + t_{0,m+1} x_{m+1} + \cdots + t_{0N} x_N = b_0$   Row 0,

where, as a result of the elimination process,

(6)  $t_{0j} = \sum_{i=1}^{m} \frac{\partial c(x^k)}{\partial x_i} \cdot t_{ij} - \frac{\partial c(x^k)}{\partial x_j}$  for $j = m + 1, \ldots, N$.


[If $c(x)$ were linear, then each partial derivative under the summation sign in
(6) would be the original objective-function coefficient $c_i$ for the basic variable
$x_i$.] The value of $t_{0j}$ is called a reduced (or relative) partial derivative.
As we mentioned above, the superscript $k$ on $t_{0j}$ has been dropped for notational
convenience. Since the constant $b_0$ on the right-hand side of (5) will play no role
in the computations, we adopt the convention of indicating it by the symbol 0
throughout.
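
The elimination in (6) can be sketched in a few lines of Python. This is only an illustration, not part of the original text: the function name `reduced_partials` and the list-based tableau layout are assumptions made here. It evaluates $t_{0j}$ for Problem P of Sec. 15.1 at the point $(x_1, x_2, x_3, s) = (1, 1, 1, 0)$ with $x_3$ basic.

```python
def reduced_partials(grad, tableau, basis):
    """Reduced partial derivatives t_0j, following equation (6).

    grad    -- partial derivatives of c at the trial point, one per variable
    tableau -- rows of coefficients t_ij; row i belongs to variable basis[i]
    basis   -- index of the basic variable in each row
    (All three names are illustrative assumptions.)
    """
    t0 = []
    for j in range(len(grad)):
        weighted = sum(grad[basis[i]] * tableau[i][j] for i in range(len(basis)))
        t0.append(weighted - grad[j])
    return t0


# Problem P: c(x) = 6x1 + 4x2 + 2x3 - 3x1^2 - 2x2^2 - (1/3)x3^2,
# with the single constraint x1 + 2x2 + x3 + s = 4.
x = [1.0, 1.0, 1.0, 0.0]                      # (x1, x2, x3, s)
grad = [6 * (1 - x[0]), 4 * (1 - x[1]), 2 * (1 - x[2] / 3), 0.0]
tableau = [[1.0, 2.0, 1.0, 1.0]]              # the constraint, solved for x3
t0 = reduced_partials(grad, tableau, basis=[2])
print(t0)   # the entry for the basic variable x3 comes out as zero
```

The basic variable receives a zero coefficient automatically, because its own tableau column is a unit vector.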

As in the simplex method, the value of $t_{0j}$ can be given the following meaning:


Interpretation of Coefficients in Row 0. Each coefficient for a nonbasic
variable represents the rate of increase (for negative coefficients) or decrease (for positive
coefficients) in $c(x)$ with an increase in the associated nonbasic variable above its
current level $x_j^k$.


A few comments are in order about the relevance and numerical significance of 
this interpretation. 

Suppose that all $t_{0j} \ge 0$ at iteration $k$, that is, all $t_{0j}$ are "positive coefficients."
Is the solution optimal? It would be, provided all the nonbasic variables were
at zero level, for then any small increase in a nonbasic $x_j$ would not increase $c(x)$
above $c(x^k)$. Consequently, $x^k$ would be a constrained stationary point, and
therefore optimal, since $c(x)$ is concave. But if there is a nonbasic variable $x_j$ at a
positive level and $t_{0j} > 0$, then the solution may possibly be improved by reducing
the value of $x_j$. If both conditions

(7)  all $t_{0j} \ge 0$  and  all $t_{0j} x_j^k = 0$

are satisfied, then $x^k$ is truly optimal.

Suppose (7) is not satisfied, so that there is either a negative $t_{0j}$ or a positive
$t_{0j} x_j^k$, or both. Then you must select a nonbasic variable and alter its value in an
effort to improve the current level of the objective function. Toward this end,
first determine, as in Criterion I of the simplex method, the smallest $t_{0j}$ coefficient:

(8)  minimum $(t_{01}, \ldots, t_{0N}) = t_{0p} \equiv t_{\min}$.

Verify that $t_{\min} \le 0$ and may be negative if (7) is not satisfied. Second,
determine the largest value for a $t_{0j} x_j^k$:

(9)  maximum $(t_{01} x_1^k, \ldots, t_{0N} x_N^k) = t_{0q} x_q^k \equiv t_{\max}$.

Verify that $t_{\max} \ge 0$ and may be positive if (7) is not satisfied. Given that (7)
is not satisfied, either $t_{\min}$ or $t_{\max}$, or both, are nonzero. It can be shown that a
valid rule for determining which nonbasic variable to alter is:

Choice of a Nonbasic Variable. If (7) is not satisfied, increase variable $x_p$
if $t_{\min} \le -t_{\max}$. Otherwise, decrease variable $x_q$.

Having picked a nonbasic variable to change, you must next compute the
corresponding directions $d_j$. Actually all $d_j = 0$ except those for the basic variables
and the selected nonbasic variable. Consider first the case when $x_p$ is chosen. Then
a straightforward modification of Criterion II of the simplex method will yield the
following rule:

Directions When $x_p$ Increases. (a) Take the ratios of $x_i^k$ evaluated in (2) to the
coefficients $t_{ip}$ (ignore ratios with zero or negative numbers in the denominator). (b) Let $r$
be the value of the minimum ratio. (c) Let $d_p = r$ and $d_i = -t_{ip} r$ for $i = 1, 2, \ldots, m$.

Second, consider the case when $x_q$ is chosen. Then a straightforward modification
of Criterion II of the bounded-variables simplex method (p. 149) yields the rule:

Directions When $x_q$ Decreases. (a) Take only the ratios of $-x_i^k$ evaluated in
(2) to the coefficients $t_{iq}$ (ignore ratios with zero or positive numbers in the denominator).
(b) Let $R$ be the value of the minimum ratio, and let $r = \text{minimum}(R, x_q^k)$. (c) Let
$d_q = -r$ and $d_i = t_{iq} r$ for $i = 1, 2, \ldots, m$.

Note that $r \ge 0$ in both cases. (If $r = 0$, then all $d_j = 0$, but you still do not
terminate the calculations.)
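
The two direction rules can be written out as a short Python sketch. The function names and the list-based tableau are assumptions introduced for illustration; the sketch also assumes that at least one ratio exists when a variable is increased, and it raises an error otherwise. The sample data come from Problem P of Sec. 15.1, whose single constraint is $x_1 + 2x_2 + x_3 + s = 4$.

```python
import math


def directions_increase(x, tableau, basis, p):
    """Directions when the nonbasic variable x_p is increased."""
    ratios = [x[basis[i]] / row[p] for i, row in enumerate(tableau) if row[p] > 0]
    if not ratios:
        raise ValueError("no positive t_ip: x_p is not blocked by any basic variable")
    r = min(ratios)
    d = [0.0] * len(x)
    d[p] = r
    for i, row in enumerate(tableau):
        d[basis[i]] = -row[p] * r
    return d


def directions_decrease(x, tableau, basis, q):
    """Directions when the nonbasic variable x_q is decreased."""
    ratios = [-x[basis[i]] / row[q] for i, row in enumerate(tableau) if row[q] < 0]
    big_r = min(ratios) if ratios else math.inf
    r = min(big_r, x[q])          # x_q itself cannot fall below zero
    d = [0.0] * len(x)
    d[q] = -r
    for i, row in enumerate(tableau):
        d[basis[i]] = row[q] * r
    return d


tableau = [[1.0, 2.0, 1.0, 1.0]]   # variables ordered (x1, x2, x3, s)

# Start of Problem P: all x_j = 0, s = 4 is basic, and x1 is increased.
d_up = directions_increase([0.0, 0.0, 0.0, 4.0], tableau, basis=[3], p=0)

# Later trial point (1, 1, 1, 0): x3 is basic, and x2 is decreased.
d_down = directions_decrease([1.0, 1.0, 1.0, 0.0], tableau, basis=[2], q=1)
print(d_up, d_down)
```

In both calls every component other than the selected nonbasic variable and the basic variable stays at zero, which is the property that distinguishes this method from the Method of Convex Combinations.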

Finally, given the directions, you must calculate the optimal step size $t$, thereby
completing the computations in Step 2. The step size procedure is exactly the same
as in the Method of Convex Combinations.

The new trial point is calculated as usual in Step 3. If the optimal step size
is in the interval $0 < t^* < 1$, then return to Step 2, and repeat the process,
revising the values of $t_{0j}$ in (6) at the new trial point $x^{k+1}$. (This case always applies
when the variable $x_q$ is to be decreased and $r < R$.) But if $t^* = 1$ is optimal
(which occurs if $r = 0$), then perform a change-of-basis calculation prior to returning
to Step 2. Specifically, introduce the variable $x_p$ or $x_q$ (whichever is appropriate)
into the basis, and drop a basic variable associated with the ratio giving $r$.

We now summarize the algorithm for any iteration $k$. The current trial point
$x^k$ is composed of a set of basic variables, and a set of nonbasic variables, some of
which may be positive. The procedure is

(i) Calculate the reduced partial derivatives according to (6), and test for
optimality according to (7). If an improvement is possible, choose a nonbasic
variable to alter.

(ii) Determine the directions of change for the current basic variables and the
selected nonbasic variable; these are computed so that the solution remains
feasible if the optimal step size $t^* = 1$.

(iii) Given the directions, find a step size that maximizes the value of the
objective function along the selected direction from $x^k$, that is, maximize
$c(x^k + td)$ for $0 \le t \le 1$.

(iv) Shift the trial point accordingly, and if the optimal step size $t^* = 1$,
perform a change-of-basis calculation to complete iteration $k$.

The algorithm guarantees that $c(x^{k+1}) \ge c(x^k)$. But to prove that $c(x^k)$ tends, in
the limit, to the maximum value for $c(x)$ when $c(x)$ is concave, additional care
must be taken to circumvent cycling. (This phenomenon will not be discussed
here, but you may want to review the problem in the context of the simplex
method, as discussed in Sec. 4.7.)

This algorithm shares two desirable properties with the Method of Convex
Combinations. A special structure for the linear constraints can be exploited in the
change-of-basis calculations, and the computations for the objective function in
(6) simplify if $c(x)$ is only slightly nonlinear. To repeat the point made at the start,
the methods differ principally in that all directions may be nonzero for the Method
of Convex Combinations, whereas at most only $m + 1$ directions are nonzero in
the Concave Simplex Method.

Example. To illustrate the algorithm, again consider Problem P in Sec.
15.1. Since we want to exhibit several iterations of the technique, we now drop
the convention on the numbering of the basic variables used to display (1). The
linearized objective function (4) becomes

(10)  $x_0 - [6(1 - x_1^k)]x_1 - [4(1 - x_2^k)]x_2 - [2(1 - \tfrac{1}{3}x_3^k)]x_3 = 0$   Row 0.

Let the initial solution be all $x_j^0 = 0$ and $s^0 = 4$, giving

(11)  $x_0 - 6x_1 - 4x_2 - 2x_3 = 0$   Row 0.

Then $x_1$ is chosen to be increased. Verify that $d_1 = 4$, $d_2 = d_3 = 0$, and $d_4 = -4$,
where $d_4$ is the direction for the slack variable. The step size formula (11) in Sec.
15.2 still applies and yields $t = .25$. Hence the next trial point is ($x_1^1 = 1$, $x_2^1 =
x_3^1 = 0$, $s^1 = 3$), with $c(x^1) = 3$. Since $t < 1$, no change of basis is required.
At $k = 1$, the objective function (10) becomes

(12)  $x_0 - 4x_2 - 2x_3 = 0$   Row 0.

This time $x_2$ is chosen to be increased. Since

(13)  $s^1 = 4 - x_1^1 - 2x_2^1 - x_3^1 = 3$,

verify that $d_1 = 0$, $d_2 = \tfrac{3}{2}$, $d_3 = 0$, $d_4 = -3$, and the optimal $t$ is $\tfrac{2}{3}$, yielding
($x_1^2 = 1$, $x_2^2 = 1$, $x_3^2 = 0$, $s^2 = 1$) with $c(x^2) = 5$.

At $k = 2$, the objective function (10) becomes

(14)  $x_0 - 2x_3 = 0$   Row 0,

so that $x_3$ is to be increased. Since

(15)  $s^2 = 4 - 1(1) - 2(1) - 1(0) = 1$,

verify that $d_1 = 0$, $d_2 = 0$, $d_3 = 1$, and $d_4 = -1$. Now the optimal step size is
$t = 1$, so that the new trial point is ($x_1^3 = 1$, $x_2^3 = 1$, $x_3^3 = 1$, $s^3 = 0$), with
$c(x^3) = 6.67$. Here you must perform a change of basis: introduce $x_3$ and drop $s$.
Check that using the linear constraint to eliminate $x_3$ from Row 0 in (10)
yields

(16)  $x_0 + [1 \cdot 2(1 - \tfrac{1}{3}x_3^k) - 6(1 - x_1^k)]x_1 + [2 \cdot 2(1 - \tfrac{1}{3}x_3^k) - 4(1 - x_2^k)]x_2 + [1 \cdot 2(1 - \tfrac{1}{3}x_3^k)]s = 0$   Row 0.

Note in (16) how the attractiveness of changing $x_1$, $x_2$, or $s$ depends on the partial
derivative for $x_3$ and the coefficients of each $x_j$ and $s$ in the linear constraint.
Verify that for the trial point $x^3$, (16) is

(17)  $x_0 + \tfrac{4}{3}x_1 + \tfrac{8}{3}x_2 + \tfrac{4}{3}s = 0$   Row 0.

All the coefficients $t_{0j} \ge 0$; but $t_{01}x_1^3 = \tfrac{4}{3}$ and $t_{02}x_2^3 = \tfrac{8}{3}$, so the iterations
do not terminate. Suppose you choose to decrease $x_2$. Check that $d_1 = 0$, $d_2 = -1$,
$d_3 = 2$, $d_4 = 0$, and the optimal $t = .4$. This gives ($x_1^4 = 1$, $x_2^4 = .6$, $x_3^4 = 1.8$,
$s^4 = 0$), with $c(x^4) = 7.2$. The subsequent iterations are summarized in Fig.
15.3. The basic variable remains $x_3$. At $k = 5$ the value of $x_1$ is reduced; at $k = 6$,


c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - (1/3)x_3^2
x_1 + 2x_2 + x_3 + s = 4

Iteration                                                     Step
    k       c(x^k)     x_1^k     x_2^k     x_3^k     s^k      Size
    1       3          1         0         0         3        .2500
    2       5          1         1         0         1        .6667
    3       6.667      1         1         1         0        .4000
    4       7.2        1         .6        1.8       0        .12
    5       7.248      .88       .6        1.92      0        .025
    6       7.2499     .88       .624      1.872     0        .005
    7       7.2499     .8752     .624      1.8768    0        .0010
    8       7.2499     .8752     .6250     1.8745    0        .0002
Optimal
Values      7.25       .875      .625      1.875     0

Figure 15.3. Concave Simplex Method.


the value of $x_2$ is increased again. Thereafter, the alterations are negligible.
As you can see, the method converges swiftly in this special example to the optimal
value of $c(x)$ as well as to the optimal value of each $x_j$.
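
The iterations of this example can be reproduced with a compact Python sketch. It is a minimal illustration under stated assumptions, not a general implementation: it handles only the single-constraint Problem P (so the basis holds one variable), the function and variable names are invented here, and the step size uses the closed form for a quadratic objective (slope divided by curvature, capped at 1) rather than formula (11) of Sec. 15.2, which is not restated in this section.

```python
import math

A = [1.0, 2.0, 1.0, 1.0]            # x1 + 2x2 + x3 + s = 4
LIN = [6.0, 4.0, 2.0, 0.0]          # linear coefficients of c(x)
QUAD = [3.0, 2.0, 1.0 / 3.0, 0.0]   # coefficients of the squared terms


def c(x):
    return sum(LIN[j] * x[j] - QUAD[j] * x[j] ** 2 for j in range(4))


def grad(x):
    return [LIN[j] - 2 * QUAD[j] * x[j] for j in range(4)]


def concave_simplex(x, basis, iterations, eps=1e-9):
    """Run the method on Problem P; x = (x1, x2, x3, s), basis = one index."""
    history = [list(x)]
    for _ in range(iterations):
        g = grad(x)
        row = [a / A[basis] for a in A]                    # tableau row
        t0 = [g[basis] * row[j] - g[j] for j in range(4)]  # equation (6)
        p = min(range(4), key=lambda j: t0[j])
        q = max(range(4), key=lambda j: t0[j] * x[j])
        t_min, t_max = t0[p], t0[q] * x[q]
        if t_min > -eps and t_max < eps:                   # test (7)
            break
        d = [0.0] * 4
        if t_min <= -t_max:                                # increase x_p
            j = p
            r = x[basis] / row[j]      # all coefficients of A are positive here
            d[j], d[basis] = r, -row[j] * r
            hits_basic = True
        else:                                              # decrease x_q
            j = q
            big_r = -x[basis] / row[j] if row[j] < 0 else math.inf
            r = min(big_r, x[j])
            d[j], d[basis] = -r, row[j] * r
            hits_basic = big_r <= x[j]
        slope = sum(g[i] * d[i] for i in range(4))
        curvature = 2 * sum(QUAD[i] * d[i] ** 2 for i in range(4))
        t = 1.0 if curvature == 0 else min(1.0, slope / curvature)
        x = [x[i] + t * d[i] for i in range(4)]
        if t == 1.0 and hits_basic:                        # change of basis
            x[basis] = 0.0
            basis = j
        history.append(list(x))
    return history


history = concave_simplex([0.0, 0.0, 0.0, 4.0], basis=3, iterations=8)
for k, point in enumerate(history):
    print(k, round(c(point), 4), [round(v, 4) for v in point])
```

The printed trial points can be compared with the rows of Fig. 15.3; the flag `hits_basic` records whether the step was limited by a basic variable reaching zero, which is the situation in which a change of basis is carried out.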

* 15.4 OTHER APPROACHES 

There are scores of scientific methods for optimizing a nonlinear objective function
subject to linear constraints. The second part of this chapter contains several
more of these approaches. Two other techniques are mentioned briefly here because
of their prominence in nonlinear programming literature. They can be adapted
to problems having nonlinear constraints, but the necessary modifications are not
given in this text.

One technique is called the Cutting-Plane Method, which makes use
of the linear bound (8) in Sec. 15.2, and can be applied to the case of a concave
$c(x)$. At any iteration $k$, you have a current trial feasible point $x^k$ along with a
set of linear constraints consisting of the original relations as well as some bounding
relations, to be described momentarily. The problem has been augmented
by a variable $x_0$ that appears in all the bounding relations. The variable $x_0$ is
to be maximized, so the problem at iteration $k$ can be solved by applying linear
programming techniques. The resultant optimal value of $x_0$ provides an upper
bound on the optimal value of $c(x)$.

To find $x^{k+1}$, you add another bounding linear relation to the current set of
constraints:

(1)  $x_0 - \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j} \cdot x_j \le c(x^k) - \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j} \cdot x_j^k$.

This restriction is not satisfied by $x_0^k$ and $x^k$ (unless these are optimal values),
because $x_0^k$ is always larger than $c(x^k)$. Consequently, by imposing (1), the current
trial point is cut off from further consideration.
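
The cutting-off effect can be checked numerically for Problem P with a small Python sketch. The helper `make_cut` is a name assumed here for illustration; it returns the linear upper bound on $x_0$ that the bounding relation built at a trial point imposes. The linear programs themselves are not solved in the sketch; the trial points (4, 0, 0) and (2, 1, 0) are taken from Fig. 15.4.

```python
def c(x):
    return (6 * x[0] + 4 * x[1] + 2 * x[2]
            - 3 * x[0] ** 2 - 2 * x[1] ** 2 - x[2] ** 2 / 3)


def grad(x):
    return [6 - 6 * x[0], 4 - 4 * x[1], 2 - 2 * x[2] / 3]


def make_cut(xk):
    """Bounding relation built at xk: returns the largest x0 it allows at x."""
    ck, gk = c(xk), grad(xk)
    return lambda x: ck + sum(g * (xj - xkj) for g, xj, xkj in zip(gk, x, xk))


cut0 = make_cut([0.0, 0.0, 0.0])   # relation added at the starting point
cut1 = make_cut([4.0, 0.0, 0.0])   # relation added at trial point x^1

x1 = [4.0, 0.0, 0.0]
x2 = [2.0, 1.0, 0.0]
print(cut0(x1))                    # x0 allowed at x^1 before the new cut
print(cut1(x1))                    # x0 allowed at x^1 after the new cut
print(min(cut0(x2), cut1(x2)))     # x0 allowed at x^2 under both cuts
```

Under the first relation alone, $x_0$ can be as large as 24 at (4, 0, 0); the relation built at that point lowers the allowance to $c(4, 0, 0) = -24$, so the pair $(x_0, x)$ = (24, (4, 0, 0)) is no longer admissible. At (2, 1, 0) both relations allow $x_0 = 16$, which is consistent with the entries shown for iteration 2 in Fig. 15.4.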

It follows that $x_0^{k+1} \le x_0^k$. Further, it can be proved that the sequence of $x_0^k$
converges to the optimal value of $c(x)$, and there is a subsequence of the $x^k$ that
converges to an optimal solution. But the associated sequence of $c(x^k)$ need not
be monotonically increasing. Hence, when you terminate the iterations short of
convergence, you should accept the trial point giving the largest value of $c(x)$

c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - (1/3)x_3^2
x_1 + 2x_2 + x_3 <= 4

Iteration
    k       c(x^k)     x_0^k      x_1^k     x_2^k     x_3^k
    0       0          -          0         0         0
    1       -24        24         4         0         0
    2       2          16         2         1         0
    3       5.908      12.667     1.167     0         2.833
    4       6.066      11.022     .756      1.233     .778
    5       5.685      8.887      .222      .511      2.756
    8       5.849      7.530      1.437     .855      .853
    9       6.869      7.390      1.036     .892      1.180
   10       6.702      7.373      1.281     .464      1.791
   15       7.249      7.264      .868      .639      1.853
Optimal
Values      7.25       7.25       .875      .625      1.875

Figure 15.4. Cutting-Plane Method.
found so far, which need not be the $x^k$ from the final iteration. The algorithm is
illustrated for Problem P in Fig. 15.4. Notice in particular the erratic behavior of
the successive trial solutions $x^k$.

Another algorithm is termed the Gradient Projection Method.
Unlike any of the techniques explained so far, it does not apply linear programming
in Step 2. Rather, it is more akin to the method of steepest ascent in the way it
determines a direction based on the partial derivatives of $c(x)$ at the trial point
$x^k$, and a step size that maximizes $c(x)$ over all feasible points lying in the selected
direction from $x^k$. The component directions are calculated by projecting the
gradient of $c(x)$ on the intersection of those constraints that are exactly satisfied
at $x^k$ (the slack variables for the constraints are at zero level). (The actual
computational process is slightly more involved than this, but the details are not
important for the exposition here.) The step size calculation proceeds by first finding the
largest value of $t$, say $T$, such that the point $(x^k + td)$ is feasible, and then by doing
a one-dimensional search for the maximum of $c(x^k + td)$ in the interval $[0, T]$,
using an approach explained in Sec. 14.3.

The calculations are illustrated for Problem P. Suppose that the trial point
$x^k$ is on the linear constraint for $x_1$, $x_2$, and $x_3$ (that is, the slack variable $s = 0$).
Consider any set of directions $D_1$, $D_2$, $D_3$. The technique transforms these
directions to $d_1$, $d_2$, $d_3$ such that the resultant point remains on the constraint. In
order for this to happen, the $d_j$ must satisfy

(2)  $1d_1 + 2d_2 + 1d_3 = 0$.

State why. The appropriate transformation, or projection, of the $D_j$ to the


c(x) = 6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - (1/3)x_3^2
x_1 + 2x_2 + x_3 <= 4

Iteration                                              Step
    k       c(x^k)     x_1^k     x_2^k     x_3^k       Size
    0       0          0         0         0           .1981
    1       5.547      1.189     .793      .396        .3667
    2       6.537      .774      1.097     1.033       .2749
    3       6.966      1.060     .817      1.306       .3075
    4       7.137      .835      .813      1.539       .2749
    5       7.205      .949      .701      1.648       .3075
    6       7.232      .859      .700      1.741       .2749
   10       7.249      .872      .637      1.854       .2749
   15       7.249      .876      .626      1.873       .3075
Optimal
Values      7.25       .875      .625      1.875       -

Figure 15.5. Gradient Projection Method.
values of $d_j$ can be shown to be

     $d_1 = \tfrac{5}{6}D_1 - \tfrac{1}{3}D_2 - \tfrac{1}{6}D_3$
(3)  $d_2 = -\tfrac{1}{3}D_1 + \tfrac{1}{3}D_2 - \tfrac{1}{3}D_3$
     $d_3 = -\tfrac{1}{6}D_1 - \tfrac{1}{3}D_2 + \tfrac{5}{6}D_3$.

Verify that substituting the right-hand sides of (3) into the left-hand side of (2)
yields 0, as required. The gradient projection approach uses the partial derivatives
of $c(x)$ at $x^k$ to provide the values of $D_j$. Loosely put, the method operates
by taking the steepest ascent along the contours of $c(x)$ that lie on the binding
constraints at $x^k$.

The sequence of iterations for Problem P is shown in Fig. 15.5. By the second
iteration, the trial $x^k$ is already on the constraint. A good starting point on the
constraint can speed convergence. For example, if you use any of the first three
extreme points in (10) in Sec. 15.2, or let each xj = 1, the value of c{x^) is within 
.04 of the optimum 7.25. 


15.5 OPTIMIZATION WITH NONLINEAR 
CONSTRAINTS 

Starting with this section, you will learn how to solve optimization problems
containing nonlinear constraints. For the sake of definiteness, suppose the model
is stated as

(1)  maximize $c(x_1, x_2, \ldots, x_n)$

subject to

(2)  $a_i(x_1, x_2, \ldots, x_n) \le 0$  for $i = 1, 2, \ldots, m$  and each $x_j \ge 0$.

Any of the constraints may be linear, and, in a formal sense, (1) and (2) include,
as special cases, linear programming problems as well as the models in Chap.
14 and the preceding sections in this chapter. The restriction that every variable
be nonnegative is made for expository ease; all the algorithms below are trivially
modified to allow several $x_j$ to be unconstrained in sign. Many of the formulation
techniques described in Sec. 3.2 apply to nonlinear functions, and so you can view
(1) and (2) as a canonical statement of a nonlinear programming problem. Copy
(1) and (2) so that you can easily refer to the model as you read the rest of the
chapter.

Because our purpose is to explain the fundamental ideas underlying various
solution techniques, we make several simplifying assumptions about the nature of
$c(x)$ and $a_i(x)$. Most of the techniques are valid, however, under weaker assumptions.
We first discuss the postulates on the constraint functions $a_i(x)$, and
subsequently treat those on the objective function $c(x)$.

Feasible region. The assumptions on each nonlinear function $a_i(x)$ are
given in terms of its shape and smoothness characteristics. To set the stage, define
a real-valued function $a(x)$ to be convex if, for any two points $x$ and $y$, and for
all $p$, $0 \le p \le 1$,

(3)  $p\,a(x_1, \ldots, x_n) + (1 - p)\,a(y_1, \ldots, y_n) \ge a(px_1 + (1 - p)y_1, \ldots, px_n + (1 - p)y_n)$   Convex,

and strictly convex if there is a strict inequality (>) in (3) for $0 < p < 1$.
[Note that if $-a(x)$ is concave, then $a(x)$ is convex.]

A related characteristic of a convex function is that for any two points $x$ and $y$,

(4)  $a(y) \ge a(x) + \sum_{j=1}^{n} \frac{\partial a(x)}{\partial x_j} \cdot (y_j - x_j)$   Convex.

You can interpret (4) in geometric terms as follows. Let $x$ be any specified
point such that $a(x) = 0$. Then equating the summation on the right-hand side
of (4) to 0 yields a hyperplane that is tangent to the contour $a(y) = 0$. More
generally, the right-hand side of (4) is a linear support to the function at the
given point $x$, and the inequality (4) can be interpreted as stating that the function
always lies above any of its linear supports. [Let $a(x) = x_1^2 - x_2$, and specify
the given point as $x_1 = 2$ and $x_2 = 4$. Draw the contour $a(y) = y_1^2 - y_2 = 0$.
Verify that the line given by setting the summation on the right side of (4)
equal to 0 is $4(y_1 - 2) - (y_2 - 4) = 0$, and is a linear support at the given point.]
We assume for the remainder of the chapter, unless stated otherwise, that the
$a_i(x)$ in (2) satisfy the following shape and smoothness assumptions:

(i) Each $a_i(x)$ is uniquely defined, finite, and convex for all values of
$(x_1, x_2, \ldots, x_n)$.

(ii) Each $\partial a_i(x)/\partial x_j$ is continuous for all $x$ satisfying the constraints (2).

[Note that although assumption (i) implies that $a_i(x)$ is continuous, we need the
stronger assumption (ii) that each first partial derivative is continuous.]
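
The bracketed exercise on the linear support of $a(x) = x_1^2 - x_2$ at the point (2, 4) can be checked numerically. The Python sketch below is only an illustration of inequality (4); the function names and the grid of test points are choices made here, not part of the text.

```python
def a(y):
    return y[0] ** 2 - y[1]


def grad_a(x):
    return [2 * x[0], -1.0]


def support(x, y):
    """Right-hand side of (4): the linear support to a at x, evaluated at y."""
    return a(x) + sum(g * (yj - xj) for g, yj, xj in zip(grad_a(x), y, x))


x = [2.0, 4.0]                      # the given point, where a(x) = 0
grid = [[float(u), float(v)] for u in range(-3, 6) for v in range(-2, 12)]

# a(y) minus the support equals (y1 - 2)^2 here, so it is never negative.
gaps = [a(y) - support(x, y) for y in grid]
print(min(gaps))
print(support(x, [3.0, 8.0]))       # a point on the line 4(y1 - 2) - (y2 - 4) = 0
```

The smallest gap is zero and occurs where $y_1 = 2$, that is, along the vertical line through the point of tangency.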

In addition, we stipulate the following. 

Constraint Qualification. There exists at least one point $x^0 \ge 0$ such that each
$a_i(x^0) < 0$.

Loosely put, the Constraint Qualification states that there is a feasible solution
that lies strictly in the interior of the region defined by (2). (As you will see, such
a point is needed to initiate several of the algorithms below.) You will find it
helpful to jot down assumptions (i), (ii), and the Constraint Qualification, as we
will refer to them repeatedly throughout the chapter.

An important implication of (i) is that all the points satisfying (2) form what is
called a convex set. Specifically, if any two points $x$ and $y$ are feasible in the
constraints (2), then a point that is a convex combination or weighted average
$[px + (1 - p)y]$, where $0 \le p \le 1$, is also feasible. The reason follows from (3):
since $a_i(x) \le 0$ and $a_i(y) \le 0$ for each $i$, then
$a_i(px_1 + (1 - p)y_1, \ldots, px_n + (1 - p)y_n) \le 0$, where $0 \le p \le 1$. Verify that the
same line of reasoning applies to the nonnegativity restrictions. The Constraint
Qualification ensures that the convex set of feasible solutions contains a
full-dimensioned interior with an infinite number of points.

Two other implications of assumptions (i), (ii), and the Constraint Qualification
are not so obvious. First, if the point $z$ is feasible for the constraints in (2),
and $a_i(z) = 0$ for a particular $i$, then the gradient of $a_i(x)$ at $z$ cannot be zero;
that is, there is at least one $j$ such that $\partial a_i(z)/\partial x_j \ne 0$. Second, the model does not
really accommodate an equality constraint unless this constraint function is linear;
in that case, it is permissible to include it in (2) by inserting a pair of inequalities,
and removing the Constraint Qualification for this pair.

In many situations, you can establish that a function is convex by analyzing
whether it can be described as:

(A) A sum of convex functions, for example, $3x^2 - 6x$.

(B) The negative of a concave function, for example, $-(4y - 2y^2)$.

(C) A monotone increasing convex function of a convex function—for exam¬ 
ple, a** for a > 0, since a{z) = a' is monotone increasing and convex for 
a > 0, and z = is convex. 

(D) A monotone decreasing convex function of a concave function—for 
example, for a > 0, since a{z) = a"* is monotone decreasing and con¬ 
vex for a > 0, and z = —is concave. 

(E) A convex function of a linear function—for example, for a > 0, 

since a(z) = a^* is convex for a > 0, and z = x —y is linear. 

According to (A), then, the function 3 a:^ — 6a; — 4^ + 2j- -f- a"^* + 

+ 10 is convex for $a > 0$. Also, if $a(x)$ is a function of a single variable $x$ and has
a continuous second derivative, then $a(x)$ is convex if and only if $d^2a(x)/dx^2 \ge 0$ for
all $x$. (The multidimensional analogue of this inequality is that the Hessian matrix
of second-order partial derivatives must be positive semidefinite.)

Objective function. The function $c(x)$ is also hypothesized to satisfy
certain shape and smoothness assumptions. We maintain assumptions (iii) through
(v) in Sec. 15.1 and add one more postulate. Specifically,

(iii) $c(x)$ is single-valued and finite for each $x$ satisfying the constraints (2).

(iv) Every partial derivative $\partial c(x)/\partial x_j$ is single-valued, finite, and continuous
at each $x$ satisfying the constraints (2).

(v) $c(x)$ possesses a finite maximum $c$ over all values of $x$ satisfying the
constraints (2).

(vi) $c(x)$ is concave over all values of $x$ satisfying the constraints (2).

[You also should add assumption (vi) to your previous list for easy reference
as you read the rest of the chapter.]

Assumptions (i) through (vi) and the Constraint Qualification together
guarantee that:

(A) There exists at least one feasible solution $x$ such that $c(x) = c$.

(B) If $c(x)$ is strictly concave, then there is a unique optimal solution $x$.

(C) If $x$ is a constrained stationary point, then $x$ is globally optimal.

These three conclusions remain valid under somewhat weaker assumptions
on the $a_i(x)$ and $c(x)$. But an essential requirement for conclusion (C) is that the
feasible region be convex; for otherwise, a locally optimal point need not be
globally optimal, even if $c(x)$ satisfies assumptions (iii) through (vi).

New difficulty. The algorithms for nonlinear constraints have much
in common with those for linear constraints. In particular, each iteration
commences with a feasible trial point $x^k$ and tests whether $x^k$ is optimal. If $c(x)$ can be
increased further, a direction of improvement is determined and a trial point
with a larger value of $c(x)$ is found. But a new difficulty arises in calculating the
directions when the constraints are nonlinear. (Before continuing, you may want
to reflect on how the techniques in Sec. 15.2, as well as in Secs. 15.3 and 15.4,
exploited the linearity of the constraints to determine feasible directions of
improvement.)

Suppose the trial point is at the boundary of a truly nonlinear constraint, as is
shown in Fig. 15.6. The diagram suggests that the trial point be moved along


Figure 15.6. Problem of Feasible Directions. Model: Find a Feasible Point x Closest to P.


the constraint boundary. This approach can be very awkward computationally,
however, because of the nonlinearity of the boundary. There are several alternative
ways to proceed in finding feasible directions of improvement, and you will
see these described in the subsequent sections. The differences among the several
ways can be summarized as follows.

One is to consider only directions that point into the interior of the constraint
region when the trial solution is on a nonlinear boundary. Notice in Fig. 15.6
that there is always such a direction of improvement unless the trial point is truly
optimal. This approach is used in Sec. 15.6. Another way is to prohibit a trial
point from ever reaching a boundary; this approach is treated in Sec. 15.9. A
third way is to use a grid of feasible points and approximate the constraint region
by linearization among these points. This method is employed in Sec. 15.10.
In all of these approaches, the successive values of $c(x)$ converge monotonically
to the optimal value $c$. At least a subsequence of the trial points converges to an
optimal solution, and if the optimal solution is unique, then the entire sequence
converges to this point.

The several algorithms will be illustrated using Problem P in Sec. 15.1 as well as

Problem Q:  maximize    $c(x) = -x_1 - 2x_2 - x_3$
            subject to  $-6x_1 - 4x_2 - 2x_3 + 3x_1^2 + 2x_2^2 + \tfrac{1}{3}x_3^2 + 7.25 \le 0$
            and all $x_j \ge 0$,

which has the same unique optimal solution as Problem P and $c = -4$. Copy
Problem Q for easy reference in reading the subsequent sections.

We repeat the warning from Chap. 14 that you should be very cautious in
making relative evaluations of the techniques based on the particular numerical
results below, since Problems P and Q are quite special: they contain a quadratic
form without any cross-product terms. Furthermore, the results do not show
what happens when acceleration devices are applied to improve the rate of
convergence. But these examples do underscore the significant observation that
nonlinear problems are typically much harder to solve than linear ones.


15.6 METHOD OF FEASIBLE DIRECTIONS 

This algorithm is an extension of the Method of Convex Combinations treated in
Sec. 15.2. The principal procedural difference is the formulation of the linear
programming problem for obtaining the directions of improvement in Step 2.
Like the Method of Convex Combinations, the algorithm still converges to a
constrained stationary point when you drop assumption (vi) that $c(x)$ is concave.

You saw at the end of the preceding section that a fundamental difficulty in
finding feasible directions arises when a trial solution $x^k$ is on a nonlinear
boundary of the constraint region. The formulation below attacks this difficulty
by taking linear approximations to the constraint functions at $x^k$, and then selecting
directions that point inward from those nonlinear constraints that are binding at
$x^k$. To explain the approach, it is convenient to formulate the ideas in terms of the
directions $d_j$, and for notational simplicity we drop the superscript $k$ denoting
the values of $d_j$ at the current iteration.

Given the continuity assumptions on $c(x)$, the current value of $c(x)$ will improve
by moving in the directions $d_j$ from $x^k$ provided that

(1)  $\sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j} \cdot d_j > 0$.

Therefore, (1) gives a test for whether a specific set of directions can bring about
an increase in the objective function.

It is also convenient to drop the distinction between the $a_i(x)$ and the
nonnegativity stipulations, and therefore we designate the entire collection of
constraints as

(2)  $A_i(x) \le 0$  for $i = 1, 2, \ldots, M$ $(= m + n)$.

We employ a first-order linear approximation to $A_i(x^k + d)$:

(3)  $A_i(x_1^k + d_1, x_2^k + d_2, \ldots, x_n^k + d_n) \approx A_i(x^k) + \sum_{j=1}^{n} \frac{\partial A_i(x^k)}{\partial x_j} \cdot d_j$.

Recall, from the preceding section, that the assumptions on the $a_i(x)$ ensure that
at least one partial derivative under the summation sign is nonzero.

Suppose $x^k$ is interior to the $i$th constraint so that $A_i(x^k) < 0$. Then the
right-hand side of (3) is strictly negative if all $d_j = 0$. Verify that in this case the point
$(x^k + td)$ will satisfy the $i$th constraint for any set of directions $d_j$ provided $t > 0$
is sufficiently small.

Suppose next that $x^k$ is on the boundary of the $i$th constraint, so that $A_i(x^k) = 0$.
Then there is a $t > 0$ such that the point $(x^k + td)$ satisfies the $i$th constraint only
if the directions $d_j$ make the right-hand side of (3) strictly negative. The reason is
that the right-hand side of (3) equated to 0 yields values for the directions that lie
on the linear support (or tangent hyperplane) to the constraint $A_i(x)$ at the point
$x^k$. Therefore, if the directions $d_j$ produce zero or positive values for the
right-hand side of (3), then these directions point on or above this support, and a new
trial solution $(x^k + td)$ will not satisfy the $i$th constraint for any $t > 0$. State why.

In summary, the directions $d_j$ lead to a new feasible trial point $(x^k + td)$ that
increases $c(x)$ if the value in (1) is strictly positive and the right-hand sides in
(3) are strictly negative. If no such directions exist, then $x^k$ is a constrained
stationary point.
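
The two-part test summarized above can be illustrated for Problem Q with a small Python sketch. The function `best_x0` is a name assumed here: for a candidate set of directions it returns the smaller of the first-order gain in $c(x)$ and the first-order slack left in every constraint (the nonlinear one and the three nonnegativity restrictions). A strictly positive result means the directions pass both parts of the test. The sketch only evaluates given directions; it does not search for the best ones.

```python
def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def grad_c(x):
    return [-1.0, -2.0, -1.0]       # c(x) = -x1 - 2x2 - x3


def a(x):
    return (-6 * x[0] - 4 * x[1] - 2 * x[2]
            + 3 * x[0] ** 2 + 2 * x[1] ** 2 + x[2] ** 2 / 3 + 7.25)


def grad_a(x):
    return [-6 + 6 * x[0], -4 + 4 * x[1], -2 + 2 * x[2] / 3]


def best_x0(x, d):
    """Smallest of: first-order gain in c, and first-order slack in each constraint."""
    gain = dot(grad_c(x), d)
    slack_nonlinear = -(a(x) + dot(grad_a(x), d))
    slack_nonneg = [x[j] + d[j] for j in range(3)]   # from the relations -x_j <= 0
    return min([gain, slack_nonlinear] + slack_nonneg)


interior = [1.0, 1.0, 3.0]                 # a(x) = -0.75, strictly feasible
print(best_x0(interior, [-0.1, -0.1, -0.1]))

optimum = [0.875, 0.625, 1.875]            # a(x) = 0, the optimal solution
trials = [[1.0, 0.0, 0.0], [-1.0, 0.0, 0.0], [1.0, -1.0, 1.0], [0.0, -1.0, 0.5]]
print([best_x0(optimum, d) for d in trials])
```

At the interior point the sample directions give a positive value, so a feasible improvement exists. At the optimal solution the gradients of $c(x)$ and of the binding constraint are proportional, so no direction can make both quantities positive, and every trial value is zero or negative.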

A linear programming model can be employed for finding feasible directions to improve $c(x)$, if such exist at the point $x^k$. Specifically, the formulation is to determine $x_0, d_1, \ldots, d_n$ satisfying

(4) maximize $x_0$

subject to

(5) $x_0 \le \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j} d_j$ or equivalently $x_0 - \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j} d_j \le 0$,

(6) $A_i(x^k) + \sum_{j=1}^{n} \frac{\partial A_i(x^k)}{\partial x_j} d_j \le -x_0$ or equivalently $x_0 + \sum_{j=1}^{n} \frac{\partial A_i(x^k)}{\partial x_j} d_j \le -A_i(x^k)$ for $i = 1, 2, \ldots, M$,

(7) $x_0$ and each $d_j$ unconstrained in sign.

Explain why (4) through (6) can be used to discover directions that satisfy the test in (1) and make the right-hand sides of (3) strictly negative when $x^k$ is not a constrained stationary point. (Note that the optimal $x_0$ is never negative, since all $A_i(x^k) \le 0$ and $x_0 = 0$ is feasible.)

For computational purposes, the formulation (4) through (7) is not quite complete. The optimization can lead to an unbounded solution. To circumvent this nuisance, it is convenient to add linear regularization restrictions. The nonnegativity constraints included in the $A_i(x)$ guarantee in (6) that the $d_j$ are bounded from below by $-x_j^k$ for $x_0 \ge 0$; hence, the added regularization restrictions need to protect only against large $d_j$. There are many stipulations that will work, a simple set being

(8) $d_j \le b$ for each $j$,

where $b$ is an arbitrarily chosen positive constant (such as 1). By using (8) and changing variables ($z_j = d_j + x_j^k$), you can transform (5) through (7) into a linear programming model with only $1 + m$ constraints and $n$ nonnegative and upper-bounded variables. Specifically, one constraint corresponds to (5), and $m$ constraints correspond to only the $a_i(x)$ relations contained in (6); each $z_j$, for $j = 1, 2, \ldots, n$, satisfies $0 \le z_j \le b + x_j^k$. Therefore, you can apply the upper-bounded simplex algorithm in Sec. 5.10 to solve the transformed version.

The entire procedure for Step 2 is:

(i) Evaluate all the partial derivatives $\partial c/\partial x_j$ at trial point $x^k$.

(ii) Solve the linear programming problem consisting of (4) through (8).

(iii) Terminate the calculations if the optimal $x_0 = 0$.

(iv) Otherwise, find a step size $t$ such that $c(x^k + td)$ is maximized over all feasible points lying in direction $d$ from $x^k$.

If the optimal value of $x_0 = 0$ at (iii), then $x^k$ is a constrained stationary point. The optimal feasible step size in (iv) is always finite, given the assumption that $c(x)$ possesses a finite maximum for all values of $x$ satisfying the constraints.
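
The direction test in (ii) and (iii) can be illustrated with a small sketch. It is not the simplex computation the text prescribes: a coarse grid over $d$ stands in for the linear program, so it only approximates the optimal $x_0$. The data are an assumption, namely Problem P written as maximizing $c(x) = 6x_1 - 3x_1^2 + 4x_2 - 2x_2^2 + 2x_3 - \tfrac{1}{3}x_3^2$ subject to $x_1 + 2x_2 + x_3 \le 4$ and $x_j \ge 0$; the function names are hypothetical.

```python
from fractions import Fraction as F
from itertools import product

def grad_c(x):
    # Gradient of the assumed Problem P objective
    # c(x) = 6x1 - 3x1^2 + 4x2 - 2x2^2 + 2x3 - x3^2/3.
    return [6 * (1 - x[0]), 4 * (1 - x[1]), 2 * (1 - x[2] / 3)]

def constraints(x):
    # Each entry is (A_i(x), gradient of A_i); feasibility means A_i(x) <= 0.
    rows = [(x[0] + 2 * x[1] + x[2] - 4, [1, 2, 1])]
    for j in range(3):
        g = [0, 0, 0]
        g[j] = -1
        rows.append((-x[j], g))  # nonnegativity written as -x_j <= 0
    return rows

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def x0_for(x, d):
    # Largest x0 permitted by (5) and (6) once the direction d is fixed.
    bounds = [dot(grad_c(x), d)]
    bounds += [-a - dot(g, d) for a, g in constraints(x)]
    return min(bounds)

def best_grid_direction(x, b=1, steps=4):
    # Coarse stand-in for the LP (4)-(8): try each d_j on an even grid in [-b, b].
    grid = [F(-b) + F(2 * b * i, steps) for i in range(steps + 1)]
    return max((x0_for(x, d), d) for d in product(grid, repeat=3))

if __name__ == "__main__":
    trial = [F(1), F(1), F(1)]               # a boundary point that is not stationary
    optimum = [F(7, 8), F(5, 8), F(15, 8)]   # the point the text reports as optimal
    print(best_grid_direction(trial))
    print(best_grid_direction(optimum)[0])
```

A strictly positive value signals an improving feasible direction; a value of 0 is what the termination test in (iii) looks for.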

► Actually, it is not necessary to obtain an optimal value for $x_0$ at every iteration. Provided an optimum is sought regularly (for example, on every 10th iteration), you need only find a feasible solution that gives a positive value for $x_0$ on the other iterations.

If $c(x)$ is linear, then the value for the upper bound $b$ in (8) should be made large, since (5) is an exact representation of the incremental effect on $c(x)$. If a constraint $A_i(x)$ is linear, then the variable $x_0$ can be omitted from the corresponding restriction in (6). (Such a simplification applies to the nonnegativity restrictions $-x_j \le 0$.) This modification permits the directions to be chosen so as to lie along any of the linear constraints that are binding at the current trial solution. In the limiting case when all of the $A_i(x)$ are linear, so that $x_0$ is thereby eliminated from all the constraints except (5), the formulation is nearly the same as that in the Method of Convex Combinations in Sec. 15.2. The principal difference is the regularization procedure. Given the assumption in Sec. 15.2 that the original linear constraints satisfy a regularization postulate, there is no need to impose any further bounding conditions. But here the additional regularization must be imposed because the formulation is in terms of the directions. It is also possible to devise a procedure to obtain an upper bound on the optimal $c(x)$ in a manner analogous to (8) in Sec. 15.2 for the Method of Convex Combinations.

The approach employs only the first-order partial derivatives of $c(x)$, and can be modified to be a second-order approach. Then a quadratic programming technique will be used to calculate the directions. ◄

Example. Applying the technique to Problem Q, the constraint (5) is

(9) $x_0 + d_1 + 2d_2 + d_3 \le 0$,

and restrictions in (6) become

(10) $x_0 + [-6(1 - x_1^k)]d_1 + [-4(1 - x_2^k)]d_2 + [-2(1 - \tfrac{1}{3}x_3^k)]d_3 \le 6x_1^k + 4x_2^k + 2x_3^k - 3(x_1^k)^2 - 2(x_2^k)^2 - \tfrac{1}{3}(x_3^k)^2 - 7.25$

(11) $x_0 - d_j \le x_j^k$ for $j = 1, 2, 3$.

The resultant sequence of trial points is exhibited in Fig. 15.7. The problem was solved using $b = 1$ in (8). The procedure was simplified further by imposing the regularization constraints ($-1 \le d_j \le 1$) instead of (11). For $k \ge 1$, the trial point $x^k$ lies on the quadratic constraint. The solution at $k = 10$ agrees with the optimal solution to four decimal places.

[Figure 15.7. Method of Feasible Directions: Problem Q. Trial points tabulated by iteration $k$; the row for $k = 10$ reads $-4.00$, $.875$, $.625$, $1.875$, and the optimal values are $-4$, $.875$, $.625$, $1.875$.]

15.7 THEORETICAL PROPERTIES OF AN 
OPTIMAL SOLUTION 

Before you go any further in studying other algorithms for the nonlinear programming problem posed at the beginning of Sec. 15.5, you will find it helpful to learn certain fundamental ideas about the problem's theoretical underpinnings. In this section you will see several closely related statements of conditions that are both necessary and sufficient for a point to be an optimal solution. Such results are useful in at least three ways; these conditions:

(i) Provide numerical approaches for testing the optimality of a proposed solution.

(ii) Give information about solution sensitivity.

(iii) Lead to algorithmic methods for obtaining an optimal solution.

Each of these uses is illustrated below. The purpose of this discussion, then, is to develop your insights into the nature of optimal solutions as well as to enhance your understanding of optimization techniques.

Recall from the preceding section that you can determine whether there are directions of improvement from a trial point $x^k$ by solving the following linear programming model:

(1) maximize $x_0$

subject to

(2) $x_0 - \sum_{j=1}^{n} \frac{\partial c(x^k)}{\partial x_j} d_j \le 0$

(3) $x_0 + \sum_{j=1}^{n} \frac{\partial A_i(x^k)}{\partial x_j} d_j \le -A_i(x^k)$ for $i = 1, 2, \ldots, M$ $(= m + n)$

(4) $x_0$ and each $d_j$ unrestricted in sign.

For the purposes here, there is no need to add any regularization conditions on each $d_j$, and so conditions such as (8) in the preceding section are omitted. Given assumptions (i) through (vi) on $c(x)$ and $a_i(x)$ in Sec. 15.5, the point $x^k$ is truly optimal if, and only if, the optimal value of $x_0$ is 0 in (1) through (4).

Now suppose $\bar{x}$ is an optimal point, and let $x^k = \bar{x}$ in (2) and (3), so that $x_0 = 0$ is optimal in (1) through (4). Then consider the corresponding dual linear program (see the advanced material in Sec. 5.4):

(5) minimize $-\sum_{i=1}^{M} A_i(\bar{x})\, y_i$

subject to

(6) $y_0 + \sum_{i=1}^{M} y_i = 1$

(7) $-\frac{\partial c(\bar{x})}{\partial x_j}\, y_0 + \sum_{i=1}^{M} \frac{\partial A_i(\bar{x})}{\partial x_j}\, y_i = 0$ for $j = 1, 2, \ldots, n$

(8) $y_i \ge 0$ for $i = 0, 1, 2, \ldots, M$.

The Dual Theorem ensures that optimal values $\bar{y}_i$ exist, and also that the optimal value for the expression in (5) is 0, given that $\bar{x}$ is an optimal point. Furthermore, the assumptions in Sec. 15.5 ensure that $\bar{y}_0 > 0$. Therefore, after rearranging and simplifying terms in (7), we can state the following.


Optimality Conditions. Given the Constraint Qualification and the assumptions (i) through (vi) on $a_i(x)$ and $c(x)$ in Sec. 15.5, a point $\bar{x}$ is optimal if, and only if,

(9) $A_i(\bar{x}) \le 0$ for $i = 1, 2, \ldots, M$ $(= m + n)$,

and there exists $\bar{u}_i \ge 0$, for $i = 1, 2, \ldots, M$, such that

(10) $A_i(\bar{x}) \cdot \bar{u}_i = 0$ for $i = 1, 2, \ldots, M$    Complementary slackness

(11) $\frac{\partial c(\bar{x})}{\partial x_j} - \sum_{i=1}^{M} \bar{u}_i \frac{\partial A_i(\bar{x})}{\partial x_j} = 0$ for $j = 1, 2, \ldots, n$,

where $\bar{u}_i = \bar{y}_i/\bar{y}_0$ and the $\bar{y}_i$ satisfy (5) through (8).

The relations (10) and (11) are often referred to as the Kuhn-Tucker conditions, in honor of the mathematicians who first proved their validity.

The inequalities in (9) include the nonnegativity restrictions on each variable, and hence (9) merely expresses that $\bar{x}$ must be feasible. Verify that the conditions in (10) can be derived by dividing the expression in (5) by $\bar{y}_0 > 0$, noting that the resultant sum still equals 0 at the optimum, implying that each term in the sum equals 0, since each $A_i(\bar{x}) \le 0$ and $\bar{u}_i \ge 0$. Finally, check that the conditions in (11) can be obtained by dividing the conditions in (7) by $-\bar{y}_0$. Hence the Optimality Conditions can be deduced from the validity of using the linear programming problem (1) through (4) to verify that a point $\bar{x}$ is optimal. If you specifically designate $A_i(x) = a_i(x)$, for $i = 1, 2, \ldots, m$, and $A_{m+j}(x) = -x_j$, for $j = 1, 2, \ldots, n$, then (10) and (11) can be shown to imply

(12) $a_i(\bar{x}) \cdot \bar{u}_i = 0$ for $i = 1, 2, \ldots, m$

(13) $\bar{x}_j \cdot \left[\frac{\partial c(\bar{x})}{\partial x_j} - \sum_{i=1}^{m} \bar{u}_i \frac{\partial a_i(\bar{x})}{\partial x_j}\right] = 0$ for $j = 1, 2, \ldots, n$

(14) $\frac{\partial c(\bar{x})}{\partial x_j} - \sum_{i=1}^{m} \bar{u}_i \frac{\partial a_i(\bar{x})}{\partial x_j} \le 0$ for $j = 1, 2, \ldots, n$.

Note in (12) that if $a_i(\bar{x}) < 0$, then $\bar{u}_i = 0$; and in (13), that if $\bar{x}_j > 0$, then the corresponding expression in (14) equals 0. [To check your understanding, verify that when $c(x)$ and all the $a_i(x)$ are linear, then (9), (12), (13), and (14) reduce to the results given by the Theorem of Complementary Slackness in Sec. 5.3.]


► The conditions (9), (10), and (11) remain sufficient for $\bar{x}$ to be an optimal solution under somewhat weaker shape assumptions on $c(x)$ and $a_i(x)$. Loosely put, the $A_i(x)$ must be such that they define a convex constraint region, and, together with $c(x)$, must imply that a locally optimal solution is also a globally optimal solution.

Instead of eliminating $\bar{y}_0$, we can state the Optimality Conditions (10) and (11) as: there exists $\bar{y}_i \ge 0$ for $i = 0, 1, \ldots, M$, but not all $\bar{y}_i$ equal to 0, such that

(i) $A_i(\bar{x}) \cdot \bar{y}_i = 0$ for $i = 1, 2, \ldots, M$

(ii) $\bar{y}_0 \frac{\partial c(\bar{x})}{\partial x_j} - \sum_{i=1}^{M} \bar{y}_i \frac{\partial A_i(\bar{x})}{\partial x_j} = 0$ for $j = 1, 2, \ldots, n$.

[The relations (i) and (ii) are sometimes referred to as the John conditions.] The fact that not all $\bar{y}_i$ equal 0 follows from (6).

This formulation is important, because the conditions (9), (i), and (ii) remain necessary for $\bar{x}$ to be an optimal solution under very general assumptions. The only stipulation needed is that all the functions $a_i(x)$ and $c(x)$ have continuous first-order partial derivatives for every value of $(x_1, x_2, \ldots, x_n)$; subject to this restriction, they can have any shape. In order that (9), (10), and (11) also be necessary under such mild restrictions, an additional hypothesis such as the Constraint Qualification in Sec. 15.5 must be imposed to ensure that $\bar{y}_0 > 0$. ◄


The Optimality Conditions are useful in all three of the ways indicated at the start of this section. We demonstrate next how they can be employed to verify or derive an optimal solution in a small problem; subsequently, we show why they are significant for sensitivity testing and for the development of algorithms to solve large problems.

Consider Problem P in Sec. 15.1, and let the constraints be designated as

(15) $A_i(x) = -x_i \le 0$ for $i = 1, 2, 3$
     $A_4(x) = x_1 + 2x_2 + x_3 - 4 \le 0$.

Then (10) yields, after simplification,

(16) $x_1u_1 = 0$    $x_2u_2 = 0$    $x_3u_3 = 0$
     $(x_1 + 2x_2 + x_3 - 4)u_4 = 0$,

and (11) gives, after simplification,

(17) $6(1 - x_1) = -u_1 + u_4$
     $4(1 - x_2) = -u_2 + 2u_4$
     $2(1 - \tfrac{1}{3}x_3) = -u_3 + u_4$.

Suppose you guess that all $x_j > 0$ in an optimal solution, and consequently, from (16), that $u_1 = u_2 = u_3 = 0$. Then you can express each $x_j$ in terms of $u_4$ according to (17):

(18) $x_1 = 1 - \tfrac{1}{6}u_4$    $x_2 = 1 - \tfrac{1}{2}u_4$    $x_3 = 3 - \tfrac{3}{2}u_4$.

Substituting the relations (18) into the constraint in (16) yields, after simplification,

(19) $(3 - 4u_4)u_4 = 0$.

The root $u_4 = 0$ gives $x = (1, 1, 3)$, which violates the fourth constraint in (15), so that

(20) $\bar{u}_4 = \tfrac{3}{4}$ and $\bar{x}_1 = \tfrac{7}{8}$, $\bar{x}_2 = \tfrac{5}{8}$, $\bar{x}_3 = \tfrac{15}{8}$.

Accordingly, the point $\bar{x}$ in (20) is optimal, since it is feasible in the constraints (15), and there exists an associated set of $\bar{u}_i \ge 0$ satisfying the conditions (16) and (17) at $\bar{x}$. To check your understanding, show that the assumptions $x_1 > 0$ and $x_2 = x_3 = 0$ imply that there are no $u_i \ge 0$ that satisfy (16) and (17).
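
As a numerical companion to this example, the sketch below checks feasibility, complementary slackness, and the gradient relations at a candidate pair $(x, u)$ in exact arithmetic. It assumes Problem P has the objective $c(x) = 6x_1 - 3x_1^2 + 4x_2 - 2x_2^2 + 2x_3 - \tfrac{1}{3}x_3^2$, which is inferred from the partial derivatives $6(1 - x_1)$, $4(1 - x_2)$, $2(1 - \tfrac{1}{3}x_3)$ and the optimal value 7.25; the function names are hypothetical.

```python
from fractions import Fraction as F

def c(x):
    # Assumed Problem P objective (inferred, not quoted from Sec. 15.1).
    x1, x2, x3 = x
    return 6 * x1 - 3 * x1**2 + 4 * x2 - 2 * x2**2 + 2 * x3 - x3**2 / 3

def grad_c(x):
    x1, x2, x3 = x
    return [6 * (1 - x1), 4 * (1 - x2), 2 * (1 - x3 / 3)]

def kkt_holds(x, u):
    """Check the conditions of the example for multipliers u = (u1, u2, u3, u4)."""
    slack4 = x[0] + 2 * x[1] + x[2] - 4           # A_4(x)
    feasible = all(v >= 0 for v in x) and slack4 <= 0
    multipliers_ok = all(v >= 0 for v in u)
    complementary = (all(x[j] * u[j] == 0 for j in range(3))
                     and slack4 * u[3] == 0)
    a4 = [1, 2, 1]                                # gradient of A_4
    g = grad_c(x)
    stationary = all(g[j] == -u[j] + a4[j] * u[3] for j in range(3))
    return feasible and multipliers_ok and complementary and stationary

if __name__ == "__main__":
    x_bar = [F(7, 8), F(5, 8), F(15, 8)]
    u_bar = [F(0), F(0), F(0), F(3, 4)]
    print(kkt_holds(x_bar, u_bar), c(x_bar))
```

Exact fractions are used so that the equality tests are not disturbed by rounding.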

The dual linear program (5) through (8) provides a source of information about solution sensitivity. But instead of employing this device, we will provide in the next paragraphs an important alternative formulation of optimality conditions that directly yields the desired sensitivity analysis. What is more, the approach is convenient for explaining the algorithmic implications of the preceding Optimality Conditions; these computational aspects are developed subsequently in this chapter.

A Lagrangian function and its saddle point. Necessary and sufficient conditions for a trial solution $\bar{x}$ to be optimal can also be stated in terms of what is called a Lagrangian function. Specifically, let

(21) $L(x, u) = c(x) - \sum_{i=1}^{M} u_i A_i(x)$.

The Lagrangian function $L(x, u)$ is said to possess a saddle point at the pair $(\bar{x}_1, \bar{x}_2, \ldots, \bar{x}_n)$ and $(\bar{u}_1, \bar{u}_2, \ldots, \bar{u}_M)$, where each $\bar{u}_i \ge 0$, if

(22) $L(x, \bar{u}) \le L(\bar{x}, \bar{u}) \le L(\bar{x}, u)$

for all $(x_1, x_2, \ldots, x_n)$ and nonnegative $(u_1, u_2, \ldots, u_M)$. In this context, the variables $u_i$ are called Lagrange multipliers. Loosely put, the function $L(x, \bar{u})$ decreases as $x$ moves away from $\bar{x}$, and the function $L(\bar{x}, u)$ increases as $u$ moves away from $\bar{u}$.

It can be shown that a pair $\bar{x}$ and $\bar{u} \ge 0$ satisfying the Optimality Conditions (9), (10), and (11) is also a saddle point of $L(x, u)$, and vice versa. Therefore, from the Lagrangian view, we can state the following conditions.

Lagrange Optimality Conditions. Given the Constraint Qualification and the assumptions (i) through (vi) on $a_i(x)$ and $c(x)$ in Sec. 15.5, a point $\bar{x}$ is optimal if, and only if, there is a point $\bar{u} \ge 0$ such that the pair $(\bar{x}, \bar{u})$ is a saddle point of $L(x, u)$. Further, the summation term on the right of (21) equals 0 for a saddle point.

[Interestingly, the sufficient condition that $\bar{x}$ is optimal if the pair $(\bar{x}, \bar{u})$ is a saddle point holds when all the smoothness and shape assumptions on $a_i(x)$ and $c(x)$ in Sec. 15.5 are dropped.]
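
The saddle-point inequalities can be spot-checked numerically. The sketch below samples $x$ and $u \ge 0$ on coarse grids and tests both inequalities at the pair reported for Problem P, $\bar{x} = (\tfrac{7}{8}, \tfrac{5}{8}, \tfrac{15}{8})$ and $\bar{u} = (0, 0, 0, \tfrac{3}{4})$. A grid check is only evidence, not a proof, and the objective used for Problem P is an assumption inferred from its partial derivatives and optimal value.

```python
from fractions import Fraction as F
from itertools import product

def c(x):
    # Assumed Problem P objective (inferred, not quoted from Sec. 15.1).
    x1, x2, x3 = x
    return 6 * x1 - 3 * x1**2 + 4 * x2 - 2 * x2**2 + 2 * x3 - x3**2 / 3

def A(x):
    # Nonnegativity as -x_j <= 0, then the linear constraint.
    return [-x[0], -x[1], -x[2], x[0] + 2 * x[1] + x[2] - 4]

def lagrangian(x, u):
    return c(x) - sum(ui * ai for ui, ai in zip(u, A(x)))

x_bar = (F(7, 8), F(5, 8), F(15, 8))
u_bar = (F(0), F(0), F(0), F(3, 4))
center = lagrangian(x_bar, u_bar)

grid_x = [F(i, 2) for i in range(-2, 9)]   # -1, -1/2, ..., 4
grid_u = [F(i, 2) for i in range(0, 5)]    # 0, 1/2, ..., 2

# Left inequality: L(x, u_bar) never exceeds L(x_bar, u_bar) on the grid.
left_ok = all(lagrangian(x, u_bar) <= center for x in product(grid_x, repeat=3))
# Right inequality: L(x_bar, u) never falls below L(x_bar, u_bar) on the grid.
right_ok = all(center <= lagrangian(x_bar, u) for u in product(grid_u, repeat=4))

if __name__ == "__main__":
    print(center, left_ok, right_ok)
```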

Notice that the Lagrangian $L(x, u)$ is concave in $x$ for $u \ge 0$, and linear in $u$. The connection between the previous and the Lagrange Optimality Conditions can be reasoned as follows. At a saddle point, the first-order partial derivative of $L(x, \bar{u})$ with respect to each $x_j$ must be nonpositive, in agreement with (14). State why. Further, in agreement with (13), either $\bar{x}_j$ or the first-order partial derivative of $L(x, \bar{u})$ with respect to $x_j$ (or both) must equal 0, for otherwise, $L(x, \bar{u})$ can be increased by varying $x_j$. Similarly, the first-order partial derivative of $L(\bar{x}, u)$ with respect to each $u_i$ must be nonnegative, in agreement with (9). And either $\bar{u}_i$ or the first-order partial derivative of $L(\bar{x}, u)$ with respect to $u_i$ (or both) must equal 0, in agreement with (12). State why.

Because $L(\bar{x}, \bar{u})$ equals the optimal value of $c(x)$ at a saddle point, the sensitivity of the optimal value of $c(x)$ can be determined by calculating the corresponding impact on $L(\bar{x}, \bar{u})$. For example, if the $i$th constraint is revised to $A_i(x) \le \epsilon$, where $\epsilon$ is a very small positive number, then the optimal value of $c(x)$ will increase approximately by the amount $\epsilon\bar{u}_i$ (except in cases where $\bar{x}$ and $\bar{u}$ are not continuous functions of $\epsilon$ for infinitesimal values of $\epsilon$). To illustrate, you saw in (20) that $\bar{u}_4 = \tfrac{3}{4}$ for Problem P. If the fourth constraint in (15) is changed to $A_4(x) \le \epsilon$, then $c(\bar{x})$ increases from the value 7.25 by approximately $\tfrac{3}{4}\epsilon$ for infinitesimal $\epsilon$.
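
The sensitivity claim for Problem P can be checked by re-solving the relaxed problem in closed form. The sketch assumes the objective $c(x) = 6x_1 - 3x_1^2 + 4x_2 - 2x_2^2 + 2x_3 - \tfrac{1}{3}x_3^2$ (inferred, not quoted from Sec. 15.1), and it assumes that for small $\epsilon$ every $x_j$ stays positive and the relaxed constraint stays binding; `solve_relaxed` is a hypothetical helper.

```python
from fractions import Fraction as F

def c(x):
    # Assumed Problem P objective.
    x1, x2, x3 = x
    return 6 * x1 - 3 * x1**2 + 4 * x2 - 2 * x2**2 + 2 * x3 - x3**2 / 3

def solve_relaxed(eps):
    """Optimum when the constraint reads x1 + 2*x2 + x3 - 4 <= eps.

    With u1 = u2 = u3 = 0 the gradient conditions give
    x1 = 1 - u/6, x2 = 1 - u/2, x3 = 3 - 3u/2, so x1 + 2*x2 + x3 = 6 - 8u/3.
    Setting this equal to 4 + eps determines the multiplier u.
    """
    u = F(3, 8) * (2 - eps)
    x = (1 - u / 6, 1 - u / 2, 3 - 3 * u / 2)
    return x, u, c(x)

if __name__ == "__main__":
    base = solve_relaxed(F(0))[2]
    for eps in (F(1, 10), F(1, 100), F(1, 1000)):
        gain = solve_relaxed(eps)[2] - base
        # The ratio gain/eps should approach the multiplier 3/4 as eps shrinks.
        print(eps, gain, float(gain / eps))
```

Under these assumptions the exact gain works out to $\tfrac{3}{4}\epsilon - \tfrac{3}{16}\epsilon^2$, so the first-order estimate $\epsilon\bar{u}_4$ slightly overstates the improvement.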

>*A Lagrangian function can also be used directly in the computation of an optimal 
solution. There are a variety of ways of doing this; one is treated in detail in Sec. 15.9, and a couple of others are sketched below.

Suppose the model has a single nonlinear constraint, say, $a_1(x)$, and all the other constraints are linear. Then revise the objective function to be $[c(x) - a_1(x)u]$, and drop $a_1(x)$ from the constraints. Starting with $u = 0$, solve the model, which consists of a nonlinear objective function and only linear constraints; thus, a technique such as one in Secs. 15.2, 15.3, and 15.4 can be employed. If $u = 0$ yields a solution such that $a_1(x) \le 0$, then the solution is optimal. Otherwise, continue to increase the value of $u$ until $a_1(x) = 0$. When this occurs, you have in effect obtained a saddle point of the Lagrangian expression (21), and the resultant solution is optimal.
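
A minimal sketch of this multiplier search follows. Two liberties are taken and should be read as assumptions: the constraint moved into the objective is the linear constraint $x_1 + 2x_2 + x_3 - 4 \le 0$ of Problem P (the text has a nonlinear one in mind), and "continue to increase $u$" is carried out by bisection. The inner problem is solved in closed form because the assumed objective $c(x) = 6x_1 - 3x_1^2 + 4x_2 - 2x_2^2 + 2x_3 - \tfrac{1}{3}x_3^2$ separates by coordinate.

```python
def inner_maximizer(u):
    # Maximize c(x) - u * a(x) over x >= 0. Setting each partial derivative
    # to zero and clipping at zero is valid because each term is concave.
    return (max(0.0, 1 - u / 6), max(0.0, 1 - u / 2), max(0.0, 3 - 1.5 * u))

def a(x):
    # The constraint function moved into the objective.
    return x[0] + 2 * x[1] + x[2] - 4

def find_multiplier(lo=0.0, hi=2.0, iters=60):
    # If u = 0 already gives a feasible point, that point is optimal.
    if a(inner_maximizer(lo)) <= 0:
        return lo
    # Otherwise raise u; a(x(u)) decreases in u, so bisection locates a = 0.
    for _ in range(iters):
        mid = (lo + hi) / 2
        if a(inner_maximizer(mid)) > 0:
            lo = mid
        else:
            hi = mid
    return hi

if __name__ == "__main__":
    u = find_multiplier()
    print(u, inner_maximizer(u))
```

The upper bracket `hi = 2.0` is chosen by hand; it works here because the constraint value is already negative at $u = 2$.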

The same approach can also be applied in solving certain dynamic programming problems, such as the Distribution of Effort Model in Sec. 10.2. A problem having two constraints, for example, can be reduced to one having only a single constraint. Consequently, a dynamic programming recursion with a single state variable can be used to solve this modified version. If the decision variables must really be integers, then the final solution found by such an approach may be only approximately optimal in the original problem.

Another computational use of a Lagrangian function is in so-called small-step gradient algorithms. Such methods form a set of differential equations based on the first-order partial derivatives of the Lagrangian function with respect to each $x_j$ and $u_i$. Under certain assumptions, the associated time paths of the variables determined by the differential equations converge to an optimal solution. ◄


*Duality. Using the Lagrangian function $L(x, u)$ in (21), we state the following optimization problem, which is termed the dual problem: find an $x$ and $u \ge 0$ to

(23) minimize $L(x, u)$

subject to

(24) $\frac{\partial c(x)}{\partial x_j} - \sum_{i=1}^{M} u_i \frac{\partial A_i(x)}{\partial x_j} = 0$ for $j = 1, 2, \ldots, n$.

We can prove the following theorem.

Dual Theorem of Nonlinear Programming. Given the Constraint Qualification and the assumptions (i) through (vi) on $a_i(x)$ and $c(x)$ in Sec. 15.5, (i) if $\bar{x}$ is an optimal solution to the nonlinear programming problem, then there exists a $\bar{u} \ge 0$ such that the pair $\bar{x}$ and $\bar{u}$ solves the dual problem (23) and (24). Further, if $x$ is feasible in the original problem, and $x'$ and $u$ are feasible for the dual problem, then $c(x) \le L(x', u)$; equality holds for an optimal pair $\bar{x}$ and $\bar{u}$. (ii) Conversely, if the pair $\bar{x}$ and $\bar{u}$ solve the dual problem (23) and (24), if $A_i(\bar{x}) \le 0$ for every $i$, and if either $c(x)$ is strictly concave, or there exists a $k$ such that $A_k(x)$ is strictly convex and $\bar{u}_k > 0$, then $\bar{x}$ solves the original nonlinear programming problem. Further, $c(\bar{x}) = L(\bar{x}, \bar{u})$.
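
The inequality in part (i) can be illustrated on Problem P. The sketch restricts attention to dual points with $u = (0, 0, 0, t)$, which is an assumption made for brevity (the full dual also lets the multipliers on the nonnegativity conditions vary), and it uses the assumed objective $c(x) = 6x_1 - 3x_1^2 + 4x_2 - 2x_2^2 + 2x_3 - \tfrac{1}{3}x_3^2$. For each $t$, the dual constraint determines $x$, and the Lagrangian value is compared with the primal value at a feasible point.

```python
from fractions import Fraction as F

def c(x):
    # Assumed Problem P objective.
    x1, x2, x3 = x
    return 6 * x1 - 3 * x1**2 + 4 * x2 - 2 * x2**2 + 2 * x3 - x3**2 / 3

def dual_point(t):
    # Dual feasibility with u = (0, 0, 0, t):
    # 6(1 - x1) = t, 4(1 - x2) = 2t, 2(1 - x3/3) = t.
    return (1 - t / 6, 1 - t / 2, 3 - 3 * t / 2)

def dual_value(t):
    # L(x, u) evaluated at the dual-feasible pair for this t.
    x = dual_point(t)
    return c(x) - t * (x[0] + 2 * x[1] + x[2] - 4)

values = {t: dual_value(t) for t in (F(i, 8) for i in range(0, 17))}
primal_feasible_value = c((F(1), F(1), F(1)))   # x = (1, 1, 1) is feasible

if __name__ == "__main__":
    best_t = min(values, key=values.get)
    print(best_t, values[best_t], primal_feasible_value)
```

On this slice the smallest dual value occurs at $t = \tfrac{3}{4}$ and equals the primal optimum, which is what the equality statement of the theorem leads one to expect.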

Notice that the objective function (23) for this dual problem is not so simple as it is in the dual problem for a linear programming model. In particular, $L(x, u)$ includes the primal objective function $c(x)$. The theorem has several applications of importance. One, which is not pursued in this text because of its advanced nature, is the use of the dual formulation of a quadratic programming problem to derive several elegant computational algorithms. Another application will be taken up in Sec. 15.9.

*15.8 QUADRATIC PROGRAMMING REVISITED

The following quadratic programming problem was considered in Sec. 14.6:

(1) maximize $c(x) = \sum_{j=1}^{n} c_j x_j + \sum_{j=1}^{n} \sum_{k=1}^{n} c_{jk} x_j x_k$

subject to

(2) $\sum_{j=1}^{n} a_{ij} x_j \le b_i$ for $i = 1, 2, \ldots, m$

(3) all $x_j \ge 0$.

Assume here that the $c_{jk}$ have been defined so that $c_{jk} = c_{kj}$ for all $j$ and $k$; this convention is innocuous, for any quadratic expression can be written in this fashion (to illustrate, $8x_1x_2 - 5x_2x_1 = 1.5x_1x_2 + 1.5x_2x_1$). (Slack variables $s_i \ge 0$ were also included in the formulation of Sec. 14.6, and will be introduced again below.) Assume, for simplicity, that each $b_i > 0$. Also postulate that the double summation quadratic term on the right of (1) is a concave function. (Expressed mathematically, the assumption is that the quadratic form is negative semidefinite.)

Now we apply the necessary and sufficient conditions for an optimal solution, given by (10) and (11) in the preceding section. For this purpose,

(4) $A_i(x) = \sum_{j=1}^{n} a_{ij} x_j - b_i$ for $i = 1, 2, \ldots, m$
    $A_{m+j}(x) = -x_j$ for $j = 1, 2, \ldots, n$.

As you can verify,

(5) $\frac{\partial c(x)}{\partial x_j} = c_j + 2\sum_{k=1}^{n} c_{jk} x_k$ for $j = 1, 2, \ldots, n$.

Then (10) in the preceding section gives

(6) $\left(\sum_{j=1}^{n} a_{ij} x_j - b_i\right) u_i = 0$ for $i = 1, 2, \ldots, m$

(7) $x_j u_{m+j} = 0$ for $j = 1, 2, \ldots, n$,

and (11) yields

(8) $c_j + 2\sum_{k=1}^{n} c_{jk} x_k - \sum_{i=1}^{m} a_{ij} u_i + u_{m+j} = 0$ for $j = 1, 2, \ldots, n$.

These expressions can be simplified by introducing some additional notation. Let $s_i \ge 0$ be the slack variable for the $i$th constraint in (2), so that

(9) $\sum_{j=1}^{n} a_{ij} x_j + s_i = b_i$ for $i = 1, 2, \ldots, m$,

and let $w_j = u_{m+j}$ for $j = 1, 2, \ldots, n$. Then (6) and (7) become, after simplification,

(10) $s_i u_i = 0$ for $i = 1, 2, \ldots, m$

(11) $x_j w_j = 0$ for $j = 1, 2, \ldots, n$.

Since $u_i$, $s_i$, $x_j$, and $w_j$ must be nonnegative, (10) and (11) can be expressed by a single constraint:

(12) $\sum_{i=1}^{m} s_i u_i + \sum_{j=1}^{n} x_j w_j = 0$    Complementary slackness.

Using the newly defined variables $w_j$, (8) can be written as

(13) $2\sum_{k=1}^{n} c_{jk} x_k - \sum_{i=1}^{m} a_{ij} u_i + w_j = -c_j$ for $j = 1, 2, \ldots, n$.

An implication of the Optimality Conditions in Sec. 15.7 is that if you can find nonnegative values for $x_j$, $s_i$, $u_i$, $w_j$ such that (9), (12), and (13) are satisfied, then such $x_j$ are optimal for the quadratic programming problem. Observe that (9) and (13) represent $m + n$ linear constraints in $2(m + n)$ nonnegative variables. Of course, (12) is not a linear constraint, but it is so special that numerous techniques can be devised that are capable of fully exploiting the linearity of the other necessary and sufficient conditions for an optimal solution. One such algorithm, to be presented next, is based on fairly simple modifications of the simplex method, and converges in a finite number of iterations.

Modified Simplex Algorithm. Before stating the procedure, we first establish some helpful nomenclature. Denote $x_j$ and $s_i$ as primal variables and $w_j$ and $u_i$ as dual variables. Designate as complementary variables each pair $x_j$ and $w_j$ and, likewise, each pair $s_i$ and $u_i$. In this method, the linear constraints consist of (9) and (13), and at each iteration, there is a basic solution containing $m + n$ variables. Every primal variable in the basis is nonnegative, so, as a consequence, the original constraints (2) are always satisfied. As in the simplex method, the iterations do not terminate so long as any dual variable is negative.

A basic solution at any iteration is always one of two types: the first possibility is that for each $j$ and each $i$, the basis contains only one complementary variable. Such a solution is called a standard basis, and satisfies the complementary slackness constraint (12). The other possibility is that the solution contains a basic pair of complementary variables, and as a result there is also a nonbasic pair of complementary variables. Such a solution is called a nonstandard basis. (Some authors use the terms complementary basis and almost complementary basis as synonyms for "standard" and "nonstandard.") The principal modification of the simplex method is that whenever a nonstandard basis occurs, the selection procedure for entering a new variable into the basis seeks to re-establish the complementary slackness condition (12).

Straightforward algebraic manipulation using the constraints (9) and (13) yields the following formula for $c(x)$, which is valid whenever the complementary slackness condition (12) holds:

(14) $c(x) = \tfrac{1}{2}\sum_{j=1}^{n} c_j x_j + \tfrac{1}{2}\sum_{i=1}^{m} b_i u_i$    Standard basis.

Letting

(15) $x_0 = 2c(x)$,

you can append the linear relation

(16) $x_0 - \sum_{j=1}^{n} c_j x_j - \sum_{i=1}^{m} b_i u_i = 0$

to the set of constraints to provide the value of $2c(x)$ at a standard basis.

In explaining the specific rules, we illustrate the process using Problem P from Sec. 15.1. The linear constraints consisting of (16), (9), and (13) are

(17)
$x_0 - 6x_1 - 4x_2 - 2x_3 - 4u_1 = 0$    Row 0
$x_1 + 2x_2 + x_3 + s_1 = 4$    Row 1
$-6x_1 - u_1 + w_1 = -6$    Row 2
$-4x_2 - 2u_1 + w_2 = -4$    Row 3
$-\tfrac{2}{3}x_3 - u_1 + w_3 = -2$    Row 4.

Let the initial solution contain the variables $x_0$, $s_1$, $w_1$, $w_2$, $w_3$, which is a standard basis. Then apply the following criterion.

Modified Simplex Criterion I. (a) For a standard basis, terminate the iterations if all the dual variables are nonnegative. Otherwise, select the nonbasic primal variable that is complementary to the most negative basic dual variable. (b) For a nonstandard basis, select the dual variable of the nonbasic pair.

In (17), since $w_1$ is the most negative dual variable, select $x_1$ to enter the basis.

In determining the variable to drop from the basis, it is desirable to maintain a standard basic solution, or if none exists, to establish one. But since the method maintains feasibility in terms of the primal variables, it is not always possible to maintain or establish a standard basic solution. The following criterion for selecting a variable to drop from the basis takes account of these considerations.

Modified Simplex Criterion II. (a) Take the ratios of the current right-hand side to the coefficients of the entering variable in those rows corresponding to a primal basic variable (ignore ratios with zero or negative numbers in the denominator). Let $r$ be the value of the minimum ratio, which occurs for a primal basic variable in Row $k$. (b) For a standard basis, take the ratio of the current right-hand side to the coefficient of the entering primal variable in the row of the corresponding complementary dual variable. If this ratio is positive and does not exceed $r$, then drop the complementary dual variable from the basis. Otherwise, drop the basic primal variable in Row $k$. (c) For a nonstandard basis, take the ratio of the current right-hand side to the coefficient of the entering unknown in the row corresponding to the dual variable of the basic pair. If this ratio is positive and does not exceed $r$, then drop this dual variable from the basis. Otherwise, drop the primal basic variable in Row $k$.

Since $x_1$ is to enter, and the current solution (17) is a standard basis, part (a) indicates $r = 4/1$, and part (b) indicates the ratio $-6/-6$. Therefore, the complementary dual variable $w_1$ leaves the basis, and the next solution is a standard basis. The pivot operation is exactly the same as in the ordinary simplex method. Verify that the constraints, after pivoting out $w_1$ and entering $x_1$, are:

(18)
$x_0 - 4x_2 - 2x_3 - 3u_1 - w_1 = 6$    Row 0
$2x_2 + x_3 - \tfrac{1}{6}u_1 + s_1 + \tfrac{1}{6}w_1 = 3$    Row 1
$x_1 + \tfrac{1}{6}u_1 - \tfrac{1}{6}w_1 = 1$    Row 2
$-4x_2 - 2u_1 + w_2 = -4$    Row 3
$-\tfrac{2}{3}x_3 - u_1 + w_3 = -2$    Row 4.

Now Criterion I implies that $x_2$ enters the basis, since $w_2$ is the most negative dual variable in (18). Therefore, according to Criterion II, $r = 3/2$ and the ratio in part (b) is $-4/-4$, so that $w_2$ leaves the basis. Once again, the next solution is a complementary basis, and a so-called complementary pivot operation occurs, since a primal variable enters and a complementary dual variable leaves the basis.

Check that the constraints after pivoting are

(19)
$x_0 - 2x_3 - u_1 - w_1 - w_2 = 10$    Row 0
$x_3 - \tfrac{7}{6}u_1 + s_1 + \tfrac{1}{6}w_1 + \tfrac{1}{2}w_2 = 1$    Row 1
$x_1 + \tfrac{1}{6}u_1 - \tfrac{1}{6}w_1 = 1$    Row 2
$x_2 + \tfrac{1}{2}u_1 - \tfrac{1}{4}w_2 = 1$    Row 3
$-\tfrac{2}{3}x_3 - u_1 + w_3 = -2$    Row 4.

Here Criterion I specifies that $x_3$ is to enter. Accordingly, $r = (1/1)$ from Row 1, and the ratio in part (b) of Criterion II is $-2/-\tfrac{2}{3} = 3$, which exceeds $r$, so $s_1$ leaves the basis. Consequently, the next solution is a nonstandard basis.

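After pivoting, you have

(20)
$x_0 - \tfrac{10}{3}u_1 + 2s_1 - \tfrac{2}{3}w_1 = 12$    Row 0
$x_3 - \tfrac{7}{6}u_1 + s_1 + \tfrac{1}{6}w_1 + \tfrac{1}{2}w_2 = 1$    Row 1
$x_1 + \tfrac{1}{6}u_1 - \tfrac{1}{6}w_1 = 1$    Row 2
$x_2 + \tfrac{1}{2}u_1 - \tfrac{1}{4}w_2 = 1$    Row 3
$-\tfrac{16}{9}u_1 + \tfrac{2}{3}s_1 + \tfrac{1}{9}w_1 + \tfrac{1}{3}w_2 + w_3 = -\tfrac{4}{3}$    Row 4.

The basic variables in (20) are $x_0$, $x_1$, $x_2$, $x_3$, $w_3$; the nonbasic pair of complementary variables consists of $s_1$ and $u_1$. Consequently, part (b) of Criterion I indicates that $u_1$ enters. Accordingly, $r = \text{minimum}\,(1/\tfrac{1}{6},\ 1/\tfrac{1}{2}) = 2$, and the ratio in part (c) is $(-\tfrac{4}{3}/-\tfrac{16}{9}) = \tfrac{3}{4}$; hence, $w_3$ leaves the basis, thereby restoring the next solution to a standard basis.

The equations, after pivoting, are

(21)
$x_0 + \tfrac{3}{4}s_1 - \tfrac{7}{8}w_1 - \tfrac{5}{8}w_2 - \tfrac{15}{8}w_3 = 14\tfrac{1}{2}$    Row 0
$x_3 + \tfrac{9}{16}s_1 + \tfrac{3}{32}w_1 + \tfrac{9}{32}w_2 - \tfrac{21}{32}w_3 = \tfrac{15}{8}$    Row 1
$x_1 + \tfrac{1}{16}s_1 - \tfrac{5}{32}w_1 + \tfrac{1}{32}w_2 + \tfrac{3}{32}w_3 = \tfrac{7}{8}$    Row 2
$x_2 + \tfrac{3}{16}s_1 + \tfrac{1}{32}w_1 - \tfrac{5}{32}w_2 + \tfrac{9}{32}w_3 = \tfrac{5}{8}$    Row 3
$u_1 - \tfrac{3}{8}s_1 - \tfrac{1}{16}w_1 - \tfrac{3}{16}w_2 - \tfrac{9}{16}w_3 = \tfrac{3}{4}$    Row 4.

The iterations terminate, since the current solution is a standard basis with nonnegative values for all the dual variables. The optimal value of $c(x)$ is $.5x_0 = 7.25$.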
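
The worked example can be replayed in exact arithmetic. The sketch below is one reading of Criteria I and II, specialized to the tableau of Problem P; the data layout and function names are assumptions, and no claim is made about degenerate cases or about problems with a different structure (for instance, it presumes that a nonstandard basis has exactly one basic pair and one nonbasic pair). The initial rows are the five constraint rows of the example, with $-\tfrac{2}{3}$ as the coefficient of $x_3$ in Row 4.

```python
from fractions import Fraction as F

COLS = ["x0", "x1", "x2", "x3", "s1", "u1", "w1", "w2", "w3"]
PAIRS = [("x1", "w1"), ("x2", "w2"), ("x3", "w3"), ("s1", "u1")]  # (primal, dual)
DUAL_OF = dict(PAIRS)                      # primal -> complementary dual
PRIMAL_OF = {d: p for p, d in PAIRS}       # dual -> complementary primal

def initial_tableau():
    # Rows 0-4 of the starting system; the last entry is the right-hand side.
    rows = [
        [1, -6, -4, -2, 0, -4, 0, 0, 0, 0],
        [0, 1, 2, 1, 1, 0, 0, 0, 0, 4],
        [0, -6, 0, 0, 0, -1, 1, 0, 0, -6],
        [0, 0, -4, 0, 0, -2, 0, 1, 0, -4],
        [0, 0, 0, F(-2, 3), 0, -1, 0, 0, 1, -2],
    ]
    return [[F(v) for v in row] for row in rows], ["x0", "s1", "w1", "w2", "w3"]

def pivot(T, basis, r, name):
    # Ordinary simplex pivot: make column `name` basic in row r.
    col = COLS.index(name)
    p = T[r][col]
    T[r] = [v / p for v in T[r]]
    for i in range(len(T)):
        if i != r:
            f = T[i][col]
            T[i] = [a - f * b for a, b in zip(T[i], T[r])]
    basis[r] = name

def solve():
    T, basis = initial_tableau()
    while True:
        basic_pairs = [(p, d) for p, d in PAIRS if p in basis and d in basis]
        if not basic_pairs:
            # Criterion I(a): standard basis.
            duals = [(T[i][-1], b) for i, b in enumerate(basis) if b in PRIMAL_OF]
            if not duals or min(duals)[0] >= 0:
                return T, basis
            check = min(duals)[1]          # most negative basic dual variable
            enter = PRIMAL_OF[check]
        else:
            # Criterion I(b): nonstandard basis, enter the dual of the nonbasic pair.
            enter = next(d for p, d in PAIRS if p not in basis and d not in basis)
            check = basic_pairs[0][1]      # dual variable of the basic pair
        col = COLS.index(enter)
        # Criterion II(a): minimum ratio over rows of primal basic variables.
        ratios = [(T[i][-1] / T[i][col], i)
                  for i, b in enumerate(basis) if b in DUAL_OF and T[i][col] > 0]
        r_min, k = min(ratios) if ratios else (None, None)
        # Criterion II(b)/(c): ratio in the row of the dual variable `check`.
        row = basis.index(check)
        coef = T[row][col]
        q = T[row][-1] / coef if coef != 0 else None
        if q is not None and q > 0 and (r_min is None or q <= r_min):
            pivot(T, basis, row, enter)
        elif k is not None:
            pivot(T, basis, k, enter)
        else:
            raise ValueError("no admissible pivot row")

if __name__ == "__main__":
    T, basis = solve()
    print({name: row[-1] for name, row in zip(basis, T)})
```

The variable `x0` belongs to neither set of complementary variables, so Row 0 is never chosen as a pivot row and simply carries the value of $2c(x)$ along.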

A full comparison of the computational merits of this algorithm and the one in 
Sec. 14.6 goes beyond the scope of this text, as it involves a number of technical 
details relating to the development of an efficient computer program. But briefly, 
the tradeoff seems to be that the algorithm in this section converges in fewer 
iterations at the expense of maintaining a system of m + n equations throughout. 
The technique in Sec. 14.6 has the advantage of always converging to a constrained 
stationary point, even when the objective function is not concave; the algorithm 
here does not always converge for an arbitrarily shaped quadratic objective function.

A number of other quadratic programming algorithms that have been devised 
resemble the Modified Simplex Algorithm, in that they too give special attention 
to complementary pivot operations and converge in a finite number of iterations. 


15.9 PENALTY FUNCTION APPROACH 

In Sec. 15.7, you saw that a nonlinear programming problem can be posed in 
terms of finding a saddle point for a Lagrangian function. The Penalty Function 
Approach stems from that kind of formulation. In the method, several (or all) 
of the constraints are incorporated into the objective function. There are several 
ways to do this; we selected the approach below because it has been successful 
in practice. We continue to assume that the nonlinear problem satisfies the assumptions (i) through (vi) and the Constraint Qualification in Sec. 15.5.

We begin the explanation by showing how to combine a single $a_i(x)$ with the objective function. This simple case will serve to motivate the general approach, which is treated afterwards. Specifically, consider as the modified function to be maximized:

(1) $c(x \mid r) = c(x) + r\left[\frac{1}{a_i(x)}\right]$,

where $r > 0$ is a parameter. For those $x$ such that $a_i(x) < 0$, the function $[1/a_i(x)]$ is concave, and therefore $c(x \mid r)$ is concave for $x$ strictly in the interior of the region defined by $a_i(x) \le 0$. The term $r[1/a_i(x)]$ becomes a very negative number for a feasible point that is near the boundary $a_i(x) = 0$. Consequently, a point that maximizes $c(x \mid r)$ subject to all the constraints can never be on the boundary $a_i(x) = 0$. If the value of the parameter $r$ is close enough to 0, however, it can be shown that such a point is nearly optimal for the original problem.

To apply the Penalty Function Approach, you combine several (or all) of the constraints with the objective function in a fashion analogous to (1). Then you obtain a feasible solution that is optimal for the modified objective function, given an initial value for $r$. You successively reduce the value of $r$ ($> 0$), each time finding a new solution to the modified problem. You terminate the algorithm when the expression involving $r$ has negligible effect on $c(x \mid r)$, and accept the resultant solution as approximately optimal. If the original model contains the constraint $f_i(x) \le 0$, and you suspect that an optimal solution actually does lie on the boundary $f_i(x) = 0$, then you can define $a_i(x) = f_i(x) - \epsilon$, where you choose $\epsilon$ to be a small positive number. Such an approach, however, can lead to a point where $f_i(x) > 0$, if you chose $\epsilon$ too large.
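
A one-variable toy problem shows the effect of shrinking $r$. The problem is an assumption chosen for illustration and does not come from the text: maximize $c(x) = x$ subject to $a(x) = x - 1 \le 0$, whose optimum lies on the boundary at $x = 1$. The modified function $x + r/(x - 1)$ is concave for $x < 1$, so a ternary search (a stand-in for whatever unconstrained method one prefers) finds its maximizer, which can be compared with the closed form $1 - \sqrt{r}$.

```python
import math

def penalized(x, r):
    # c(x | r) = c(x) + r / a(x) with c(x) = x and a(x) = x - 1 (toy problem).
    return x + r / (x - 1.0)

def maximize_concave(f, lo, hi, iters=200):
    # Ternary search; valid here because f is concave on the interval.
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2

def penalty_path(r_values, lo=-5.0, hi=1.0 - 1e-9):
    # Re-solve the modified problem for each successively smaller r.
    return [(r, maximize_concave(lambda x, r=r: penalized(x, r), lo, hi))
            for r in r_values]

if __name__ == "__main__":
    for r, x_r in penalty_path([1.0, 0.1, 0.01, 0.0001]):
        # Each maximizer stays strictly inside the constraint and nears 1 as r shrinks.
        print(r, x_r, 1 - math.sqrt(r))
```

Every maximizer stays strictly below the boundary, which is the property that lets the method treat each subproblem as unconstrained.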

What makes this type of approach computationally attractive? Since a trial point is never on a nonlinear boundary, at each iteration you can calculate directions for improvement as if the optimization process were for an unconstrained objective function, and for this reason, you may view the method as a sequential unconstrained maximization technique (the approach is sometimes referred to by the acronym SUMT). After computing a set of directions, you next calculate an optimal feasible step size. You are aided in this optimization by the fact that the modified objective function is strictly concave in the interior of the feasible region. When actually applying the approach, you find only a near optimum to $c(x \mid r)$ for each $r$, since ordinarily you cannot calculate an exact optimum in a finite number of iterations. The complete details of the method are given below.

The method, as it will be described, combines all the constraints, including the
nonnegativity conditions, into a modified objective function. But note that if the
constraints contain only a few nonlinear functions, the rest being linear, then the
Penalty Function Approach can be coupled with any technique for optimizing a
nonlinear function subject to linear constraints. Specifically, the objective function
can be modified by introducing only the nonlinear constraints; then a method,
such as one of those discussed in Secs. 15.2, 15.3, and 15.4, can be employed for
successively smaller values of $r > 0$. Each resultant solution is feasible in the original
problem, and also is approximately optimal for $r$ sufficiently small.

SUMT Algorithm. Assume that all the constraints are to be added to
the objective function, including the nonnegativity conditions $x_j \ge 0$. Accordingly,
designate the constraints as $A_i(x) \le 0$ for $i = 1, 2, \ldots, M$ ($= m + n$).
The modified problem is to maximize

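(2)    $c(x \mid r) = c(x) + r \left[ \sum_{i=1}^{m} \frac{1}{A_i(x)} + \sum_{i=m+1}^{M} \frac{1}{A_i(x)} \right] \quad (r > 0)$

over all feasible $x$. The second summation on the right of (2) includes the terms
for the nonnegativity conditions, so that $c(x \mid r)$ is strictly concave and possesses
a unique optimal solution for each $r$. The bracketed term on the right of (2) is
negative for all feasible $x$, so that $c(x \mid r) < c(x)$.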

The procedure in detail is:

Step 1. Select an initial value $r^1 > 0$, and an initial trial point $x^0$ such that each
$A_i(x^0) < 0$. Let $k = 0$ and $n = 1$.

Step 2. If $x^k$ nearly maximizes $c(x \mid r^n)$, go to Step 4. Otherwise, determine directions
$d_j^k$ for $j = 1, 2, \ldots, n$. Choose the step size $t^k$ that maximizes $c(x^k + t d^k \mid r^n)$,
where the point $(x^k + t^k d^k)$ must satisfy all the constraints. Continue to Step 3.

Step 3. Calculate the new trial point

(3)    $x_j^{k+1} = x_j^k + t^k d_j^k$ for $j = 1, 2, \ldots, n$.

Increase $k$ to $k + 1$, and return to Step 2.

Step 4. Terminate the iterations if $x^k$ is nearly optimal. Otherwise, calculate a
new $r^{n+1}$, where $0 < r^{n+1} < r^n$. Increase $n$ to $n + 1$, and return to Step 2.


Effective ways of performing Step 1 are mainly of interest to operations research
specialists, and thus are not discussed at length in this text. But we do mention
what is at issue with regard to the choice of $r^1$. If $r^1$ is too large, then an optimal
solution to $c(x \mid r^1)$ is likely to be far from an optimal solution of the original problem.
On the other hand, if $r^1$ is too small, then finding a near maximum of $c(x \mid r^1)$
usually requires a considerable number of iterations. A proper choice of $r^1$ strikes
a balance between these two effects.

You can employ any procedure for selecting directions in Step 2 that is effective
for an unconstrained problem. Because $c(x \mid r)$ is strictly concave for feasible $x$,
the conditions

(4)    $\frac{\partial c(x \mid r)}{\partial x_j} = \frac{\partial c(x)}{\partial x_j} - r \sum_{i=1}^{M} \frac{\partial A_i(x)/\partial x_j}{[A_i(x)]^2} = 0$ for each $j$

are necessary and sufficient for a feasible $x$ to maximize $c(x \mid r)$. If you select the
directions according to the method of steepest ascent, described in Sec. 14.5,
then each $d_j = \partial c(x \mid r)/\partial x_j$. (Practitioners have found it beneficial to actually
use second-order methods.)

A formula for reducing $r$ in Step 4 that works well is $r^{n+1} = f r^n$, where $0 < f < 1$.
The particular value for $f$ does not seem to be very critical, except that as $f$ is
made small, more iterations are required to optimize $c(x \mid r^n)$ for each $n$.
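
Steps 1 through 4 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used to produce the figures in this section: the test problem is hypothetical, the directions are those of steepest ascent, and the optimal step size of Step 2 is replaced by a simple halving rule that accepts the first step that improves the modified objective. The names `sumt`, `modified`, and `gradient` are illustrative choices.

```python
def sumt(c, grad_c, cons, grad_cons, x0, r1=1.0, f=0.5, outer=20, inner=500):
    """Sequential unconstrained maximization with a 1/A_i(x) penalty.

    Every cons[i](x) must be negative at each trial point (interior points only).
    """
    def modified(x, r):
        values = [A(x) for A in cons]
        if any(v >= 0.0 for v in values):
            return float("-inf")              # reject points on or past a boundary
        return c(x) + r * sum(1.0 / v for v in values)

    def gradient(x, r):
        g = list(grad_c(x))
        for A, dA in zip(cons, grad_cons):
            a, da = A(x), dA(x)
            for j in range(len(g)):
                g[j] -= r * da[j] / (a * a)   # penalty term of the gradient
        return g

    x, r = list(x0), r1
    for _ in range(outer):                    # Step 4: loop over decreasing r
        for _ in range(inner):                # Steps 2 and 3 for a fixed r
            d = gradient(x, r)
            if sum(dj * dj for dj in d) < 1e-12:
                break                         # nearly stationary for this r
            t, base = 1.0, modified(x, r)
            while t > 1e-16:                  # halve t until the point improves
                trial = [xj + t * dj for xj, dj in zip(x, d)]
                if modified(trial, r) > base:
                    x = trial
                    break
                t *= 0.5
            else:
                break                         # no improving step was found
        r *= f                                # reduce r by the factor f
    return x


# Hypothetical test problem (not from the text):
# maximize -(x1 - 2)^2 - (x2 - 2)^2
# subject to x1 + x2 - 2 <= 0, -x1 <= 0, -x2 <= 0; the optimum is (1, 1).
c = lambda x: -(x[0] - 2.0) ** 2 - (x[1] - 2.0) ** 2
grad_c = lambda x: [-2.0 * (x[0] - 2.0), -2.0 * (x[1] - 2.0)]
cons = [lambda x: x[0] + x[1] - 2.0, lambda x: -x[0], lambda x: -x[1]]
grad_cons = [lambda x: [1.0, 1.0], lambda x: [-1.0, 0.0], lambda x: [0.0, -1.0]]

x_final = sumt(c, grad_c, cons, grad_cons, x0=[0.5, 0.5])
print(x_final)
```

The trial points stay strictly inside the feasible region and approach the boundary point (1, 1) only as $r$ shrinks, which is the behavior the text describes.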

The method ensures that $c(x \mid r)$ increases at each new trial point, and it can be
proved that the maximum value of $c(x \mid r)$ converges to the optimum value $c$
for the original problem as $r$ approaches 0. Furthermore, if $x^k$ truly maximizes
$c(x \mid r^n)$ when Step 4 is reached, then $c(x^k)$ is at least as large as it was at the previous
Step 4, and usually is strictly better. Similarly, the bracketed term on the right
of (2) is never larger than it was at the previous Step 4, and an upper bound on $c$ is

(5)    $c \le c(x^k) - r^n \left[ \sum_{i=1}^{M} \frac{1}{A_i(x^k)} \right]$.

As we mentioned earlier, in practice $x^k$ will not truly maximize $c(x \mid r^n)$ when
Step 4 is reached. So in applying the algorithm, the foregoing properties will
actually be fulfilled only if $c(x^k \mid r^n)$ is sufficiently close to its maximum value.
Observing the same warning, you can compare the value for the expression on the
right-hand side of (5) to $c(x^k)$ in deciding whether to terminate the iterations
at Step 4. A final word of caution about applying the approach: you must perform
the calculations with care, because as both $r$ and the denominators of the bracketed
terms in (2) get small, the arithmetic becomes quite delicate and sensitive to
round-off errors.


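► The validity of (5) is based on the duality result in Sec. 15.7. Let $x$ be a truly optimal
solution to $c(x \mid r^n)$. Then, according to (4), $x$ must satisfy

(i)    $\frac{\partial c(x)}{\partial x_j} - r^n \sum_{i=1}^{M} \frac{\partial A_i(x)}{\partial x_j} \cdot \frac{1}{[A_i(x)]^2} = 0$ for each $j$.

Let

(ii)    $u_i = \frac{r^n}{[A_i(x)]^2}$ for $i = 1, 2, \ldots, M$,

and therefore, $x$ and $u$ satisfy (24) in Sec. 15.7. Hence, from the Dual Theorem of
Nonlinear Programming,

(iii)    $c \le L(x, u) = c(x) - r^n \left[ \sum_{i=1}^{M} \frac{1}{A_i(x)} \right]$,

in agreement with (5). ◄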


Examples. The results for Problems P and Q from Secs. 15.1 and 15.5
are shown in Figs. 15.8 and 15.9. At each $r^n$, the nearly optimal value for $c(x \mid r^n)$
was found by using the method of steepest ascent; that is, the directions were
chosen to be the first-order partial derivatives of $c(x \mid r^n)$ at $x^k$. The values of $k$
shown in the figures correspond to those of $x^k$ that are deemed nearly optimal

f = .5

n                r^n      k    c(x^k)   x_1^k   x_2^k   x_3^k   Upper Bound   c(x^k | r^n)
—                         0    1.147    .1      .1      .1      —             —
1                .0365    14   7.091    .854    .595    1.755   7.397         6.785
2                .0182    19   7.136    .872    .606    1.770   7.323         6.950
3                .0091    22   7.168    .877    .619    1.779   7.285         7.051
4                .0046    25   7.189    .881    .628    1.786   7.262         7.115
5                .0023    27   7.203    .883    .634    1.790   —             7.157
10               .0001    34   7.239    .890    .650    1.801   —             7.232
Optimal Values   —        —    7.25     .875    .625    1.875   —             7.25


Figure 15.8. SUMT Algorithm—Problem P. 

f = .5

n                r^n      k    c(x^k)   x_1^k   x_2^k   x_3^k   Upper Bound   c(x^k | r^n)
—                         0    -6       1       1       3       —             —
1                .3       15   -4.679   .935    .776    2.193   -3.120        -6.239
2                .15      23   -4.474   .913    .724    2.113   -3.553        -5.396
3                .075     29   -4.332   .897    .685    2.064   -3.774        -4.890
4                .0375    31   -4.240   .884    .656    2.043   -3.899        -4.581
5                .0188    33   -4.171   .874    .634    2.028   -3.955        -4.387
6                .0094    36   -4.119   .868    .618    2.015   -3.975        -4.263
7                .0047    39   -4.085   .866    .609    2.001   -3.989        -4.182
Optimal Values   —        —    -4.00    .875    .625    1.875   —             -4.00


Figure 15.9. SUMT Algorithm—Problem Q.


in Step 2. Notice that the biggest improvement in the objective function $c(x)$ occurs
in maximizing $c(x \mid r^1)$.
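
The Upper Bound column of Fig. 15.8 can be recovered from the two tabulated objective values alone. The short Python check below assumes that the penalty term enters the modified objective with a plus sign and enters the bound (5) with a minus sign, so that the bound equals $c(x^k)$ minus the difference $c(x^k \mid r^n) - c(x^k)$. The function name `upper_bound` is an illustrative choice.

```python
def upper_bound(c_xk, c_xk_given_r):
    """Bound (5), computed from c(x^k) and c(x^k | r^n)."""
    penalty = c_xk_given_r - c_xk      # r^n times the bracketed sum; negative
    return c_xk - penalty


# (c(x^k), c(x^k | r^n), tabulated bound) for n = 1 and n = 3 in Fig. 15.8
rows = [(7.091, 6.785, 7.397), (7.168, 7.051, 7.285)]
for c_xk, c_mod, tabulated in rows:
    print(round(upper_bound(c_xk, c_mod), 3), tabulated)
```

The same arithmetic applies to the rows of Fig. 15.9; small discrepancies in the last digit are to be expected because the tabulated values are rounded.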

► In Problem P, using the same initial point $x^0$ but letting $f = .75$ slows the convergence
somewhat. In this case, $r^{10} = .0027$ at $n = 10$; the corresponding nearly optimal solution
to $c(x \mid r^{10})$ is $x^k = (.8813, .6372, 1.7873)$ with $c(x^k) = 7.2043$ and $c(x^k \mid r^{10}) = 7.1473$.
The result is comparable to $n = 5$ and $k = 27$ in Fig. 15.8.

Using a better initial point $x^0 = (.5, .5, .5)$ with $c(x^0) = 4.6667$ but a larger $r^1 = .6068$
(and $f = .5$) yields surprising results. Although the initial trial point is better than
that in Fig. 15.8, at $n = 5$ and $k = 44$ the trial solution is $x^{44} = (.8748, .6246, 1.6646)$
with $c(x^{44}) = 7.0767$ and $c(x^{44} \mid r^5 = .0379) = 6.7704$. The larger value for $r^1$ results in
many more iterations to obtain a result comparable to $n = 1$ and $k = 14$ in Fig. 15.8.

Using a value of 4.1 instead of 4 in specifying the linear constraint yields, at $n = 3$
and $k = 33$, the trial solution $x^{33} = (.8869, .6584, 1.7775)$ with $c(x^{33}) = 7.2301$. This
point does not differ much from the solution in Fig. 15.8.

In Problem Q, using $r^1$ equal to either .1 or .2 has a negligible effect on the $k$ required
to give an answer comparable to $n = 7$ and $k = 39$ in Fig. 15.9. ◄


15.10 GENERALIZED PROGRAMMING 
ALGORITHM 

This method has much in common with the Separable Programming Approach
in Sec. 14.7, since it employs linear approximations with a grid of points, and uses
the simplex method to compute a corresponding solution. But the Generalized
Programming Algorithm is more sophisticated, in that it allows the grid to be
refined during the course of the iterations. This approach can be formulated in
a variety of ways; hence, you should view the presentation below as merely illustrative
of the general idea. We continue to postulate that the nonlinear functions
satisfy assumptions (i) through (vi) and the Constraint Qualification in Sec. 15.5.

To begin, suppose we partition the entire group of nonlinear and nonnegativity
constraints into two categories. Let the first partition contain $R$ constraints, to be
designated as

(1)    $A_i(x_1, x_2, \ldots, x_n) \le 0$ for $i = 1, 2, \ldots, R$;

some of these restrictions may be the nonnegativity constraints. Possibly (1)
may contain all the constraints, in which case $R = m + n$; then the second category
would be vacuous. When $R < m + n$, the region satisfying the second
category of constraints is convex and contains all of its boundary points, given
assumption (i) imposed on the constraints in Sec. 15.5. For convenience, let the
symbol

$S$ = set of all points satisfying those constraints that are not contained in (1).

(When $R = m + n$, $S$ is all of Euclidean $n$-space.) Hence, the optimization problem
can be stated as

(2)    maximize $c(x_1, x_2, \ldots, x_n)$

subject to the point $x$ satisfying (1) and lying in $S$.

Consider iteration $k$ of the algorithm. Assume that the current approximating
grid is $X^0, X^1, \ldots, X^T$, where each of these points satisfies the constraints in $S$,
and at least one of them is feasible in (1), with a strict inequality holding for each
truly nonlinear $A_i(x)$. (Recall from Sec. 15.5 that we postulated the existence of
such a point in the Constraint Qualification.) With this grid, you solve what is
called a restricted master program representing a linear approximation
to the original model:

(3)    maximize $\sum_{t=0}^{T} c(X^t) w_t$

subject to

(4)    $\sum_{t=0}^{T} A_i(X^t) w_t \le 0$ for $i = 1, 2, \ldots, R$

(5)    $\sum_{t=0}^{T} w_t = 1$ and each $w_t \ge 0$.

The adjective "restricted" refers to the fact that (3), (4), and (5) do not contain
every possible $X^t$ that is feasible in $S$, but only the subset of $T + 1$ grid points.
Each $w_t$ represents a weight for the point $X^t$. The restricted master program is
a linear programming problem, and has a basic feasible solution and a finite
maximum value $c^k$. (State why.) Because the $A_i(x)$ are convex functions and the
set $S$ is convex, it is easy to show that any feasible solution to (4) and (5) yields
a point

(6)    $x_j^k = \sum_{t=0}^{T} X_j^t w_t$ for each $j$

that is itself feasible in the original problem.
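
The feasibility claim can be written out in one line. The derivation below is a sketch for a generic constraint index $i \le R$; it uses only the convexity of $A_i$, the weight conditions in (5), and the inequality (4), and writes $x^k$ for the weighted average of the grid points.

```latex
A_i(x^k) \;=\; A_i\Bigl(\sum_{t=0}^{T} X^t w_t\Bigr)
\;\le\; \sum_{t=0}^{T} w_t\, A_i(X^t) \;\le\; 0 .
```

The first inequality is the definition of convexity applied to weights that are nonnegative and sum to 1; the second is (4). The point also lies in $S$, because each $X^t$ does and $S$ is convex.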

Consider next the dual problem corresponding to (3), (4), and (5), namely

(7)    minimize $y_{R+1}$

subject to

(8)    $\sum_{i=1}^{R} A_i(X^t) y_i + y_{R+1} \ge c(X^t)$ for $t = 0, 1, \ldots, T$

(9)    $y_i \ge 0$ for $i = 1, 2, \ldots, R$, and $y_{R+1}$ unrestricted in sign.

As explained below, reference to the dual yields a test of whether an optimal
solution to the restricted master program is also optimal for the original problem.

Assume that $w_t^k$ is optimal for the restricted master program (3), (4), and (5),
and that the corresponding $x^k$ is found by calculating the weighted average (6).
Let $y^k$ be an associated optimal solution to the dual problem. It is easily shown that
if

(10)    $\sum_{i=1}^{R} A_i(x) y_i^k + y_{R+1}^k \ge c(x)$ or equivalently $y_{R+1}^k \ge c(x) - \sum_{i=1}^{R} A_i(x) y_i^k$

for all $x$ in $S$, then $x^k$ is truly optimal in the original problem. But if there exists
an $X^{k+1}$ in $S$ such that

(11)    $y_{R+1}^k < c(X^{k+1}) - \sum_{i=1}^{R} A_i(X^{k+1}) y_i^k$,

then adding this new point to the restricted master program may result in an
improved solution. Finding whether such an $X^{k+1}$ exists is referred to as the
subprogram. The effectiveness of the Generalized Programming Algorithm
depends significantly on the difficulty of computing a solution to the subprogram.
This aspect will be discussed in more detail later in the section.

To summarize, the procedure at iteration $k$ is

Step 1. Find an optimal solution to the restricted master program. The associated
trial point $x^k$ can be calculated according to (6).

Step 2. Determine whether there is a point $X^{k+1}$ in $S$ such that the inequality
in (11) holds. If not, terminate the iterations; $x^k$ is truly optimal. Otherwise,
continue to Step 3.

Step 3. Add the point $X^{k+1}$ to the restricted master program by introducing
a new weight $w_{k+1}$ having the coefficients $c(X^{k+1})$ in (3), $A_i(X^{k+1})$ in (4), and 1 in
(5). Increase $k$ to $k + 1$, and return to Step 1.

It can be proved that the optimal values $c^k$ for the objective function of the
restricted master program increase monotonically to the optimal value $c$ for the
objective function of the original problem. And since $c(x)$ is concave, the value
$c(x^k)$ is at least as large as $c^k$.

Solution of the subprogram. Determining an $X^{k+1}$ satisfying the
inequality in (11) can itself be posed as an optimization problem: find an $x$ in $S$
to maximize

(12)    $c(x) - \sum_{i=1}^{R} A_i(x) y_i^k$.

For ease of exposition, assume throughout the remainder of this section that
the maximum in (12) is finite. (When this assumption is not justified, the procedure
can be modified in a straightforward fashion.) Verify that (12) is a concave function,
since $y_i^k \ge 0$ for all $i \le R$. If all the original constraints have been included
in the $R$ restrictions, then (12) is to be solved as an unconstrained maximization
problem.

Given that $X^{k+1}$ is a solution to (12), you can state that

(13)    $c^k \le c \le c(X^{k+1}) - \sum_{i=1}^{R} A_i(X^{k+1}) y_i^k$,

where, as above, $c^k$ is the maximum value of the objective function for the restricted
master program. Further, the algorithm can be terminated at iteration $k + 1$
if $w_{k+1} = 1$ is optimal. (Explain why.) But note that $X^{k+1}$ will not usually be
feasible for all the $R$ constraints, and accordingly, $w_{k+1} = 1$ will not be feasible in
the restricted master program. In general, then, the algorithm does not converge
in a finite number of iterations; you can terminate the calculations when the
current value $c(x^k)$ is sufficiently close to the lowest obtained value of the expression
on the right-hand side of (13).

Observe that the expression (12) has the same form as the Lagrangian function
(21) in Sec. 15.7 (p. 603). Indeed, when $R$ contains all $m + n$ constraints, the
Generalized Programming Algorithm can be viewed as a procedure that systematically
searches for a saddle point of a Lagrangian function. Loosely put, it alternates
between finding an improved trial solution, given trial values for the
Lagrange multipliers, and revising the values of the Lagrange multipliers, consistent
with the current trial solution.

Separable problem. The procedure works quite well for a model having
a separable objective function and separable constraints:

(14)    maximize $\sum_{j=1}^{n} c_j(x_j)$

subject to

(15)    $\sum_{j=1}^{n} a_{ij}(x_j) - b_i \le 0$ for $i = 1, 2, \ldots, m = R$,

and where now $S$ consists of only the nonnegativity conditions. Here each $c_j(x_j)$
is concave and each $a_{ij}(x_j)$ is convex.

The subprogram (12) then breaks into $n$ separate optimizations:

(16)    maximize $c_j(x_j) - \sum_{i=1}^{m} a_{ij}(x_j) y_i^k$ for $j = 1, 2, \ldots, n$.

In some applications, the form of the functions in (16) may be so simple that a
computable formula for $X_j^{k+1}$ results from setting the derivative equal to zero.
At worst, a single-variable search technique, such as one given in Sec. 14.3, can
be employed to find $X_j^{k+1}$.

► There is a more elaborate version of the algorithm for the separable case that usually
speeds convergence, at the expense of a larger restricted master program. It is essentially
the same formulation as that given for separable programming in Sec. 14.7. Weights
are introduced for each $x_j$, and a summation analogous to (5) is included for each set of
weights. The subprogram (16) is unchanged, but the test (11) is performed using the
dual variable for the summation equation associated with weights that yield $x_j$. ◄

Examples. Problems P and Q in Secs. 15.1 and 15.5 satisfy the separability
formulation in (14) and (15). Write Problems P and Q to exhibit the specific
formulas for $c_j(x_j)$ and $a_{ij}(x_j)$. Verify that the $n = 3$ separate subprograms to be
solved for Problem P are

        maximize $6x_1 - 3x_1^2 - x_1 y_1^k$ for $x_1 \ge 0$

(17)    maximize $4x_2 - 2x_2^2 - 2x_2 y_1^k$ for $x_2 \ge 0$

        maximize $2x_3 - \frac{1}{3}x_3^2 - x_3 y_1^k$ for $x_3 \ge 0$.

The corresponding solutions can be found by taking the derivative of each expression
in (17) and setting it equal to zero, thereby yielding

(18)    $X_1^{k+1} = \frac{6 - y_1^k}{6}$,  $X_2^{k+1} = \frac{4 - 2y_1^k}{4}$,  $X_3^{k+1} = \frac{3(2 - y_1^k)}{2}$,

provided $y_1^k$ is small enough for these expressions to be nonnegative. If an expression
in (18) is negative for a particular $j$, then let $X_j^{k+1} = 0$.

A similar analysis for Problem Q leads to the formulas

(19)    $X_1^{k+1} = 1 - \frac{1}{6y_1^k}$,  $X_2^{k+1} = 1 - \frac{1}{2y_1^k}$,  $X_3^{k+1} = 3 - \frac{3}{2y_1^k}$

instead of (18).

The behavior of the algorithm is exhibited in Figs. 15.10 and 15.11. Notice in
Fig. 15.10 that at $k = 2$ the trial solution $x^k$ is optimal, although several more
iterations are needed to verify this fact. Observe that the upper bound from (13)
does not steadily decrease. Interestingly, the technique applied to Problem Q
gives, for every $k$, a value of $y_1^k$ that is the reciprocal of the value of $y_1^k$ in Fig.
15.10. As a result, the algorithm produces according to (18) and (19) the very
same sequence of $X^k$ as in Problem P. (Hence, these $X^k$ are not repeated in Fig.
15.11.) But in Problem Q, the optimal solution is not obtained until $k = 6$.

Figure 15.11. Generalized Programming Algorithm—Problem Q.
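
The three steps of the algorithm can be sketched in Python for Problem P. Several things here are assumptions rather than the text's own procedure: the problem data (objective $6x_1 + 4x_2 + 2x_3 - 3x_1^2 - 2x_2^2 - \frac{1}{3}x_3^2$ and the single constraint $x_1 + 2x_2 + x_3 \le 4$) are inferred from the subprograms and the tabulated optimal values; the starting grid point $(0, 0, 0)$ is chosen for convenience; and the restricted master program is solved by enumerating its basic solutions, a shortcut that is valid only because $R = 1$. All function names are illustrative.

```python
def c(x):
    return 6*x[0] + 4*x[1] + 2*x[2] - 3*x[0]**2 - 2*x[1]**2 - x[2]**2 / 3.0

def A(x):
    return x[0] + 2*x[1] + x[2] - 4.0             # the single constraint, R = 1

def subprogram(y):
    # Closed-form maximizers of the three one-variable problems, clipped at 0.
    return [max(0.0, 1 - y/6.0), max(0.0, 1 - y/2.0), max(0.0, 3 - 1.5*y)]

def restricted_master(grid):
    """Enumerate basic solutions; return (c_k, weights, y1, y2)."""
    cs = [c(X) for X in grid]
    As = [A(X) for X in grid]
    best = None
    for s in range(len(grid)):
        if As[s] <= 0.0:                          # one grid point, slack basic
            cand = (cs[s], {s: 1.0}, 0.0, cs[s])
            if best is None or cand[0] > best[0]:
                best = cand
        for t in range(len(grid)):
            if As[s] < 0.0 < As[t]:               # two points, constraint tight
                wt = -As[s] / (As[t] - As[s])
                value = (1 - wt) * cs[s] + wt * cs[t]
                y1 = (cs[t] - cs[s]) / (As[t] - As[s])
                cand = (value, {s: 1 - wt, t: wt}, y1, cs[s] - y1 * As[s])
                if best is None or value > best[0]:
                    best = cand
    return best

def generalized_programming(X0, max_iter=50, tol=1e-6):
    grid, lowest_bound = [list(X0)], float("inf")
    for k in range(max_iter):
        c_k, w, y1, y2 = restricted_master(grid)              # Step 1
        x = [sum(w_t * grid[t][j] for t, w_t in w.items()) for j in range(3)]
        X_new = subprogram(y1)                                # Step 2
        lagrangian = c(X_new) - y1 * A(X_new)                 # upper bound
        lowest_bound = min(lowest_bound, lagrangian)
        if lowest_bound - c(x) < tol:
            break
        grid.append(X_new)                                    # Step 3
    return x, c_k, lowest_bound

x_trial, c_k, bound = generalized_programming([0.0, 0.0, 0.0])
print(x_trial, c(x_trial), c_k, bound)
```

With this starting point the trial solution reaches (.875, .625, 1.875) once two grid points straddle the constraint, while the bound needs further iterations to close the gap, which matches the behavior described for Fig. 15.10.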


*15.11 DECOMPOSITION OF LINEAR PROGRAMS 

An important special case of the separable model (14) and (15) in the preceding
section is an ordinary linear programming problem. In this case, categorize the
$m$ constraints into a restricted master program with $R$ ($< m$) linear restrictions,
and a subprogram with the remaining $m - R$ linear constraints and the nonnegativity
stipulations. There are two reasons why decomposing the problem in this
way may be beneficial. First, $m$ may be too large for the capacity of whatever
electronic computer is available to you. Splitting the problem into smaller pieces
may alleviate the capacity restriction. Second, the $m - R$ restrictions contained
in $S$ may have a special structure that can be computationally exploited in solving
the subprogram; for example, they may represent the flow-balance equations for
a network. Since there are several ways to decompose a linear programming problem,
the presentation below is meant to be an illustrative rather than a comprehensive
treatment of the approach.

Specifically, suppose you characterize the model as

(1)    maximize $\sum_{j=1}^{n} c_j x_j$

subject to

(2)    $A_i(x) = \sum_{j=1}^{n} a_{ij} x_j - b_i \le 0$ for $i = 1, 2, \ldots, R$,

where $x$ must also satisfy

(3)    $\sum_{j=1}^{n} a_{ij} x_j - b_i \le 0$ for $i = R + 1, \ldots, m$ and each $x_j \ge 0$.

The restricted master program is exactly the same as in (3), (4), and (5) of the
preceding section; therefore, it contains $R + 1$ linear restrictions. The corresponding
subprogram is to find values of $x_j$ satisfying (3) that maximize

(4)    $\sum_{j=1}^{n} c_j x_j - \sum_{i=1}^{R} \left( \sum_{j=1}^{n} a_{ij} x_j - b_i \right) y_i^k = \sum_{j=1}^{n} \left( c_j - \sum_{i=1}^{R} a_{ij} y_i^k \right) x_j + \sum_{i=1}^{R} b_i y_i^k$,

where the $y_i^k$ are the dual variables associated with the solution to the restricted
master program at iteration $k$. Observe that the expression in (4) is a linear function
of $x_j$, and consequently, the subprogram is an ordinary linear programming
problem. If there exists a solution $x_j = X_j$ such that

(5)    $y_{R+1}^k < \sum_{j=1}^{n} \left( c_j - \sum_{i=1}^{R} a_{ij} y_i^k \right) X_j + \sum_{i=1}^{R} b_i y_i^k$,

then the iterations continue with $X$ introduced into the restricted master program
as a new grid point.

This case of the Generalized Programming Algorithm is often referred to as the
Decomposition Algorithm for linear programs. It converges to an optimal
solution in a finite number of trials. Further, at the end of Step 1, you can discard
from the restricted master program those grid points corresponding to the nonbasic
variables. The restricted master program, therefore, never need contain
more than $R + 1$ variables (weights) at the beginning of an iteration. Each grid
point $X^t$ contains at most $m - R$ variables at positive levels, if the simplex method
is used to solve the subprogram. But since an optimal solution corresponding to
the basic variables in the restricted master program is a weighted average of these
$X^t$, namely $\sum_{t=0}^{T} X_j^t w_t$ for each $j$, this solution can contain more than $m$ of the
$x_j$ at positive levels (unless the entire problem has a unique optimal solution).
This is an important difference between the simplex method applied to the entire
problem and the Decomposition Algorithm.

The following example illustrates the method and also exhibits a minor variation
in the approach. Consider the problem

(6)    maximize $x_1 + x_2 + x_3 + x_4$

subject to

         $x_1 + 2x_2 + 2x_3 \le 3$        Row 1
(7)     $-x_1 + x_3 + 2x_4 \le 2$        Row 2
         $x_3 + x_4 \le 3$                Row 3
         $x_3 - x_4 \le 1$                Row 4,

and all $x_j \ge 0$.

Since $x_1$ and $x_2$ appear in the first two constraints only, we will include them
explicitly in the following restricted master program:

(8)    maximize $x_1 + x_2 + \sum_{t=0}^{T} [X_3^t + X_4^t] w_t$

subject to

         $x_1 + 2x_2 + \sum_{t=0}^{T} [2X_3^t] w_t \le 3$
(9)     $-x_1 + \sum_{t=0}^{T} [X_3^t + 2X_4^t] w_t \le 2$
         $\sum_{t=0}^{T} w_t = 1$
         $x_1$, $x_2$, and all $w_t \ge 0$.

Observe that in this version the constants ($b_1 = 3$, $b_2 = 2$) have been brought to
the right-hand side of (9).

Let the subprogram consist of the variables $x_3$ and $x_4$, the second two constraints,
and the nonnegativity conditions. Therefore the summation in (4) is modified
to include only $x_3$ and $x_4$. Specifically, the subprogram is

(10)    maximize $[1 - (2y_1^k + 1y_2^k)]x_3 + [1 - (0y_1^k + 2y_2^k)]x_4$

subject to the constraints in Rows 3 and 4 of (7) and ($x_3 \ge 0$, $x_4 \ge 0$).

The region of feasible points for $x_3$ and $x_4$ in the subprogram is shown in Fig.
15.12; note that there are four vertices, labeled A, B, C, D, which represent possible
basic solutions for the subprogram at each iteration. If there exists a solution
$X_3$ and $X_4$ to the subprogram such that

(11)    $y_3^k < [1 - (2y_1^k + 1y_2^k)]X_3 + [1 - (0y_1^k + 2y_2^k)]X_4$,

then the iterations continue. Observe that the term $(\sum_{i=1}^{R} b_i y_i^k)$ does not appear
on the right of (11), since the constants $b_1$ and $b_2$ have been moved to the right-hand
side of the restricted master program (9).

Figure 15.12. Feasible Region for Subprogram.

Let the initial trial solution be ($X_3^0 = 0$, $X_4^0 = 0$), which is point A in Fig. 15.12.
The optimal solution to the restricted master program is ($x_1^0 = 3$, $w_0^0 = 1$), and
the associated dual variables are ($y_1^0 = 1$, $y_2^0 = 0$, $y_3^0 = 0$).

The resultant objective function for the subprogram is

(12)    maximize $-x_3 + x_4$.

Verify in Fig. 15.12 that the optimal solution to this subprogram is at vertex B,
giving ($X_3^1 = 0$, $X_4^1 = 3$). Since this point yields a value of 3 for the objective function
in (12) and $y_3^0 = 0$ at the current solution, point B must be introduced to
the restricted master program. Calculate the impact of this point on (8) and (9).

The optimal solution to the next restricted master program is ($x_1^1 = 3$, $w_0^1 = \frac{1}{6}$,
$w_1^1 = \frac{5}{6}$), and the corresponding dual variables are ($y_1^1 = \frac{3}{2}$, $y_2^1 = \frac{1}{2}$, $y_3^1 = 0$).

Verify that the objective function of the subprogram becomes:

(13)    maximize $-\frac{5}{2} x_3$.

Now the optimal solution is point A, which already is in the optimal solution of
the restricted master program. Hence, this point yields an equality in (11), and
the present trial solution is truly optimal. The associated values for $x_3$ and $x_4$ are

(14)    $x_3 = X_3^0 w_0^1 + X_3^1 w_1^1 = (0) \cdot \frac{1}{6} + (0) \cdot \frac{5}{6} = 0$
         $x_4 = X_4^0 w_0^1 + X_4^1 w_1^1 = (0) \cdot \frac{1}{6} + (3) \cdot \frac{5}{6} = \frac{5}{2}$.

Several Decomposition Algorithms for linear programs are available on large-scale
electronic computers. These techniques usually involve considerably more
calculations than does the simplex method applied to the same problem. Hence,
such an approach is attractive only when the original problem is too large to be
solved directly by the simplex method, because of computer capacity limitations,
or when the subproblem has a special structure that can be exploited.
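
The subprogram test in the example can be checked numerically by evaluating the subprogram objective at the four vertices of the region defined by Rows 3 and 4. The Python sketch below makes a few assumptions: the vertex coordinates are computed here from those two rows rather than read from Fig. 15.12, the dual values passed in the second call are those obtained by solving the second restricted master program by hand, and the function names are illustrative.

```python
# Vertices of x3 + x4 <= 3, x3 - x4 <= 1, x3 >= 0, x4 >= 0 (computed, not read
# from the figure): the origin, (0, 3), (2, 1), and (1, 0).
VERTICES = [(0.0, 0.0), (0.0, 3.0), (2.0, 1.0), (1.0, 0.0)]


def subprogram(y1, y2):
    """Return (best value, best vertex) of the subprogram objective."""
    coef3 = 1.0 - (2.0 * y1 + 1.0 * y2)
    coef4 = 1.0 - (0.0 * y1 + 2.0 * y2)
    return max((coef3 * x3 + coef4 * x4, (x3, x4)) for x3, x4 in VERTICES)


def should_continue(y1, y2, y3):
    """Apply the test: continue only if y3 is below the best subprogram value."""
    value, vertex = subprogram(y1, y2)
    return y3 < value, value, vertex


first = should_continue(1.0, 0.0, 0.0)    # duals after the initial master
second = should_continue(1.5, 0.5, 0.0)   # duals after adding the new point
print(first)
print(second)
```

In the second call the best value ties between the two vertices with $x_3 = 0$, so the vertex reported depends on tie-breaking; what matters for the test is that the best value equals $y_3$, so no new grid point is needed.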


REVIEW EXERCISES 

1 Consider the problem maximize $-(x_1 - 1)^2 - (x_2 - 1)^2$ subject to $x_1 + x_2 \le 1$,
$x_1 \ge 0$, and $x_2 \ge 0$.

(a) Construct a solution space diagram, and show the optimal solution and the unconstrained
optimal solution.

(b) Let a trial solution be the point $x_1^0 = 1$, $x_2^0 = 0$. Indicate on the diagram the
direction of steepest ascent. Are there any feasible points lying in this direction
from the trial solution?

(c) Show graphically how the Method of Convex Combinations in Sec. 15.2 finds an
optimal solution. (For $k = 1, 2, 3, 4$, indicate $x^k$, $z^k$, and the linear objective
function.)

(d) Do the same analysis for the trial point .v? = xj = J. 

2 Consider the problem maximize $-(x_1 - 3)^2 - (x_2 - 3)^2$ subject to $0 \le x_1 \le 6$ and
$0 \le x_2 \le 2$.

(a) Construct a solution space diagram, and show the optimal solution and the unconstrained
optimal solution.

(b) Show graphically how the Method of Convex Combinations in Sec. 15.2 finds
an optimal solution, starting at the point $x_1^0 = x_2^0 = 0$ (for $k = 1, 2, 3, 4$, indicate
the successive $x^k$ and associated $z^k$). Show how $c(z^k)$ can be smaller than $c(x^k)$, and
indicate why there is a convex combination of $x^k$ and $z^k$, namely, $x^{k+1}$, such that
$c(x^{k+1}) > c(x^k)$. Show how the actual directions used are not the directions of
steepest ascent.

(c) What difference does it make if $x_1^0 = 3$, $x_2^0 = 0$? If $x_1^0 = 0$, $x_2^0 = 1$? If $x_1^0 = 0$,
$x_2^0 = 2$?

3 Consider the problem in exercise 2, but let $0 \le x_2 \le 4$.

(a) Construct a solution space diagram and show the optimal solution.

(b) Show graphically how the Method of Convex Combinations in Sec. 15.2 finds an
optimal solution, starting at the point $x_1^0 = x_2^0 = 0$ (for $k = 1, 2, 3, 4$, indicate
the successive $x^k$ and associated $z^k$).

4 (a) Give an algebraic proof of the statement in Sec. 15.2 that a convex combination
of two feasible solutions, $x^k$ and $z^k$, is also feasible for any $t$ in the interval $0 \le t \le 1$.

(b) Verify that (4) can be written as (7) if you use the directions defined in (6).

(c) Rearrange the summations in (3) so as to express the inequality in terms of
directions. Give a verbal interpretation of the resulting inequality.

(d) Explain how the Method of Convex Combinations relies on assumptions (ii), (iv),
and (v) in Sec. 15.1.

5 *(a) Derive the first inequality in the bounding relation (8) shown in Sec. 15.2.

(b) Derive the second inequality in (8).

(c) Explain why the validity of the inequalities in (8) implies that if $z$ yields an equality
between the two summations in (3), then the associated $x^k$ is truly optimal.

6 Consider the problem maximize (xj — 3)® -r (xj — 3)* subject to 

+ —xi 4 - 3 x 0 < 9 3x, — Xj < 9 x, ^ 0 and Xj ^ 0. 

(a) Construct a solution space diagram and show the optimal solution. 

(b) Suppose you apply the Method of Convex Combinations, starting at xJ = 3, 
xJ = 4. Determine how the algorithm behaves. 

(c) Rework part (b), starting at x\ = 3J. 

(d) Rework part (b), starting at xJ = xJ = 3. 

(e) Rework part (b), starting at xJ = 3, = 0. 

7 Consider the problem maximize 18xi -f- 18 x 2 — 5xf — 8 x 1 X 2 — 5^ subject to Xi -f- x^ 

^ 1 > ^ 0, and X 2 ^ 0, Apply one iteration of the Method of Convex Combinations 

in each part below to find a z corresponding to the indicated x®. Then calculate 
c(x®), c{£) and the upper bound ( 8 ) in Sec. 15.2. 

(a) x; = x®= 0 . (d) xJ=i,xS=i. 

(b) xJ = 1, xJ = 0 , (e) x® = ^, xJ = |. 

(c) xJ = xJ = |. (f) xJ = xJ = Is this an optimal solution? Explain. 

8 Consider the application of the Method of Convex Combinations to the example (9)
through (12) in Sec. 15.2. Perform the detailed calculations for $k = 0, 1, \ldots, 5$ and
verify the results shown in Fig. 15.1.

*9 Consider the application of the Concave Simplex Method to the example (10) through
(17) in Sec. 15.2. Perform the detailed calculations for $k = 1, 2, \ldots, 5$ and verify the
results shown in Fig. 15.3.

*10 In each part below, apply the Concave Simplex Method described in Sec. 15.2 to
Problem P, modified as indicated. Stop the iterations after $k = 4$ (if convergence has
not yet occurred). (These modifications were also treated in exercise 23 of Chap. 4.)

(a) Let the right-hand-side constant in (9) be 8 , instead of 4. 

(b) Let the right-hand-side constant in (9) be 3, instead of 4. 

(c) Let the right-hand-side constant in (9) be 2, instead of 4. 

(d) Add the constraint Xi -f X 2 ^ 1. 

(e) Add the constraint X 2 -f Xj ^ 2. 

(f) Add the constraint —2xi -|- 2 x 3 ^ 1. Parts {g) to (i) onp, 624. 
(g) Let the objective function be Gxj + 4x2 + 2^3 "■ 4^ *“ — SxJ. 

(h) Let the objective function be 2xi + 6 x 2 + 4 x 3 — 3xf — 2^ — 

(i) Add to the objective function the terms —X 1 X 2 — 2 x 2 X 3 . 

*11 In each part below, apply the Concave Simplex Method described in Sec. 15.2. Stop
the iterations after $k = 4$ (if convergence has not yet occurred). Indicate the progress
of the iterations on a solution space diagram.

(a) Exercise 1. (c) Exercise 3. 

(b) Exercise 2. (d) Exercise 7. 

(e) maximize 10x2 "" ” ^^Xj 4- 16x1X2 subject to 

3xi -{- X 2 < 3 Xi ^ 0 and X 2 ^ 0. 

(f) Change the signs of the objective-function coefficients in part (e), and let the 
initial basic solution be Xi = 3. Do you obtain a global optimum? Justify your 
answer. 

*12 Section 15.3 states that, in the Method of Convex Combinations, all directions may be
nonzero at an iteration, whereas, in the Concave Simplex Method, at most only
$m + 1$ directions are nonzero at an iteration. Give a detailed justification for this
assertion.

*13 Consider the Cutting-Plane Method (as described in Sec. 15.4) applied to the following
problem: maximize $-x^2 + 4x$ subject to $0 \le x \le 5$.

(a) Find the optimal value for $x$. Plot the objective function for $0 \le x \le 5$.

(b) Let $x^0 = 0$. Indicate the progress of the algorithm for $k = 1, 2, 3, 4$ by drawing
a solution space diagram for the variables $x$ and $x_0$, and showing the cutting plane
added at each iteration as well as the values for $x^k$ and $x_0^k$. Explain in this diagram
the reason for calling the method a cutting-plane approach.

*(c) Change the signs of the coefficients in the objective function in part (a), and
explain why the method does not work if the objective function to be maximized
is convex.

14 Consider the function $a(x) = x_1^2 - x_2$.

(a) Draw the contour $a(x) = 0$.

(b) Let $x_1 = 2$ and $x_2 = 4$, and write (4) in Sec. 15.5 in full detail. Verify that the
line given by setting the summation on the right-hand side of (4) equal to 0 is
$4(y_1 - 2) - (y_2 - 4) = 0$, and is a linear support to the contour in part (a) at
the indicated point.

(c) Perform the same analyses for a{x) = SxJ — X 2 and the point Xi = §, X 2 = 4, 


15 Determine for each part whether $a(x)$ satisfies assumptions (i), (ii), and the Constraint
Qualification in Sec. 15.5. Determine the feasible values for $x$. Check whether
$da(z)/dx \ne 0$ for a point $z$ such that $a(z) = 0$.


(a) 

(b) 

(c) 

(d) 

(e) 


;(S _ 4 ^ 0. 

+ 4 0. 

-x^-4^ 0. 
-x^ + 4<.0. 
-(x-- 4)-^0. 


(0 

(g) 

(h) 

(i) 

(j) 


(x*-4)»^0. 

-(x=-4)»^0. 

(x^-4y^0. 

-(4-x*)’^0. 

(4 - X*)’ 0. 

16 Determine for each part whether $a(x)$ satisfies assumptions (i), (ii), and the Constraint
Qualification in Sec. 15.5. Determine the feasible values for $x_1$ and $x_2$. Check whether
$\partial a(z)/\partial x_j \ne 0$ for at least one $j$ at a point $z$ such that $a(z) = 0$.

(a) X, 4 - >^2 - 1 ^ 0. (d) {xi + X, — 1)* ^ 0. 

(b) -^1 - ^2 + 1 ^ 0. (e) -{xi + Xi - 1)’ 0. 

(c) -(X, + X2 - 1)* ^ 0. (f) {X, + Xi- 1)’ ^ 0. 

17 In each part, construct a diagram indicating the set of feasible points. Explain in
geometric terms why the set is convex. (Assume $x_1 \ge 0$ and $x_2 \ge 0$.) Check whether
the Constraint Qualification is satisfied.

(a) xj 4- (xj — 1)* — 1 ^ 0 and (x, — 1)* + ^ — 1 ^ 0. 

(b) xj -f- (x2 — I)* — 1 < 0 and xj + x| — 1 ^ 0. 

(c) (x 2-2)2- 1 ^0andxj + x|- 1:^0. 

(d) xj + x2- 4 ^ 0 , (xi- 3 )* + ;^- 4 ^ 0 and(x,-§)*+ (x 2 - 3 )*- 4 ^ 0 . 

(e) xJ 4- — 1 ^ 0 and Xi 4 - X2 — 7 ^ 0. 

(f) xf 4 - xl — 1 ^ 0 and —Xi — X2 4 - i < 0 . 

(g) xf 4- — 8 ^ 0 and Xj 4“ X2 — 3 ^ 0 . 

(h) xJ 4- — 8 < 0 , —Xi — X2 4- 3 ^ 0 , and Xj 4- X2 — 3.5 ^ 0 . 

(i) XJ 4 “ Xj — 8 ^ 0 and —Xj — X2 4- 4 < 0. 

(j) 4- 10 < 0 and 2xi 4- 3x2 - 10 ^ 0. 

(k) —X1X2 4- 1 < 0 and Xj 4" X2 — 2 < 0. 

(l) Reverse the inequality of the first constraint in part (a). Show why the feasible
region is no longer convex. 

18 Show the intermediate algebraic steps demonstrating that if each constraint function
$a_i(x)$ is convex according to (3) in Sec. 15.5, then the region of feasible solutions is
convex.

*19 Section 15.5 states five ways to establish whether a particular function $a_i(x)$ is convex.
Demonstrate the validity of these conclusions.

20 In each part below, establish whether or not the specified function is convex. If the
function is convex, justify your conclusion by citing one of the five properties (A)-(E)
given in Sec. 15.5. If the function is not convex, use a numerical illustration to show
that the definition of convexity (3) is not satisfied.

(a) * for X > 0 . 

(b) —Xi — X2 4* ^ 4- xJ for all Xi and X2. 

(c) $x_1 x_2$ for $x_1 \ge 0$ and $x_2 \ge 0$.

(d) —log (2xi 4 - 3 x, 4 - 4 ) 4 - 5 (xi — 6x2)* for x, ^ 0 and Xj ^ 0. 

(c) (xj — 2)’ — (x, — 2)* for X| ^ 0 and X2 ^ 0. 

(f) (2 — X,)* 4- (2 — X2)’ for Xi ^ 0 and X2 ^ 0. 

21 Consider a nonlinear optimization problem satisfying the assumptions in Sec. 15.5.
Suppose you have a trial point that is on a boundary of the constraint set but is not
truly optimal. Explain why there must be a direction of improvement moving into the
interior of the constraint region. (Suggestion: use a diagram like Fig. 15.6 to illustrate
your reasoning.)

22 (a) Explain why, if $x^k$ is interior to the ith constraint, the point $(x^k + td)$ will satisfy
the ith constraint for any set of directions d, provided t > 0 is sufficiently small.

(b) Consider the point $x^k$ in Fig. 15.6. Draw the linear support (tangent hyperplane)
to the nonlinear function at $x^k$. Indicate those directions that make the right-hand
side of (3) strictly negative. Must any such direction intersect the boundary of the
nonlinear function? Explain why, for any such direction, there is a point near
to $x^k$ that lies in the interior of the constraint region.

23 (a) Explain why the optimization formulation (4) through (6) in Sec. 15.6 can be
used to discover directions that satisfy the improvement test (1) and make the
right-hand sides of (3) strictly negative when the trial point $x^k$ is not a constrained
stationary point.

(b) Show that the nonnegativity constraints included in the $A_i(x)$ guarantee in (6)
that the $d_j$ are bounded from below by $-x_j^k$ for $x_0 \ge 0$.

24 (a) Verify that the optimization formulation (4) through (6) in Sec. 15.6, when
applied to Problem Q, given at the end of Sec. 15.5, yields (9) through (11).

(b) Write in detail and solve the optimization for k = 0, using the trial point x} = 
xj = 1, X 3 = 3 in Fig. 15.7. Verify that you obtain the trial point x*. 

(c) Write in detail and solve the optimization assuming that x* is the truly optimal 
solution (given at the bottom of Fig. 15.7), 

25 (a) Formulate Problem P, given by (4) and (5) in Sec. 15.1, in terms of the optimization
problem (4) through (8) in Sec. 15.6.

*(b) Compare the formulation in part (a) to that used in the Method of Convex
Combinations in Sec. 15.2. State the effect of selecting the value for b in (8).

*(c) Show how to convert your answer in part (a) to a model containing two constraints
and three upper-bounded variables.

(d) Let b = 1 and the initial trial point be $x_1^0 = x_2^0 = x_3^0 = 0$ in your answer to part
(a). Complete the optimization for k = 0, and derive the new trial point for
k = 1. Do the same for b = 5.

(e) Write the formulation in part (a) in detail and solve the resultant optimization
problem when the trial point is truly optimal.

26 Let m = 3, n = 5, and = —xj for j = 1, 2,..., 5. Write in detail the linear 

programming model (1) through (4) and its dual (5) through (8) in Sec. 15.7. Also
write in detail the Optimality Conditions (9), (10), and (11).

27 In each part below, write in full detail the Optimality Conditions (9), (10), and (11). 

(a) Problem P, given by (4) and (5) in Sec. 15.1. Verify (16) and (17).

(b) Problem Q, given at the end of Sec. 15.5. Also find values for $u_i$ satisfying the
conditions for the optimal x.

(c) Show in Problems P and Q that the assumptions $x_1 > 0$ and $x_2 = x_3 = 0$ imply
that no $u_i \ge 0$ satisfy the Optimality Conditions.

28 Consider the Optimality Conditions in Sec. 15.7. 

(a) Verify that (10) can be derived by dividing the expression in (5) by fo > 0. 
Justify the remaining steps in the argument. 

(b) Derive conditions (11) from the conditions in (7). Justify your approach.

(c) Show why (10) and (11) imply (12), (13), and (14).

*(d) Assume that c(x) and all the $a_i(x)$ are linear. Show that (9), (12), (13), and (14)
reduce to the results given by the Theorem of Complementary Slackness in Sec.
5.3.

29 Suppose you want to test whether a trial feasible point is optimal for a nonlinear
programming problem. Discuss how you can apply the Optimality Conditions in Sec.
15.7, instead of the linear programming model (1) through (4). Describe the connection
between the two approaches.

30 In each part below, find an optimal solution by inspection. Check whether the
Optimality Conditions in Sec. 15.7 are satisfied, that is, find values for the $u_i \ge 0$.

(a) Maximize — (a: — 4)- for 1 < x < 3. Are the Optimality Conditions satisfied 
at a:= 1? AtA:= 3? Why? 

(b) Maximize (.v — 4) ^ for 1 < .v < 6 . Are the Optimality Conditions satisfied at 
A = 6? AtA: = 4? Why? 

(c) Maximize x subject to (x- — 4)^ — 25 ^ 0. 

(d) Maximize x subject to (x- — 4)^ < 0. Explain the difference in your answers to 
parts (c) and (d). 

(e) Show that your answers in parts (a) and (c) yield a saddle point for the Lagrangian
function (21).

(f) Show that the Lagrangian functions for the problems in parts (b) and (d) do not
have saddle points.

(g) Maximize (a: — 4)^ for 1 < a: ^ 6 . Are the Optimality Conditions satisfied at 
A* = 1 ? At A* = 4? At .V = 6? Why? 

31 Consider the Lagrangian function $L(x, u)$ given by (21) in Sec. 15.7. Write the expression
in detail when $c(x)$ and all $a_i(x)$ are linear functions. Discuss the connection
between the Lagrange Optimality Conditions for this case and the Dual Theorem for
linear programs.

*32 Let m = 2 and n = 3 in a quadratic programming model given in Sec. 15.8. Write
in detail the expressions (1) through (4) and provide the intermediate steps to justify
the statements (5) through (14).

*33 Consider the solution to Problem P by the Modified Simplex Algorithm, as shown in
Sec. 15.8. Perform the detailed numerical operations, check the steps of the algorithm,
and verify the results in (18) through (21).

*34 In each part, apply the Modified Simplex Algorithm described in Sec. 15.8 to Problem
P, altered as indicated below. (These alterations were also treated in exercise 23 of
Chap. 14.) Parts (f) through (i) are on p. 628.

(a) Let the right-hand-side constant in (9) be 8, instead of 4.

(b) Let the right-hand-side constant in (9) be 3, instead of 4.

(c) Let the right-hand-side constant in (9) be 2, instead of 4.

(d) Add the constraint Xj -f- ^ 1. 

(e) Add the constraint X 2 -f Xj ^ 2. 

(f) Add the constraint — + 2^:3 ^ 1.

(g) Let the objective function be Gxj + 4x2 + 2 x 3 — ix\ — 2x\ — 3x|. 

(h) Let the objective function be 2xi + 6 ;r 2 + — 3xf — 2x\ —• 

(i) Add to the objective function the terms —atiXj — 2 ^ 2 ^ 3 . 

35 (a) Assume that $a_i(x)$ is a convex function. Show that $[1/a_i(x)]$ is a concave function
for those x such that $a_i(x) < 0$.

(b) Verify the first equality in (4) of Sec. 15.9. 

*(c) Give the detailed reasoning establishing (iii) and (5) in Sec. 15.9. 

36 Consider the problem: maximize $4x$ subject to $x \le 1$.

(a) Write the penalty function (2) in Sec. 15.9.

(b) Perform the differentiation in (4) and construct a graphical representation for the
x that maximizes $c(x|r)$, given a value of r. [Hint: you can manipulate (4) to
yield an equality between a linear and a hyperbolic function, which in turn can
be represented graphically by an intersection of the two functions.] Show how the
solution varies as r tends to 0. Is the limiting value for x truly optimal?

(c) Suppose the constraint is $x \le 5$. How does this affect the behavior of the solution
for a given r and the limiting solution as r approaches 0?
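
A numerical sketch of exercise 36 follows. It assumes that the penalty function (2) in Sec. 15.9 adds r times the reciprocal of the constraint function to the objective, with r > 0 and the constraint written as x - 1 ≤ 0 (exercise 75 describes the multiplier of r as a sum of such reciprocals). The ternary search, the helper names, and the search interval are illustrative choices, not the text's procedure.

```python
import math


def penalized(x, r, bound=1.0):
    """Assumed penalty objective: 4x + r / (x - bound), defined for x < bound."""
    return 4.0 * x + r / (x - bound)


def maximize_penalized(r, bound=1.0, lo=-10.0, iters=200):
    """Ternary search for the interior maximizer; the function is concave for x < bound."""
    hi = bound - 1e-12
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if penalized(m1, r, bound) < penalized(m2, r, bound):
            lo = m1
        else:
            hi = m2
    return 0.5 * (lo + hi)


def closed_form(r, bound=1.0):
    """Setting the derivative 4 - r / (x - bound)^2 to zero gives this interior root."""
    return bound - math.sqrt(r) / 2.0


if __name__ == "__main__":
    for r in (1.0, 0.01, 1e-4):
        print(r, maximize_penalized(r), closed_form(r))
```

Under these assumptions the maximizer moves toward the boundary value as r shrinks; passing `bound=5.0` gives the analogous behavior for part (c).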

37 Consider the generalized programming formulation (1) through (5) in Sec. 15.10. 

(a) Explain why the restricted master program has a basic feasible solution and a
finite maximum value c^. 

(b) Show that any feasible solution to (4) and (5) yields a point, given in (6), that is
feasible in the original problem.

(c) Let R = 2 and T = 3 in (3), (4), and (5). Write in detail the associated dual
problem (7), (8), and (9).

(d) Show that if the inequality in (10) holds for all x in S, then a*^' is truly optimal in 
the original problem, as claimed. 

(e) Show that the objective function in (12) for the subprogram is concave. 

(f) Justify the bound for c in (13). [Hint: since can be any of the X^, 

i = 0, 1, ..., T, the inequality (<) holds immediately in (11).]

(g) Explain why the algorithm can be terminated at the next iteration if $w_{T+1} = 1$.

38 (a) Consider solving the Problems P and Q, by the Generalized Programming 
Algorithm, as described in Sec. 15.10. Show the intermediate steps justifying that 
the approach leads to (17), (18), and (19). 

(b) Carry out the algorithm for Problem P to the point of deriving $x^k$ for k = 2,
shown in Fig. 15.10. (Note that X\= X^ — XI ^ 0.) 

(c) Write in detail the restricted master program for Problem P, where you include 

only X^ and X^ from Fig. 15.10 and the slack variable for the first constraint. 
Verify that the optimal weights are = .5, as shown in Fig. 15.10, and 

complete the next (final) iteration. 

*39 Consider the example (6) and (7) in Sec. 15.11 illustrating the application of the
Decomposition Algorithm. Verify the expressions in (8) through (11), and carry out
the detailed calculations to obtain the results shown in (12), (13), and (14).




40 Consider the example (6) and (7) in Sec. 15.11. Change the objective function to
$x_1 + x_2 + 9x_3 + 3x_4$, and let the right-hand-side constant be 0, instead of 1, in Row 4.
Solve by the Decomposition Algorithm.

*41 Consider the case of the Knox Mix Company, described in Sec. 2.2 and solved by the
simplex method in Sec. 4.4. Solve by the Decomposition Algorithm, where you let
the subprogram consist of the first constraint ($x_1 + x_2 + x_3 + x_4 \le 15$) and the
nonnegativity conditions.


42 Explain your understanding of the following terms: 


large-step approach
regularization postulate
optimal feasible step size
constrained stationary point
convex combination (weighted average)
concave function (strictly concave function)
convex function (strictly convex function)
first-order method (second-order method)
Method of Convex Combinations
*Concave Simplex Method
*reduced (or relative) partial derivative
*Cutting-Plane Method
*Gradient Projection Method
*projection
linear support (tangent hyperplane)
Constraint Qualification
convex set
feasible direction
Method of Feasible Directions
Kuhn-Tucker conditions
*John conditions
Lagrangian function
Lagrange multipliers
saddle point
*complementary pivot operation
penalty function approach (SUMT Algorithm)
Generalized Programming Algorithm
restricted master program
subprogram
Decomposition Algorithm


PENCIL-PUSHING EXERCISES 


43 Consider the following problem [which was also treated in exercise 2*1, part (a), 
of Chap. 14, and represents Case 3 in Fig. 14.9] 


maximize $-(x_1 - 1)^2 - (x_2 - 2)^2 + 5$

subject to

$x_1 + 2x_2 \le 5$
$16x_1 + 12x_2 \le 37$
$24x_1 + 4x_2 \le 27$
$x_1 \ge 0$ and $x_2 \ge 0$.

Solve each part below by the method indicated. Draw a solution space diagram and 
show the progress of the iterations. (In those approaches that do not converge in a 
finite number of iterations, stop the calculations after k = 5.) 

(a) Method of Convex Combinations in Sec. 15.2.

*(b) Concave Simplex Method in Sec. 15.3.

*(c) Modified Simplex Algorithm in Sec. 15.8.

(d) Generalized Programming Algorithm for a separable problem in Sec. 15.10. 

(e) Apply the Optimality Conditions in Sec. 15.7 to derive an optimal solution. 

44 Answer the questions in exercise 43, except let the constraints be (Case 2 in Fig. 14.9) 

$x_1 + 2x_2 \le 5$
$16x_1 + 12x_2 \le 37$
$6x_1 + x_2 \le 12$
$x_1 \ge 0$ and $x_2 \ge 0$.

45 Answer the questions in exercise 43, except let the constraints be (Case 1 in Fig. 14.9) 

$x_1 + 2x_2 \le 6$
$8x_1 + 6x_2 \le 23$
$6x_1 + x_2 \le 12$
$x_1 \ge 0$ and $x_2 \ge 0$.

46 Consider a standard transportation problem, like that in Sec. 6.2, with supplies
$S_1 = 4$, $S_2 = 6$, and demands $D_1 = 5$, $D_2 = 2$, and $D_3 = 3$. Assume the cost function
is

+ ^12 + ^22 + ^23) + 3 x ^3 + 2 x|i. 

In each part below, explain how the algorithm makes use of the problem’s special 
structure. Is an optimal solution integer-valued? (Do not carry the calculations 
beyond k = 5.) 

(a) Solve by the Method of Convex Combinations. 

*(b) Solve by the Concave Simplex Method.

47 Consider the following problem (which was also treated in exercise 46 of Chap. 14): 

maximize $6x_1 + 8x_2 - x_1^2 - x_2^2$

subject to

$4x_1^2 + x_2^2 \le 16$
$3x_1 + 5x_2 \le 15$
$x_1 \ge 0$ and $x_2 \ge 0$.

Solve each part below by the method indicated. Draw a solution space diagram and
show the progress of the iterations. (In those approaches that do not converge in a
finite number of iterations, stop the calculations after k = 5.)

(a) Method of Feasible Directions in Sec. 15.6. 

*(b) Modified Simplex Algorithm in Sec. 15.8.

(c) Generalized Programming Algorithm for a separable problem in Sec. 15.10. 

(d) Apply the Optimality Conditions in Sec. 15.7 to derive an optimal solution. 

(e) Consider, instead, the objective function maximize $(x_1 - 1)^2 + (x_2 - 1)^2$. Apply
the Method of Feasible Directions, starting at $x_1^0 = x_2^0 = 0$ and using b = 1. Do
you obtain an optimal solution? What would happen if you use b = 2? Apply
the Optimality Conditions in Sec. 15.7 at the point $x_1 = x_2 = 1$, and discuss
the implication of your result.

48 Consider the problem 

maximize $x_1 + x_2$

subject to

$x_1^2 + (x_2 - 4)^2 - 16 \le 0$
$(x_1 - 4)^2 + x_2^2 - 16 \le 0$
$x_1 \ge 0$ and $x_2 \ge 0$.

Solve each part below by the method indicated. Draw a solution space diagram and 
show the progress of the iterations. (Stop the calculations after k = 5.) 

(a) Method of Feasible Directions in Sec. 15.6. 

(b) Generalized Programming Algorithm for a separable problem in Sec. 15.10. 

(c) Apply the Optimality Conditions in Sec. 15.7 to find an optimal solution. 

(d) Consider an alternative objective function: maximize $(x_1 - 1)^2 + (x_2 - 1)^2$.
Apply the Method of Feasible Directions, starting at $x_1^0 = x_2^0 = 0$ and using b = 1.
Do you obtain an optimal solution? What would happen if you used b = 2?
Apply the Optimality Conditions in Sec. 15.7 at the point $x_1 = x_2 = 1$, and
discuss the implication of your result.

49 Answer parts (a), (b), (c) in exercise 48, except let the objective function be 

maximize — 26xi -f 50x2 — Sxf — 9^ -f IOX 1 X 2 , 

50 Show how to solve the problem in exercise 2 using the Generalized Programming 
Algorithm in Sec. 15.10, where you let R contain all the constraints. (Do not carry
the calculations beyond k = 5.) Show the progress of the iterations on a solution space 
diagram. 

*51 Use the Decomposition Algorithm in Sec. 15.11 to solve the upper-bounded variables
problems in exercise 28 of Chap. 5. Let the subprogram contain the upper bounds and 
nonnegativity conditions. 

*52 Use the Decomposition Algorithm in Sec. 15.11 to solve the transportation problem
example in Sec. 7.3 (see Fig. 7.2). Let R contain the supply constraints, and S the
demand and nonnegativity constraints.


FORMULATION AND NUMERICAL EXERCISES 

53 Consider a nonlinear optimization model like that discussed in Sec. 15.5. Show how
to convert the problem into an equivalent one in which the objective function is
linear. (Hint: introduce another variable and another constraint that involves the new
variable and the original objective function.)

54 Consider a problem containing the nonlinear constraint

n r n n ~\\/2 

a^Xi — K 2 aoXiXi^ —6^0, 

where K > 0 and the term inside the brackets is nonnegative for all x. Show an equivalent
formulation composed of two constraints, one of which is linear and one nonlinear,
and where the nonlinear a(x) is a convex function.

55 Consider the problem of maximizing a nonlinear function c(x) subject to linear
constraints, as displayed by (1) and (2) in Sec. 15.1. 

(a) Write the Optimality Conditions and their implications (9) through (14) given 
in Sec. 15.7. [Suggestion: first eliminate the slack variables in (2) of Sec. 15.1.] 

(b) Apply your answer in part (a) to the problem 

maximize $-(x_1 - 4)^2 - (x_2 - 4)^2$

subject to

$x_1 + 3x_2 \le 12$
$3x_1 + x_2 \le 12$
$-x_1 - x_2 \le -1$
$-x_1 + x_2 \le 3$
$x_1 \ge 0$ and $x_2 \ge 0$.

(c) Draw a solution space diagram for the problem in part (b). Indicate the optimal
solution. Interpret conditions (13) and (14) in the diagram. [Hint: you can
represent $[\partial c(x)/\partial x_1, \partial c(x)/\partial x_2]$ as the direction of steepest ascent from x. You
can represent $[\partial a_i(x)/\partial x_1, \partial a_i(x)/\partial x_2]$ for each constraint i that is satisfied as an
equality by a direction that is orthogonal (perpendicular) to the constraint at x.]

56 Consider the problem 

maximize Xx 4 ^2 

subject to 

^ + (>f 2 “ 0* 1 ^ ^ 4 ^ — 1 ^ 0 Xj ^ 0 and X 2 > 0. 

(a) Draw a solution space diagram and show the optimal solution. 

(b) Write the Optimality Conditions and their implications (9) through (14) in 
Sec. 15.7. 

(c) Give a geometrical interpretation for your answer in part (b). [Hint: you can
represent $[\partial c(x)/\partial x_1, \partial c(x)/\partial x_2]$ as the direction of steepest ascent from x. You
can represent $[\partial a_i(x)/\partial x_1, \partial a_i(x)/\partial x_2]$ for each constraint i that is satisfied as an
equality by a direction that is orthogonal (perpendicular) to the linear support
of the constraint at x.]

57 Consider the problem 

maximize CjXi 4 ^>^2 


subject to 


$x_1^2 + x_2^2 \le 25$, $x_1 - x_2 \le 1$, $x_1 \ge 0$ and $x_2 \ge 0$.

Use the Optimality Conditions in Sec. 15.7 to derive a range of values for $c_1$ over
which the solution $x_1 = 4$ and $x_2 = 3$ is optimal. Display your results on a solution
space diagram.

58 Where-or-When Problem (Sec. 1.6). Find the form of an optimal solution for this case by
applying the Optimality Conditions in Sec. 15.7. Do the same for the problem 

maximize (^i + -h * • * + ^n) 

subject to 

^ C and every ^ 0. 


59 Consider the problem

maximize $\sum_{j=1}^{n} c_j(x_j)$

subject to

$\sum_{j=1}^{n} x_j \le b$,

where each $c_j(x_j)$ is strictly concave for all $x_j$ and has a continuous first derivative.

(a) Write the Optimality Conditions and their implications (9) through (14) in Sec.
15.7.

(b) Show that if b is increased to $b + \delta$ ($\delta > 0$), then the new optimal value of each
$x_j$ is at least as large as it was for b.

60 Consider the three-period production and inventory model with convex costs given in
exercise 32 of Chap. 9. Drop the assumption that each $x_t$ must be integer-valued.

(a) Use the Optimality Conditions in Sec. 15.7 to solve each part of that exercise.
Show the cost impact of an infinitesimal change in each demand requirement $D_t$.

(b) Assume the production cost function in Period 1 is Ci(xi) = (where rj = 1 in 
exercise 32 of Chap. 9). Use the Optimality Conditions to derive a range of values
for Cj over which the solutions in parts (a), (b), (c), and (f) of exercise 32, Chap. 9, 
remain optimal. 

61 A company requires a work force of size $D_t$ in Period t, where t = 1, 2, ..., n. At
the beginning of Period 1, the initial work force $x_0 = 0$. Let $x_t$ be the amount the
company hires (for $x_t > 0$) or fires (for $x_t < 0$) at the beginning of Period t. The
values of $x_t$ must satisfy

$\sum_{j=1}^{t} x_j \ge D_t$ for t = 1, 2, ..., n,

where the summation on the left-hand side represents the work force available in
Period t. In each Period t, let the hiring/firing cost be $c_1(x_t)$ and the idle work force
cost be $c_2(\sum_{j=1}^{t} x_j - D_t)$. Assume that these two functions are strictly convex with
continuous first derivatives, and the derivative of $c_2(x)$ is nonnegative at x = 0. Assume
that $c_1(x)$ grows without bound as |x| increases, and $c_2(x)$ grows without bound as x
increases. The objective function is

maximize $-\sum_{t=1}^{n} \left[ c_1(x_t) + c_2\left(\sum_{j=1}^{t} x_j - D_t\right) \right]$.

(a) Write the Optimality Conditions (9), (10), and (11) in Sec. 15.7. 

(b) Show that if there is a Period t < n in which it is optimal to have some idle work 

force S 5-1 then it is optimal to hire in the next period so that > Xi, 

*62 Consider the Dual Theorem of Nonlinear Programming given in Sec. 15.7. Write
the dual problem (23) and (24) when in fact every constraint is linear. Show how 
(23) and (24) can be reduced to the usual dual problem in linear programming. 


63 A constrained stationary point is obtained in the Method of Feasible Directions in
Sec. 15.6 when there is no feasible direction yielding an improvement as evaluated
in terms of the first-order partial derivatives; this condition is expressed by the text in
(1). Show that when c(x) is concave and $x^k$ is a constrained stationary point, then $x^k$
is globally optimal. [Hint: you can use the property of a concave function that

$c(x + d) \le c(x) + \sum_{j=1}^{n} \frac{\partial c(x)}{\partial x_j} d_j$ for any point x and direction d.]

64 Suppose $A_i(x)$ is a linear constraint and $x^k$ is a point satisfying $A_i(x^k) \le 0$. Show that
directions $d_j$ that satisfy (6) in Sec. 15.6 with the variable $x_0$ omitted will be feasible.


65 Consider the problem 


subject to 


maximize CiXi 


auXi - ^2 < 0 
^ 21^1 “^ 3^0 

— log 032(Afj + 1 ) — log 033(^3 + 1 ) — ^3 ^ 0 
every atj > 0, 

where all > 0. Consider solving the problem using the Generalized Programming 
Algorithm in Sec. 15.10. Compare the difficulty of solving the restricted master 
program and the subprogram when the nonlinear constraint is included in S and 
when it is not.


66 Consider the application of the Generalized Programming Algorithm to a separable
problem, as described in Sec. 15.10. Discuss the effect of including the nonnegativity
constraints $x_j \ge 0$ explicitly in the restricted master program and removing them
from the subprogram. (Suggestion: illustrate your answer using Problem P.)

67 One-Potato, Two-Potato Problem (Sec. 1.6). Exercise 47 of Chap. 14 generalized this case

to account for the possibility of varying the yield coefficients. Show how the problem 
in exercise 47, Chap. 14, can be formulated and solved by the Generalized Program¬ 
ming Algorithm when you assume the expense functions and ^ 2(^19 

are linear. 


MIND-EXPANDING EXERCISES

68 (a) Assume a(x) is a convex function, where $x = (x_1, x_2, \ldots, x_n)$. Consider points
$y_1, y_2, \ldots, y_p$ and weights $w_1, w_2, \ldots, w_p$, where each $w_i \ge 0$ and $\sum_{i=1}^{p} w_i = 1$.
Show that

$\sum_{i=1}^{p} w_i a(y_i) \ge a\left(\sum_{i=1}^{p} w_i y_i\right)$,

where the summation on the right denotes the weighted average, coordinate by
coordinate, of the points $y_1, y_2, \ldots, y_p$. What is the corresponding proposition for a
concave function?

(b) Assume each $a_i(x)$ is a convex function, for i = 1, 2, ..., m, where
$x = (x_1, x_2, \ldots, x_n)$. Show that $A(x) = \text{maximum}\,[a_1(x), a_2(x), \ldots, a_m(x)]$ is a convex
function. What is the corresponding proposition when each $a_i(x)$ is a concave
function?

69 (a) Show that a continuously differentiable function a(x) satisfies the inequality (4)
in Sec. 15.5 if and only if it is convex.

*(b) Show that a function a(x) having continuous second-order partial derivatives is
convex if and only if the quadratic form

$\sum_{i=1}^{n} \sum_{j=1}^{n} \frac{\partial^2 a(x)}{\partial x_i \, \partial x_j} y_i y_j$

is positive semidefinite at every point x.

*(c) Show how to modify your answers when a(x) is concave.

70 Suppose the inequality (1) in Sec. 15.6 is satisfied by a set of directions $d_j$ (not necessarily
feasible). Prove that there exists a point $(x^k + td)$, where t > 0, such that
$c(x^k + td) > c(x^k)$.

71 Consider a nonlinear optimization model like that discussed in Sec. 15.5. Assume
that $c(x_1, x_2, \ldots, x_n)$ is a strictly convex function. Prove that the optimal value c occurs
at an extreme point of the constraint region. (Recall: x is an extreme point if it cannot
be represented as the weighted average of two other feasible points.)

72 Consider the Lagrange Optimality Conditions in Sec. 15.7. Show that a point $\bar{x}$ is
optimal if there is a point $\bar{u} \ge 0$ such that the pair $(\bar{x}, \bar{u})$ is a saddle point of $L(x, u)$.

73 Let $c(x_1, x_2, \ldots, x_n)$ and $a_i(x_1, x_2, \ldots, x_n)$, for i = 1, 2, ..., m, be arbitrary real-valued
functions defined over an arbitrary region R for $x_1, x_2, \ldots, x_n$. Let $u_i \ge 0$, for
i = 1, 2, ..., m, be specified values. Suppose that $\bar{x}_1, \bar{x}_2, \ldots, \bar{x}_n$ is an optimal solution
to

maximize $c(x) - \sum_{i=1}^{m} u_i a_i(x)$,

where x must be in R. Show that $\bar{x}_1, \bar{x}_2, \ldots, \bar{x}_n$ is also an optimal solution to

maximize $c(x)$

subject to

$a_i(x) - a_i(\bar{x}) \le 0$ for i = 1, 2, ..., m,

where x must be in R.


74 Consider the problem of maximizing a nonlinear function c(x) subject to linear
constraints, as displayed by (1) and (2) in Sec. 15.1. Alter the formulation to include
constraints that are equalities (that is, an $s_i$ is not present) and to include variables
that are unrestricted in sign. Derive the Optimality Conditions and their implications
analogous to those in (9) through (14) of Sec. 15.7.

75 The Penalty Function Approach in Sec. 15.9 utilizes a particular form for the expression
that multiplies the parameter r, namely, the summation of terms $[1/A_i(x)]$.
Consider a more general formulation in which the expression that multiplies r is the
summation of terms $G_i[A_i(x)]$:

(a) Discuss the properties that the function $G_i$ should have to be appropriate for a
penalty function. Suggest one or more functions $G_i$ having these properties and
show the resultant expressions for $\partial c(x|r)/\partial x_j$. (Hint: consider, for example, the
logarithm.)

*(b) Devise a penalty function appropriate to letting r approach “minus infinity.”

*(c) Devise a penalty function that can encompass equality constraints $a_i(x) = 0$.

*76 Consider the Generalized Programming Algorithm in Sec. 15.10. Prove that the
optimal values for the objective function of the restricted master program increase
monotonically to the optimal value c for the objective function of the original problem.

77 Consider a nonlinear optimization model like that discussed in Sec. 15.5. The Constraint
Qualification postulated the existence of a point $x^0$ such that each $a_i(x^0) < 0$.

(a) Suppose you want to find a point z such that every $a_i(z) \le 0$. Discuss how to
obtain z. [Hint: start with an arbitrary point y that may satisfy some, but not all,
of the constraints. For example, suppose $a_i(y) > 0$. Suggest an optimization
problem, involving $a_i(x)$ in the objective function, that eventually produces a
point z such that $a_i(z) \le 0$, and that maintains feasibility in all the constraints
satisfied by y. Alternatively, suggest an approach that uses artificial variables.]

(b) Suggest one or more ways to alter your answer in part (a) so that you obtain a
point $x^0$ for which every $a_i(x^0) < 0$.


CHAPTER 16


Introduction to Stochastic Programming Models


16.1 IMPACT OF UNCERTAINTY 

All our previous examples have assumed that the data required by a model are 
known exactly. This was true of the linear models in Chap. 2, which postulated 
that unit profits, demand requirements, and supply availabilities were specified 
with perfect certainty. Similarly, the illustrations of dynamic phenomena in 
Chaps. 8 through 11, pertaining to inventory replenishment, production scheduling,
and equipment replacement, assumed there was no uncertainty about the
data needed by the models. But in real life, since you never know all these values
with perfect certainty, you may ask distrustfully, “Are the preceding deterministic
models and techniques really practical?” Rest assured that they are. But the more
important and harder question is, “When?” The discussion below will help you
learn when to apply what model.

For full comprehension, you must recognize three factors: 

• The true nature of uncertainty in a particular real situation. 

• The way a proposed mathematical model deals with such uncertainty. 

• The computational technique appropriate for solving this suggested mathematical
model, given the data for the particular situation.

Therefore, in applying operations research to a real managerial decision-making 
problem, you always should ascertain 

(i) What uncertainties must be faced, and how they may influence the selection of an
optimal decision.

(ii) Whether a given mathematical model takes suitable account of these uncertainties.

Choosing the most efficient computational approach for obtaining numerical
solutions is an important step in any application, but it can be left to the technical
specialist. You will have ample opportunity to become acquainted with such 
techniques in the chapters ahead. But as you study the material, do not become 
so engrossed in mathematical details that you lose sight of (i) and (ii), and thus 
miss the more significant aspects. 

Below are two simplified case studies that clarify the three factors cited above 
and indicate how you proceed in answering (i) and (ii) in actual situations. 

New plant expansion. The Yum-Yum Company, a food processing 
firm, is deciding whether to expand its present plant or build a new plant. The 
President believes that an important factor in this decision is his firm’s share of 
the market during the next ten years. Assume that Yum-Yum’s planning staff 
is fairly confident about the other pertinent data, and finds that the structure of 
the production and marketing processes can be mathematically represented by 
a linear model resembling those found in Chap. 2. The President wants to be sure 
that the economic analysis of the decision explicitly reflects the element of un¬ 
certainty about the company’s future market share. 

Bear in mind that the critical management decision is where to add productive 
capacity. The values of the other variables calculated by the model, such as the 
sales quantities of individual products, the average and peak levels of inventory, 
and the required amounts of raw food supplies, are assumed to be of less interest, 
although they are used in the analysis of the major decision. Consequently, it is 
quite reasonable for the economic study to proceed as follows. 

With a deterministic linear model, find the best expansion alternative for each 
of several possible and reasonable values for Yum-Yum’s future market share. 
If the results show that the optimal decision is insensitive to the market-share 
parameter, the linear model has adequately dealt with the impact of uncertainty. 
If, on the contrary, the decision proves to be highly sensitive to market share, 
then further study is required. In particular, for each major decision, evaluate 
the resultant profit for several possible market shares. This may indicate that 
although the optimal expansion decision is sensitive to market share, the actual 
profit level is not. If profit also varies significantly with market share, however, 
then it will be necessary for the analysis to include, in one way or another, a 
judgment of the relative likelihood (probability) of each possible market share. 
The company may choose to embark on further market research to narrow the 
range of this uncertainty prior to making a final expansion decision. 
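
The sensitivity test just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: a small lookup table stands in for repeated runs of the linear model, and the profit figures are the illustrative ones quoted for Yum-Yum in Sec. 16.2. The names `profit`, `best_decisions`, and `profit_range` are hypothetical.

```python
# Annual profit ($ millions) for each decision at each market share (%).
# Assumption: this table stands in for the output of the linear model,
# which would be re-solved once per market-share value.
profit = {
    "expand facility": {30: 90, 35: 100, 40: 130},
    "build new plant": {30: 50, 35: 100, 40: 150},
}
shares = [30, 35, 40]


def best_decisions(share):
    """Return the set of decisions that maximize profit at this share."""
    top = max(table[share] for table in profit.values())
    return {name for name, table in profit.items() if table[share] == top}


# Step 1: find the best alternative for each possible market share.
best_by_share = {s: best_decisions(s) for s in shares}

# Step 2: the decision is insensitive only if one alternative is best throughout.
robust = set.intersection(*best_by_share.values())
decision_is_sensitive = len(robust) == 0

# Step 3: if sensitive, look at how much profit itself moves with market share.
profit_range = {
    name: (min(table.values()), max(table.values()))
    for name, table in profit.items()
}

print(best_by_share)
print("decision sensitive to market share:", decision_is_sensitive)
print(profit_range)
```

With these figures no single alternative is best at every share, and profit also varies widely, so the analysis would have to go on to weigh the likelihood of each share.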

To sum up, during an expansion analysis, the firm learns the nature and
possible impact of uncertainty about market share. The problem is characterized
by a linear programming model, and the influence of market share is tested through 
sensitivity analysis, like that explained in Chap. 5. If these analyses demonstrate 
that profit is significantly affected by the market-share value, the President of 
Yum-Yum can judge the risk of each decision by estimating the likelihoods of the 
possible market-share values. Further, the company can assess in economic terms 
the precise importance of obtaining additional market information before making
a final decision. 

As to calculating specific answers in so complex a situation, Yum-Yum’s planning 
staff undoubtedly should employ a linear programming routine on a large-scale 
computer. 

New computer facility. Now we turn to the second case study, which 
on the surface resembles the first one. The Lead-Nickel Company, a major 
primary-metal producer, has several dozen plants scattered throughout the
United States. At its central headquarters the firm has four relatively small-scale
computers that are used 70% of the time to prepare standard corporate accounting
reports, and the rest of the time to perform calculations for special studies
conducted by staff groups, such as the Operations Research Department. Although
these special studies consume a significant fraction of time on an annual basis,
their demands for computer time vary considerably from day to day and frequently
occur in bunches. A number of small-scale computers are also located at
the different plants; these machines produce plant accounting reports about 50%
of the time, and the rest of the time service the needs of resident technical groups,
such as the Industrial Engineering staff. Here, too, there is considerable day-to- 
day variation in work load. 

The company Controller, who has corporate responsibility for all these computers,
has observed that at least four or five days a month, and sometimes up to ten
days, the computers seem overloaded both at headquarters and at the plants. 
Feeling sure this congestion results in costly and irritating delays in getting jobs 
processed, he wants to install, at corporate headquarters, either a medium-scale 
computer to replace only the four small ones, or a large-scale computer to replace, 
in addition, several of the plants’ machines. 

The Controller realizes that his decision must recognize certain intangible 
factors, such as the relative merits of decentralized operations, but he wants to 
weigh these considerations against the possible advantages of a larger and 
economically more efficient piece of computing equipment. Further, he wants 
the new configuration of computers to eliminate, or at least drastically reduce, the 
problem of overloading with all its ensuing delays and disruptions. 

Several years ago, the Controller anticipated he would someday want to make 
this computer equipment decision, and started collecting data on all the jobs 
processed by the company’s computers. Thus he knew, for example, how many 
hours of computer time were being spent at each location on processing payroll, 
keeping inventories, billing accounts receivable, etc. The computer manufacturers 
supplied him with the comparable amount of time required to process these same 
jobs on their medium- and large-scale computers. 

The Controller asked the Operations Research Department whether it could 
help him analyze the situation. In fact, he had a rough description of a linear 
model. He thought the variables would be the annual frequency with which 
Job Type i is placed on Computer Type j; the coefficients of these variables would
show the relative speed of each computer for each job; the constraints would 
indicate the total annual frequency for each Job Type i and the time available 
of each Computer Type j; and the objective function would contain the cost of 
processing Job Type i on Computer Type j. 

Do you think Lead-Nickel’s computer capacity expansion decision can be 
analyzed in the same way as Yum-Yum’s plant capacity question? Is the impact 
of uncertainty the same? To both questions, the correct answer is “no.” Here are 
the reasons why. 

In the Yum-Yum example, annualized figures appropriately measure profit, 
because the company’s market share, by definition, indicates the overall basis 
for the year’s entire operations. In Lead-Nickel’s situation, the same annualized
approach would obscure the very essence of the congestion problem: the day-to-day
variation in requirements for computer time. After all, under the current
computer configuration all jobs are processed eventually; the extra capacity is
needed to reduce the delays due to daily fluctuations. Thus, for an analytic model 
to be useful in this case, it must indicate the impact of the random elements in the 
operating environment. In other words, the approach must take daily phenomena 
into account to show how many jobs will be seriously delayed during a year’s 
operation. 
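
A small numerical sketch may make the point concrete. The figures below are entirely hypothetical (a ten-day window, a capacity of 10 machine-hours per day, and a made-up list of daily requests); they are chosen only to show that an averaged view can report ample capacity while a day-by-day view reveals delays.

```python
# Hypothetical data: machine-hours available per day and hours requested each day.
capacity_per_day = 10
daily_demand = [6, 7, 14, 15, 5, 6, 13, 4, 5, 5]

# Averaged ("annualized") view: load looks comfortably below capacity.
average_load = sum(daily_demand) / len(daily_demand)

# Day-by-day view: work that cannot be finished today is carried to tomorrow.
backlog = []
carried = 0
for hours in daily_demand:
    carried = max(0, carried + hours - capacity_per_day)
    backlog.append(carried)

congested_days = sum(1 for b in backlog if b > 0)

print("average daily load:", average_load, "of", capacity_per_day)
print("end-of-day backlog:", backlog)
print("days ending with unfinished work:", congested_days)
```

Every job is eventually processed, yet several days end with unfinished work; that delay is invisible to a model built only on totals or averages.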

You may wonder if a deterministic model could be devised to use annualized 
data and still handle the impact of daily uncertainty. Perhaps one could be put 
together, but you would probably find it unsatisfactory to work with. You could try 
to encompass the congestion factor in an ad hoc fashion by overstating the time 
required to perform Job Type i on Computer Type j. This approach, bolstered by
considerable sensitivity analysis as well as ingenuity, might be usable if there were 
only a very few types of jobs and computers. But, as you will find in studying the 
second half of this book, there are alternative probabilistic models that are far 
better for analyzing a capacity problem of this sort. These models will use the 
historical data indicating the day-to-day variations in requirements, and thereby 
will put into sharp focus the elements of uncertainty contained in this kind of 
problem. 

With regard to numerical techniques for solving such probabilistic models, 
this topic, of course, anticipates what is ahead, and therefore cannot be pursued 
here in any detail. Suffice it to say that, depending on the particular mathematical 
statement of the model, you actually may be able to use linear or nonlinear 
programming algorithms, or a dynamic programming approach, or, if these are 
not appropriate, what is called computer simulation. As you continue reading this 
book you will therefore find that the computational approaches in the previous 
chapters are also helpful techniques for solving many stochastic problems. In 
addition, you will discover some new techniques that are especially suited to 
models containing elements of uncertainty. 

Introducing probabilities. The cases above referred to two approaches
for dealing with uncertainty: testing solution sensitivity in deterministic models,
and designing models containing probabilistic elements. The underlying theme
for the rest of the book is how to introduce probability analysis into optimization
models. Now uncertainties will be viewed as imperfect predictions to be
characterized by probability distributions for the different possible events.
Frequently, the resultant models will merely be more complicated versions of
deterministic models and can be solved by the same numerical techniques. But
this will not always be so. Sometimes you will be justified in substituting a single
expected value in the deterministic version; but more often, in order to calculate
solutions you will have to exploit the form of an optimal strategy, if this is not too
difficult to find, or otherwise you will have to use an arbitrary, but reasonable,
form.

No matter what, you will see that probabilistic models are inherently harder to
use than deterministic versions. First, there are new conceptual difficulties, such
as the interpretation of the probabilities themselves and the meaning of optimality;
these are discussed in Sec. 16.2. Second, there are new technical difficulties
relating to the mathematics of optimization. To illustrate, even when a
stochastic model is a straightforward generalization of a deterministic version,
the computational burden increases, since you must consider each possible event
instead of only a single estimate. Further, in stochastic models the criterion functions
are typically nonlinear, so that the task of optimization is more complex.
And third, there are increased data requirements for the specification of the 
probability distributions. For example, a manager may see that the price of his 
competitor’s product fluctuates, but he may find it difficult to state a meaningful 
probability distribution for this variation. 

Thus, aside from the intellectual curiosity engendered by stochastic phenomena,
there would be limited interest in these probabilistic models were it not
for their practical necessity in certain real decision-making situations. 

The art of management science once more. Here we return to 
the question, “How do you select the right model for a particular application?” 
Regrettably, no textbook can furnish you with an infallible set of rules—you will 
have to rely on experience, mature judgment, and continuing analysis. 

In real applications, of course, you can often get expert advice from other 
people based on their own past experience and mature judgment. There is some 
solace in that thought. But if you have managerial responsibility, then never 
forget that you—not your advisors—must bear the ultimate responsibility for 
your decisions, good or bad. Consequently, if you want to make effective 
managerial use of the powerful analytic tools in this book, there is no good substitute
for educating your own judgment. To assist you in this task, we have
selected many of the examples in the remainder of this book to be stochastic
versions of the deterministic models in earlier chapters. You should strive to see
how uncertainty affects a decision-making problem by studying how this factor 
is reflected in each analytic model. Then, when faced with a real decision-making 
problem in an environment of uncertainty, you can more confidently ferret out 
the essentials you want reflected by an analytic model, and so choose an effective
approach. 

Study guide. The remainder of this chapter and the next two will show 
you how to generalize many of the examples in earlier chapters to include random 
elements. At the same time, you will see a few important general propositions 
about optimal solutions to stochastic models. Finally, the material will demonstrate 
the applicability of techniques such as linear and dynamic programming for 
finding numerical solutions. These three chapters, then, link the first part of the 
book with what is to come. It is important for you to realize, however, that many 
of the models in these introductory chapters are really too general. They do not 
sufficiently exploit a model’s special structure or take advantage of the form of an 
optimal solution. Consequently, they often obscure the fundamental nature of an 
optimal solution and are computationally inefficient. For these reasons, you will 
want to study the models in subsequent chapters, where special structure is 
exploited. 

One important group of probability models deals with inventory stockage 
problems in which you cannot assume that the demand requirements are known 
with certainty. Chapter 19 and Appendix II explain several of these models, 
including a few approaches that have proved eminently successful in practice. 
Another useful class of probability models treats what are sometimes called waiting 
line, queuing, or congestion problems. The illustration of Lead-Nickel’s computer 
capacity expansion problem can be put in this category. If the structure of a 
waiting line environment is not too complex, the operating characteristics of the 
system—such as the average line length, the average waiting time, and the 
probability of the line’s being empty—can be found by the analytic methods 
presented in Chap. 20 and Appendix III. Furthermore, the results in these chapters 
indicate the insights to be gained in the analysis of typical complex congestion 
problems. Such queuing phenomena as well as other intricate probabilistic 
environments may be too complicated to yield to standard mathematical analysis, 
and consequently may have to be studied through computer simulation. Chapter 
21 surveys many of the concepts that are fundamental to an understanding of 
computer simulation and also presents several techniques that are effective in
building such models.

As you continue your reading, you will find it helpful to answer two questions
about each stochastic model:

1. What is an optimal policy for a deterministic version of the model?

2. How much information about the probability distributions is required to
state an optimal solution?

By considering the first question, you will better perceive the influence of the
elements of uncertainty. In a few instances the deterministic version will have a 
trivial optimal solution, so you can see exactly how uncertainty complicates the 
decision problem. In every case, the computation of an optimal solution for the 
deterministic version will be simpler, so you can come to appreciate the degree of 
additional complexity arising from introducing probabilities. The importance 
of keeping the second question in mind will become quite clear when you study 
the so-called chance-constrained approach in Sec. 16.5, for example, which 
requires only the fractiles of the probability distributions to calculate an optimal 
policy. Conscientious effort to answer these two questions for each example will 
greatly assist you in gaining the fundamental insights to be had from stochastic 
models. 

16.2 DECISION-MAKING IN AN UNCERTAIN ENVIRONMENT

Before launching into an extended discussion of specific models and examples, 
we state in a few sentences what new complications are introduced when you 
seek an optimal decision in an uncertain environment. We assume that you 
are now fairly well acquainted with model building and optimizing techniques 
for deterministic problems. In particular, you understand that when you solve 
linear and dynamic programming problems, like the ones treated in the first half 
of this book, you know exactly what your decisions will be and precisely what 
future effect they will have. For example, in a deterministic planning model, if
you decide to process 10 units of raw material, then you know exactly how many 
items will be produced for sale. Or, in a deterministic inventory model, if you 
choose to purchase five dozen units for the next six months and two dozen units 
for the following six months, then as a result, you can forecast precisely what the 
inventory levels will be for the 12-month horizon. 

But suppose instead that if you decide to process 10 units of raw material, the 
yield may be any one of several possible numbers of items. Or if you purchase 
dozens of units as specified above, then the future inventory levels depend on the 
amounts of demand that actually materialize. In other words, consider the 
impact on your immediate and future decision choices if you cannot know for 
certain what will happen as a result of your actions. 

When uncertainties do impinge on your choice, you first must resolve the 
question of what is an optimal decision. We elaborate on the question in this 
section. 

Second, if some decisions have to be made now but others can be deferred 
until several of the uncertainties disappear, you also must consider the possibility 
of employing a contingency plan or strategy. Returning again to the 
purchasing example above, you may want to decide to order five dozen units 
this period, and then devise a “set of ordering instructions” to be used over the
remaining periods. These rules would state how much to purchase given the
observed demands and the orders in previous periods. Thus, you do not know 
exactly how much will be ordered after this period, but only what you will 
purchase conditional on the actual ordering and demand history when the future 
purchase decisions must be made. In the paragraphs below, we will elaborate 
further on the notion of a decision strategy. 

Expected value. If you have had an opportunity to apply any of the 
models in the first half of this book to an actual planning situation, you will have 
discovered that stating a single objective function to be optimized is merely a 
convenient way to select a solution when there are a myriad of feasible possibilities.
Executives experienced in applying operations research know quite well
that such a solution is rarely optimal in any absolute, “all things considered” 
sense. In fact, almost always such a solution is manually adjusted to provide a 
better fit to reality, or the model specifications themselves are altered and a new 
“optimal” solution is found. 

In solving decision models containing uncertainties, we continue in the same 
spirit and state only a single objective function to be optimized. But we caution 
here at the outset that in an actual application, you should investigate various 
operating characteristics of a solution that is selected by a single optimization 
criterion. You may have to adjust this solution or modify the model before being 
able to successfully implement the result. 

For most of the models you will study, uncertainty will affect the actual or 
realized value of the economic criterion. We will consistently employ the expected 
value of the economic criterion as the objective function to be optimized. In 
recent years, scholars of decision-theory have devised several ways of justifying 
the use of the expected value as the sole criterion for an optimal choice. Many 
recent texts in statistical analysis present these arguments. As fascinating as the 
subject is, we do not pursue the topic here, because regardless of whether or not 
you find these justifications convincing, we still are going to use expected value 
as the objective function in the optimization models. (We do, however, temper 
this intransigent attitude by showing in several chapters how you can calculate 
relevant operating characteristics of a selected policy.) 

With the exception of the advanced material, we employ only elementary 
notions of probability theory, like the expected value and the cumulative probability
function. Frequently, we use discrete probability distributions, so that you
can calculate answers without resorting to the calculus. Sometimes we provide 
you with the end results of applying the calculus; although the formula may look 
complicated, you will have no trouble in computing answers when you are given 
specific values for the variables. 

But if you are like most students (even if you have had a full-term course in
probability theory), you will find some of the mathematical expressions new,
and therefore strange looking. The reason is that most probability textbooks do 
not concentrate on decision-making or optimization problems. Consequently, 
the mean and variance of a probability distribution may be the most complicated
expected values you have ever derived up to now. 

Calculating the types of expected values that are used in this text is really not 
much more complicated than finding the mean of a probability distribution. To
demonstrate this claim, we review the ideas below to familiarize you with the 
process for when you read the subsequent examples. 

Suppose X is a random variable that can have any of the values n = 0, 1,
2, ..., N. Let P[X = n] designate the probability that X takes the value n. Then
the expected value of the random variable X is

(1)  $E[X] = \sum_{n=0}^{N} n \cdot P[X = n]$  (expected value of $X$).

If X can equal any nonnegative integer with positive probability, then we will use
the symbol ∞ instead of N in (1); in this case (as well as for all summations over
an infinite number of terms), we postulate that the expected value is a finite
number.

Next, suppose we want to compute the expected value of the random variable
$X^2$. This is simply

(2)  $E[X^2] = \sum_{n=0}^{N} n^2 \cdot P[X = n]$  (expected value of $X^2$).


In general, if we wish to calculate the expected value of a function of X, which
we denote as f(X), then we will compute

(3)  $E[f(X)] = \sum_{n=0}^{N} f(n) \cdot P[X = n]$  [expected value of $f(X)$].


Hopefully, you remember this much from your previous training in probability 
theory. 
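
As a quick check of (1) through (3), the short Python sketch below evaluates the sum in (3) for an arbitrary function f. The three-point distribution is hypothetical and chosen only for easy arithmetic; exact fractions are used so that no rounding enters.

```python
from fractions import Fraction


def expected_value(pmf, f=lambda n: n):
    """Formula (3): sum of f(n) * P[X = n] over all possible values n."""
    return sum(f(n) * prob for n, prob in pmf.items())


# Hypothetical distribution: P[X=0] = 1/4, P[X=1] = 1/2, P[X=2] = 1/4.
pmf = {0: Fraction(1, 4), 1: Fraction(1, 2), 2: Fraction(1, 4)}

mean = expected_value(pmf)                            # formula (1)
second_moment = expected_value(pmf, lambda n: n**2)   # formula (2)
variance = second_moment - mean**2                    # E[X^2] - (E[X])^2

print(mean, second_moment, variance)
```

The same helper accepts any cost or profit function in place of `f`, which is all that the economic examples in this chapter require.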

Now let us apply (3) to an example containing an economic criterion function. 
A simple, but instructive, illustration is the expected cost associated with an 
inventory stockage rule for a one-period horizon. Let D denote a random variable
representing the amount of customer demand in a single period. Assume the
possible values for D are d = 0, 1, 2, ..., N, and the associated probabilities are
P[D = d]. Suppose that you purchase items to stock for customer demand,
where each item costs c dollars. If any of these items are left over at the end of the
period, you incur a holding cost of h per unit. (Assume there is no salvage value
for items left over.) But if demand D exceeds the amount you order, then you
incur a penalty cost p per item short. Thus, the actual cost in a period depends
on not only how much you order but also what value of demand really occurs.
Since you have to order before knowing demand D, it is reasonable to assess the
effect of your decision in terms of expected cost. 

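Specifically, let f(d|y) represent the actual cost when demand D = d, given
that you order y. Then

(4)  $f(d|y) = \begin{cases} cy + h \cdot (y - d) & \text{if } d \le y \text{ (demand does not exceed amount ordered)} \\ cy + p \cdot (d - y) & \text{if } d > y \text{ (demand exceeds amount ordered).} \end{cases}$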

Hence the expected cost, given that you order y (≤ N), is

(5)  $E[f(D|y)] = \sum_{d=0}^{N} f(d|y) \cdot P[D = d]$

(6)  $\quad = \sum_{d=0}^{y} f(d|y) \cdot P[D = d] + \sum_{d=y+1}^{N} f(d|y) \cdot P[D = d]$  (if $y < N$)

(7)  $\quad = \sum_{d=0}^{y} [cy + h \cdot (y - d)] \cdot P[D = d] + \sum_{d=y+1}^{N} [cy + p \cdot (d - y)] \cdot P[D = d].$

The equation in (5) is simply the definition of expected value; the equation (6) 
follows by breaking the summation in (5) into two parts, one for demand less 
than the amount ordered, and one for demand exceeding the amount ordered; 
the third equality (7) is simply a substitution of the formulas in (4) for f{d\y). 

You can carry the algebra further. Notice in (7) that the term cy appears inside
both summations. Therefore, the amount cy is multiplied by the probability 
P[D = d] for every possible value d; since the sum of the probabilities P[D = d] 
over every d equals 1, you can write 

(8)  $E[f(D|y)] = cy + \sum_{d=0}^{y} h \cdot (y - d) \cdot P[D = d] + \sum_{d=y+1}^{N} p \cdot (d - y) \cdot P[D = d]$
(expected cost, given $y$).

The formula in (8) has an important direct interpretation: given that you stock
y, the expected cost equals the sum of the purchase cost, the expected holding
cost, and the expected penalty cost.
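
To see that the definition of expected cost and the component form (8) agree, you can evaluate both numerically. The sketch below does this in Python with exact fractions. The cost values (c = 2, h = 1, p = 6) and the three-point demand distribution are hypothetical, chosen only to illustrate the calculation.

```python
from fractions import Fraction

# Hypothetical data: unit purchase cost c, holding cost h, penalty cost p.
c, h, p = 2, 1, 6
demand_pmf = {0: Fraction(1, 4), 1: Fraction(1, 4), 2: Fraction(1, 2)}


def actual_cost(d, y):
    """Formula (4): realized cost when demand is d and y items were ordered."""
    if d <= y:
        return c * y + h * (y - d)
    return c * y + p * (d - y)


def expected_cost_by_definition(y):
    """Formula (5): average the realized cost over every possible demand."""
    return sum(actual_cost(d, y) * prob for d, prob in demand_pmf.items())


def expected_cost_by_components(y):
    """Formula (8): purchase cost + expected holding + expected penalty."""
    purchase = c * y
    holding = sum(h * (y - d) * prob for d, prob in demand_pmf.items() if d <= y)
    penalty = sum(p * (d - y) * prob for d, prob in demand_pmf.items() if d > y)
    return purchase + holding + penalty


for y in range(3):
    print(y, expected_cost_by_definition(y), expected_cost_by_components(y))
```

For every order quantity the two functions return the same value, which is exactly the content of the step from (5) to (8).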

We have derived (8) in a straightforward fashion using only “first principles” 
of expectations. But in building decision-making models such as this, it is far more 
convenient to start immediately with the statement in (8). Consequently, we will 
adopt the practice of initially writing the objective function as a sum of the 
component expected costs (and revenues). When you are in doubt as to whether 
the objective function has been written correctly, then make an analysis parallel
to the one above. Start with the value of the economic criterion exhibited as a 
function of the random variables, for given levels of the decision variables. Then 
take the expected value of this function over the possible values for the random 
variables, and simplify the results. 

Now we analyze a specific decision problem using an expected value criterion 
to see the impact of uncertainty on making an optimal choice. 

Optimal decisions. Consider once again the plant expansion problem of 
the Yum-Yum Company discussed in the previous section. The two alternatives 
are to expand the present facility or build a new plant. For either decision, the
resultant profit depends on the firm’s future share of the market. Suppose the
President assesses the probability of his firm’s maintaining its present 35% market
share as 1/2, and the probabilities of 30% and 40% shares as 1/8 and 3/8, respectively.
The corresponding annual profit values are shown in Fig. 16.1.

                  President’s
Share of          Assessed            Annual Profit ($ Millions)
Market            Probability      Expand Facility      New Plant
                  of Share

30%                  1/8                 90                 50
35%                  1/2                100                100
40%                  3/8                130                150

Figure 16.1. Yum-Yum Company.

If you were this company’s president, how would you decide between the two 
alternatives? How calamitous would it be if you built a new plant and the market
share then fell to 30%, so that the company made only $50 million instead of $90
million? How embarrassing would it be if you expanded the facility and then the
market share increased to 40%, so that the firm made $130 million but could 
otherwise have made $150 million? 

In situations like this, when there are only a few alternative actions, it is feasible 
to calculate the probability distribution of the possible objective-function values 
and then examine the relative merits of each action. But in most problems we will 
at least initially recommend an action that optimizes the expected value of the 
economic criterion. For the two alternatives in Fig. 16.1, verify that:

(9)  Expand facility: expected profit = $90(\tfrac{1}{8}) + 100(\tfrac{1}{2}) + 130(\tfrac{3}{8}) = 110$
     Build new plant: expected profit = $50(\tfrac{1}{8}) + 100(\tfrac{1}{2}) + 150(\tfrac{3}{8}) = 112.5$

In terms of expected profit, building a new plant is slightly better. 

You well may conclude that the small difference between the expected values 
in (9) does not really convey the true qualitative difference between the probable 
economic effects of the two decisions. If that is your feeling, then let this example 
serve as a lesson. Whenever you employ an objective function that for the purpose 
of comparison reduces an entire probability distribution of economic outcomes to 
a single number, at least examine other characteristics of the solution to test the 
reasonableness of the recommendation. 
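
The comparison can be reproduced, and extended beyond the single expected-value number, with a short Python sketch. It assumes the probabilities 1/8, 1/2, and 3/8 for the 30%, 35%, and 40% shares, which are the values that reproduce the expected profits of 110 and 112.5 stated above; the function names are hypothetical.

```python
from fractions import Fraction

# Assumed probabilities of each market share (%), consistent with (9).
prob = {30: Fraction(1, 8), 35: Fraction(1, 2), 40: Fraction(3, 8)}

# Annual profit ($ millions) by decision and market share.
profit = {
    "expand facility": {30: 90, 35: 100, 40: 130},
    "build new plant": {30: 50, 35: 100, 40: 150},
}


def expected_profit(decision):
    """Probability-weighted average profit for one decision."""
    return sum(prob[s] * profit[decision][s] for s in prob)


def prob_profit_below(decision, level):
    """Chance that the realized profit falls short of a given level."""
    return sum(p for s, p in prob.items() if profit[decision][s] < level)


for decision in profit:
    print(
        decision,
        "expected:", float(expected_profit(decision)),
        "worst case:", min(profit[decision].values()),
        "P[profit < 90]:", prob_profit_below(decision, 90),
    )
```

The expected values differ by only 2.5, but the worst cases differ by 40, and only the new plant carries any chance of a profit below $90 million. These are the kinds of additional characteristics worth examining before accepting the recommendation.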

Uncertain actions. In the various dynamic deterministic models of 
Chaps. 8 through 12, you can predict with perfect certainty the precise sequence 
of decisions that is implied by any given policy. For example, in a production 
scheduling model, where demands are known exactly over the horizon, you can 
trace the precise production levels period by period. But if, instead, the demands 
can be described only by means of a probability distribution, then ordinarily 
you cannot say for certain what the production levels are beyond the current 
decision. 

Suppose in this production example that you employ a dynamic policy that 
specifies the amount to be produced for each level of entering inventory at any 
period. Then such a policy actually induces a probability distribution on the 
future production decisions. 



To illustrate this critical observation, let i = 0, 1, 2, 3 be the possible levels of
entering inventory at any period. Assume the horizon is unbounded and the
dynamic policy is stationary. Let $x_\infty(i)$ be the current production decision given
entering inventory i, where

(10)  $x_\infty(0) = x_\infty(1) = x_\infty(2) = 3, \qquad x_\infty(3) = 0.$

Hence, each period you produce 3 unless entering inventory equals 3. Suppose 
in any period the amount demanded is described by the probability distribution 

(11)  $D_t = 2$ with probability $\tfrac{1}{2}$, $\qquad D_t = 3$ with probability $\tfrac{1}{2}$.

If entering inventory in the current period is 2, then current production is 3, and
production in the next period will be either 0, with probability 1/2 (if the current
demand is 2), or 3, with probability 1/2 (if the current demand is 3). Explain why.

Continuing the analysis in the same fashion, production in the period after next
will be either 0, with probability 1/4 (if the current and next period’s demands are
3 and 2, respectively), or 3, with probability (1 − 1/4 = 3/4). Again, explain why.
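
One way to check your explanation is to enumerate every possible demand sequence. The Python sketch below does so, assuming the stationary rule (produce 3 unless entering inventory is 3) and demands of 2 or 3 with probability 1/2 each; the function name `production_distribution` is hypothetical.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Stationary policy: production as a function of entering inventory.
policy = {0: 3, 1: 3, 2: 3, 3: 0}
# Assumed demand distribution: 2 or 3, each with probability 1/2.
demand_pmf = {2: Fraction(1, 2), 3: Fraction(1, 2)}


def production_distribution(start_inventory, periods_ahead):
    """Distribution of the production decision `periods_ahead` periods from now."""
    dist = defaultdict(Fraction)
    for demands in product(demand_pmf, repeat=periods_ahead):
        inventory, prob = start_inventory, Fraction(1)
        for d in demands:
            # Next entering inventory = inventory + production - demand.
            inventory = inventory + policy[inventory] - d
            prob *= demand_pmf[d]
        dist[policy[inventory]] += prob
    return dict(dist)


print(production_distribution(2, 1))  # next period
print(production_distribution(2, 2))  # period after next
```

Although the rule itself is deterministic, the production level one or two periods ahead is a random quantity whose distribution the enumeration makes explicit.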

This line of reasoning leads to the important insight that even though the 
component decisions implied by the stationary strategy (10) are deterministic,
they actually will occur over time in a random fashion because of the uncertain 
elements (11) in the structure of the model. As a result, the associated costs over 
the horizon are described by a complex joint probability distribution. 

In most dynamic models containing probabilistic elements, you will need to 
treat future decisions so as to reflect their future uncertainty. With this in mind, 
you should always carefully examine any such model for the specific assumptions
as to what information about previous values of the random variables is available 
each time a decision is to be made. In constructing a mathematical model it is 
easy to make an error in describing what is typically an intertwined time sequence 
of random events and the subsequent decisions that are based, in part, on the 
actual outcome of the earlier events. The examples below and in Chap. 17 will 
show you how to approach these time-staged decision problems. 

Another notion arising from induced randomness in an unfolding sequence of 
decisions is the idea of steady-state behavior. As an illustration, look again at the 
production example, (10) and (11). Let $p_i$ denote the proportion of periods that
entering inventory is at level i, when the system operates over an unbounded
horizon. It can be shown for the strategy (10) and the demand distribution (11)
that

(12)  $p_0 = \tfrac{1}{6}, \quad p_1 = \tfrac{1}{3}, \quad p_2 = \tfrac{1}{3}, \quad p_3 = \tfrac{1}{6}.$

Further, let $q_x$ denote the proportion of periods during which production is at
level x when the system operates over an unbounded horizon. Then as you should
verify, from (10) and (12) it follows that

(13)  $q_0 = p_3 = \tfrac{1}{6}, \qquad q_3 = p_0 + p_1 + p_2 = \tfrac{5}{6}.$
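
The verification can also be done mechanically. The sketch below, in Python with exact fractions, assumes the inventory proportions 1/6, 1/3, 1/3, 1/6 and demands of 2 or 3 with probability 1/2 each. It checks that these proportions are unchanged after one more period of operation, which is the defining property of steady state, and then derives the production proportions from them.

```python
from fractions import Fraction as F

policy = {0: 3, 1: 3, 2: 3, 3: 0}           # production rule, as in (10)
demand_pmf = {2: F(1, 2), 3: F(1, 2)}       # assumed demand distribution
p = {0: F(1, 6), 1: F(1, 3), 2: F(1, 3), 3: F(1, 6)}  # candidate steady state


def one_period_later(dist):
    """Distribution of entering inventory after one more period."""
    nxt = {i: F(0) for i in policy}
    for i, mass in dist.items():
        for d, prob in demand_pmf.items():
            nxt[i + policy[i] - d] += mass * prob
    return nxt


# Steady state means the distribution reproduces itself.
is_stationary = one_period_later(p) == p

# Production proportions: add up the inventory levels that trigger each amount.
q = {}
for i, mass in p.items():
    q[policy[i]] = q.get(policy[i], F(0)) + mass

print("stationary:", is_stationary)
print("production proportions:", q)
```

This only confirms a proposed answer; finding the steady-state proportions from scratch requires solving the corresponding balance equations.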

► Occasionally, the optimal policy for a model calls for the decision-maker himself to
introduce randomness as part of his strategy. Of course, issue can be taken about the 
particular structural details in models giving rise to this type of solution, but there 
hardly is reason to object to such randomized strategies per se, since, as you 
have just seen, there already is randomness in the actions induced by the uncertain 
elements. 

The following hypothetical example shows how a randomized strategy can be optimal.
The manager of the Toute de Suite Bakery has found that the daily demand for one of his
specialty cakes is described by the probability distribution

(i)  $P[\text{no cakes}] = \tfrac{1}{6}, \qquad P[\text{one cake}] = \tfrac{1}{6}, \qquad P[\text{two cakes}] = \tfrac{2}{3}.$

Assume that the demand on any day is independent of demands on previous days. 
Each cake costs c to bake and must be thrown away if it is not sold at the end of the day. 
The manager does not want to make too many cakes as c is large; but still he wants to 
satisfy a reasonable level of demand. Accordingly, he formulates his decision problem as: 
minimize expected baking cost subject to the constraint that the probability of meeting 
the total daily demand is at least 

It might appear that the optimal solution is to bake one cake at a cost of Ir, for then 
total demand is met with probability (i + -J = 3 ). However, suppose the manager 
uses the following randomized decision rule: bake no cakes with probability 
5 and bake two cakes with probability 5 . He then still meets total demand with proba- 
bility [(1)-^ 4 - (^)l = ^], and what is more, his expected cost is [(4)0^ 4- (t)2c = Ac], 
which is less than the cost of Ic for the nonrandomized strategy. This randomized rule 
is optimal. 

Suppose that, instead of imposing a probability constraint, he formulates his decision 
problem as: minimize baking cost subject to satisfying, on the average, at least j of 
expected daily demand. Verify from (i) that expected daily demand is so that the 
constraint is to sell at least J a cake a day, on the average. Once again, if he uses a 
deterministic strategy of baking one cake, on the average he sells [(i )0 4 - (i)! + ( 1)1 
= |] cakes per day, which is in excess of the minimal constraint, His optimal strategy 
is to randomize: bake no cakes with probability 5 and bake one cake with probability 
§. Then he sells [(|)0 4 - (|)| = j] cakes per day, on the average, and the expected cost 
is [(|)0r 4- ( 5 ) 1 ^^ = .6<;], which is less than Ic, 

In general, whenever an optimization model contains constraints on the probabilities 
of events or on their expected values, an optimal solution may indicate randomization. ◄ 
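
The arithmetic of the bakery example can be checked mechanically. The following is only a sketch: it assumes the demand probabilities 1/6, 1/6, and 2/3 for zero, one, and two cakes, measures cost in units of c, and uses hypothetical helper names (p_meet, expected_sales, evaluate) that do not appear in the text. 

```python
from fractions import Fraction as F

# Assumed demand distribution (i): probability of 0, 1, or 2 cakes demanded.
demand = {0: F(1, 6), 1: F(1, 6), 2: F(2, 3)}

def p_meet(bake):
    """Probability that `bake` cakes cover the whole day's demand."""
    return sum(p for d, p in demand.items() if d <= bake)

def expected_sales(bake):
    """Expected number of cakes sold when `bake` cakes are made."""
    return sum(p * min(d, bake) for d, p in demand.items())

def evaluate(rule):
    """`rule` maps a bake quantity to the probability of choosing it.

    Returns (expected cost in units of c, P[demand met], expected sales).
    """
    cost = sum(w * b for b, w in rule.items())
    meet = sum(w * p_meet(b) for b, w in rule.items())
    sales = sum(w * expected_sales(b) for b, w in rule.items())
    return cost, meet, sales

bake_one = {1: F(1)}                  # deterministic rule
mix_0_2 = {0: F(4, 5), 2: F(1, 5)}    # rule for the probability constraint
mix_0_1 = {0: F(2, 5), 1: F(3, 5)}    # rule for the expected-sales constraint

for name, rule in [("bake one", bake_one),
                   ("mix 0/2", mix_0_2),
                   ("mix 0/1", mix_0_1)]:
    print(name, evaluate(rule))
```

Exact fractions are used so that the constraint levels are met with equality rather than to within rounding error. 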

Multiperiod objective function. In a dynamic model, where you 
have to evaluate a stream of returns, each component of which is random, the 
objective function will be the expected value analogue to the approaches in the 
first half of this book. 

To illustrate, let $R_t$ designate the return in Period $t$ of a dynamic model. As you 
have seen in the paragraphs above, when there are uncertain events, a decision 
strategy followed over the planning horizon gives rise to a joint probability 
distribution on the elements of the returns stream $(R_1, R_2, R_3, \ldots)$. Suppose we 
postulate that the objective function is the expected present value of the stream: 

(14)    $E[\text{present value}] = E[R_1 + \alpha R_2 + \alpha^2 R_3 + \cdots]$, 

where $\alpha$ is the one-period discount factor, and $0 \le \alpha < 1$. We assume that the 
value in (14) is finite. 

The expected value in (14) is taken over the joint probability distribution of 
$(R_1, R_2, R_3, \ldots)$. We can simplify the expression by employing a Fundamental 
Theorem of Random Variables: the expected value of a sum always equals the sum of the 
expected values. Consequently 

(15)    $E[\text{present value}] = E[R_1] + \alpha E[R_2] + \alpha^2 E[R_3] + \cdots$, 

where we have brought each discount-factor coefficient in front of the associated 
expectation since the factor itself is a constant. In (15), each expectation is taken 
over the marginal probability distribution of the displayed random variable. 
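
The step from the joint expectation to the sum of marginal expectations holds even when the returns depend on one another. The sketch below checks this on a made-up three-period example; the discount factor, the joint distribution, and the truncation to three periods are all assumptions chosen for illustration. 

```python
from fractions import Fraction as F

alpha = F(9, 10)  # hypothetical one-period discount factor

# Hypothetical joint distribution over (R1, R2, R3). The returns are
# deliberately dependent: a poor first period goes with poor later periods.
joint = {
    (10, 10, 10): F(1, 2),
    (10, 0, 5): F(1, 4),
    (0, 0, 0): F(1, 4),
}

# Expectation of the discounted sum, taken over the joint distribution.
lhs = sum(
    p * sum(alpha**t * r for t, r in enumerate(returns))
    for returns, p in joint.items()
)

# Discounted sum of expectations, each taken over a marginal distribution.
marginal_means = [
    sum(p * returns[t] for returns, p in joint.items()) for t in range(3)
]
rhs = sum(alpha**t * m for t, m in enumerate(marginal_means))

print(lhs, rhs, lhs == rhs)
```

Only the three marginal means are needed on the right-hand side, which is the practical payoff of the simplification. 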

For a second illustration, consider again the production example in (10) and 
(11). In this case, suppose the objective function is the long-run expected cost per 
period. You saw in (12) the exact proportion of periods that entering inventory 
is at level $i$, where the system is viewed over an unbounded horizon. Similarly, 
you saw in (13) the exact proportion of periods during which production is at 
level $x$. Denote the stationary production and inventory costs each period as $C(x)$ 
and $h(i)$, respectively. Then you can compute the value of the objective function 
for the stationary policy in (10) as 

(16)    expected cost per period over an unbounded horizon 
            $= E[C(x)] + E[h(i)]$ 
            $= \{[C(0)]\,q_0 + [C(3)]\,q_3\} + \{[h(0)]\,p_0 + [h(1)]\,p_1 + [h(2)]\,p_2 + [h(3)]\,p_3\}$, 

where the proportions $q_x$ and $p_i$ take the values given in (13) and (12), as you should verify. 


Specifying probability distributions. The stochastic models in 
this book assume that the decision-maker can state probability distributions to 
describe the elements of uncertainty in the model. Loosely put, the decision-maker 
must be able to assign nonnegative numerical weights to each possible event such 
that: (i) If an event is certain, its associated weight equals 1. (ii) If two events A 
and B are mutually exclusive, the weight of the event “either A or B” equals the 
sum of the weights for each of the events A and B. Since most elementary texts on 
probability theory discuss how to characterize a probability distribution in this 
way, we give no further elucidation here. What is more, for the purpose of this 
book, nothing else really has to be said, as the models to be treated merely apply 
the weights, employing the type of elementary probability calculations you find 
in an introductory text. Nevertheless, we do add a few comments on the interpreta¬ 
tion of the weights and how a decision-maker might go about assigning specific 
numerical values, so that you have a better appreciation of these models’ intended 
meaning. 

Over the years, probability theory scholars have suggested a variety of inter¬ 
pretations for the weights. The most widespread notion is the relative frequency 
interpretation. In optimization-oriented models, a more insightful view, however, 
is that probability assessments reflect the decision-maker’s state of mind. To 
illustrate what this means, recall the market-share probabilities in Fig. 16.1 that 
were specified by the President of the Yum-Yum Company. These weights are 
really a quantification of the President’s current judgment about the future, and 
guide him in making an immediate decision (on whether to expand an existing 
plant or build a new one). For another illustration, consider the Lead-Nickel 
Controller who was evaluating whether to install a new large-scale computer. 
The historical data he collected, showing congestion, are of decision-making 
relevance only if he feels that they embody his judgments about future conditions. 

Much has been written recently on the issues and techniques of deriving proba¬ 
bility weights that reflect the decision-maker’s personal prognosis. The topic is 
usually referred to as statistical decision theory, and sometimes by the more 
restricted term Bayesian analysis. Despite the subject’s importance and relevance, 
delving into the details here would only be too distracting from the main task of 
analyzing optimizing models. But as a pragmatic matter, essentially, you can 
utilize four approaches to obtain these probability distributions: 

1. Use introspection, 

2. Employ historical data, 

3. Find convenient approximations, 

4. State descriptive axioms. 

Most often, you would apply two or more of these approaches in combination. 

To illustrate these, consider the Peppy Pill Company, a wholesale jobber of 
pharmaceutical products, that wants to install a computerized scientific inventory 
control system for stocking about 1500 products. In particular, once a week the 
computer is to update the inventory status of each item, and if the stock is too 
low, to indicate that Peppy Pill should send a replenishment order to a manu¬ 
facturer. Since there is considerable weekly variation in customer demand for 
each item, the scientific replenishment rules are to be derived from probabilistic 
inventory models like those you will study in Chap. 19. These models require 
specifying the probability distributions of weekly demand for each item. 

To begin, suppose Peppy Pill has little or no historical demand data. This 
situation sometimes arises because of insufficient records, and frequently because 
an item to be stocked is a new product that has no demand history. Then Peppy’s 
inventory manager has no recourse other than to use Approach 1, applying what¬ 
ever experience he can bring to bear in quantifying his judgments. As the inventory 
system operates for a while and demand data are accumulated, he can then apply 
the numerical techniques of Bayesian analysis to update his probability assess¬ 
ments. 

Now assume, instead, that this company has customer demand data for 6 to 
18 months back, depending on whether an item has a fast or slow turnover. These 
quantities have been manually posted on makeshift record cards. Then Peppy 
Pill can combine Approaches 1 and 2 by summarizing between one and two 
years’ weekly demand data into an historical distribution for each item and by 
applying judgmental corrections to reflect the management’s future assessment of 
demand. You should realize, however, that in most actual situations, using his¬ 
torical data to compute empirical distributions is ordinarily not feasible: usually 
not enough data are available, and even when they are, the expense of computing 
so many individual demand distributions is prohibitive. 

Hence, Approach 3 is added as a modification to Approaches 1 and 2. To 
illustrate, the inventory manager may have calculated the empirical mean, and 
perhaps the variance of each item’s weekly demands, making whatever judgmental 
corrections he deems necessary. Then in a computerized stochastic inventory 
control model, he may employ, as an approximation, a Normal distribution having 
such a mean and variance. These values would be systematically updated by the 
computer as new demand experience accumulates. 
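
As an illustration of Approach 3, the sketch below fits a Normal distribution to an empirical mean and variance. The demand figures are invented, and the use of the fitted distribution to find a stock level covering a week's demand with probability 0.95 is an assumed application, not a rule taken from the text. 

```python
from statistics import NormalDist, mean, variance

# Hypothetical weekly demand history for one item (units per week).
weekly_demand = [12, 15, 9, 14, 11, 13, 10, 16, 12, 8]

m = mean(weekly_demand)
v = variance(weekly_demand)  # sample variance (divides by n - 1)

# Normal approximation having the empirical mean and variance.
approx = NormalDist(mu=m, sigma=v ** 0.5)

# Stock level that covers one week's demand with probability 0.95 under
# the approximation (the 0.95 target is an assumption for this sketch).
stock_level = approx.inv_cdf(0.95)

print(m, round(v, 2), round(stock_level, 1))
```

Updating the approximation as new demand experience accumulates amounts to recomputing `m` and `v` from the extended history. 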

Approach 4 is a more sophisticated version of Approach 3. The inventory 
manager first chooses a model describing the process by which demands are 
generated. For example, the model may include the total number of Peppy’s 
customers, the chance that any one of these will want an item during a week, the 
pattern of order sizes for a customer, etc. Obviously, this method is more com¬ 
plicated than Approach 3, but it can be effective when the resultant analysis 
provides an explicit form for the probability distribution of demand, such as a 
Poisson or a binomial distribution. Then the historical data and judgmental 
corrections are used to obtain the few parameters needed to describe the derived 
probability law. (For Peppy’s inventory problem, Approach 3 is likely to be better 
than Approach 4.) 

In conclusion, uncertainty influences stochastic models in two ways. The first 
way is the direct effect stemming from the random phenomena explicitly ac¬ 
counted for by the model. The second is the indirect impact coming from the 
process of specifying the probability weights to describe the phenomena. For the 
most part, the rest of this book concentrates only on the direct effect. 

►A lively area of research on stochastic models is testing the sensitivity of solutions to 
variations in the specification of the probability distributions. Several topics of interest 
are: how best to estimate the parameters of distributions used in optimization models; 
how to find good approximations; how to take explicit account in a model of the 
uncertainty about the probability distributions; and how to derive reliable models that 
need only a limited amount of historical data. ◄ 

Where do we go from here? Having established the groundwork for 
analyzing probabilistic decision models in the preceding pages, we can progress 
in earnest to some important insights and several useful applications. In the next 
section, we show how, in a complex dynamic decision process, you can go astray 
should you try to ignore randomness by using an expected value in lieu of the 
random variable itself. We dub this mistake the Fallacy of Averages. It implies that 
in a linear programming planning model, for example, you usually are not 
justified in simply substituting expected values for the problem’s coefficients when 
they really are random. The rest of the chapter explores what you can do in linear 
programming problems when some of the coefficients are characterized by 
probability distributions. 
In Sec. 16.4 we discuss one possible way to treat a fairly simple, so-called 
two-stage stochastic linear programming model. The model clearly illustrates some of 
the nasty difficulties that have to be resolved in order to make an optimal decision 
when faced with uncertainty. (The generalization to a multistage problem is 
explained in the advanced material of Sec. 16.7.) 

In Sec. 16.5 we present another possible way of solving the problem. The 
approach, named chance-constrained programming, is an alternative to the two-stage 
model. We compare the two approaches by means of an illustrative planning 
example. 

The advanced material in Sec. 16.6 provides a second example of both the 
two-stage and chance-constrained approaches. It is an interesting problem in its 
own right. The model is a standard transportation problem, like the one you 
studied in Chap. 6, except here the demands are random variables. In the final 
advanced section we explain a programming model for which it is appropriate 
to substitute an expected value wherever a random variable occurs. 

If you like to read suspense stories, then skip immediately to the next section— 
you will discover the villain soon enough. But if you like to be primed for what 
lies ahead, then we must prepare you for discouraging news. Only rarely in 
practice will you be able to easily modify a linear programming model so as to 
take account of uncertainty in the coefficients. True, the methods in this chapter 
are formally capable of solving such models. But for real problems, usually these 
techniques are not applicable. Actual decision problems are seldom only two- 
stage situations, and a full-blown treatment of these actual problems requires a 
programming model of staggering dimensions. 

Nevertheless, don’t lay down this book yet! There are some practical probability 
models that you can solve. We have saved these for the subsequent chapters, after 
you have come to appreciate what difficulties must be overcome. 

16.3 FALLACY OF AVERAGES 

The following case is a simplified example of an actual decision problem dealing 
with uncertain elements. It demonstrates how you can be dangerously misled by 
using average values in a model appropriate for a deterministic situation. 

The Galactic Reaper Company, which manufactures farm machinery, is 
planning to construct a new plant to build its latest equipment, a combine, for 
harvesting, threshing, and cleaning grain. Five major tasks must be completed 
in order to put the plant into full operation: 

A. Erect plant building. 

B. Complete final design of combine model. 

C. Expand nucleus labor force to full-scale production size. 

D. Install manufacturing equipment. 

E. Debug prototype models. 

Let $t_A, t_B, \ldots, t_E$ be the number of periods required for Tasks A, B, ..., E. 
A period consists of three months. Assume that these tasks must be performed in 
the sequence indicated by Fig. 16.2. For example, both Tasks A and B can be 
started immediately. Both Tasks C and D can be started as soon as Task A is 
completed. Task E can start when both Tasks B and D are finished. The plant is 
in full-scale production as soon as both Tasks C and E are completed. (The network 
diagram is identical to Fig. 6.16, p. 186, used to illustrate critical path scheduling. 
Although the explanation of the case here is complete, you may find it helpful to 
review Sec. 6.6.) 



Figure 16.2. Galactic Reaper Company Example. 


If the values of $t_A, \ldots, t_E$ were known with perfect certainty, then the total 
span of time required to put the plant into operation could be determined by 
finding the length of a longest route from Node 0 to Node 3 in the network of 
Fig. 16.2. Several of these values are uncertain, however. Relying in part on past 
experience in constructing plants, Galactic Reaper’s President has estimated the 
probability of each possible value, as shown in Fig. 16.3. Thus, it is equally likely 
that $t_B$ will be either 2, 3, or 4 periods (that is, 6, 9, or 12 months); similar statements 
hold for $t_C$ and $t_E$. The times for $t_A$ and $t_D$ are known exactly. The President 
believes that there is complete independence among the random events. 


Task    Possible Number of Periods    President’s Assessed Probability 
A       2                             perfect certainty 
B       2, 3, 4                       1/3 each 
C       2, 3, 4                       1/3 each 
D       1                             perfect certainty 
E       1, 2, 3                       1/3 each 

Figure 16.3. Galactic Reaper Company. Possible Time Requirements (One Period = 3 Months). 


Verify that when each random event takes its smallest value, the total time 
span is 4 periods; and when each takes its largest value, the span is 7 periods. 
Thus the range of completion times is between 12 and 21 months. Also check that 
if each random event takes its average value ($t_B = 3$, $t_C = 3$, $t_E = 2$), then the 
time span is 5 periods (15 months). 

The President of Galactic Reaper feels there is a significant competitive 
advantage to having the plant in operation as soon as possible. His estimates of the 
incremental profit impact for different completion times of less than 7 periods are 
shown in Fig. 16.4. Being concerned about the drop in profit that occurs if the 
completion time stretches out to 6 or 7 periods, he considers the possibility of hiring 
an experienced manager to act as special assistant to supervise the construction 
project. He believes that the extra attention such a man could provide would reduce 
the total time span by 1 period in any event. That is, even if the span would have 
been 4 periods, the manager would cut the time to 3 periods, and likewise for any 
other possible total span between 5 and 7 periods. The total cost of hiring this 
manager is $20,000. If the President knew for certain that the total span would be 
4 or 5 periods, then he would decide against adding the man, since the gain of 
$10,000 would not offset the additional cost of $20,000. The opposite would be true 
if the President knew for certain that the span would be 6 or 7 periods. Given the 
factual information in Figs. 16.2 through 16.4, would you hire the new manager if 
you were President of Galactic Reaper? 


Total Time Span (Periods)    Incremental Profit ($1000) 
3                            120 
4                            110 
5                            100 
6                            50 
7                            0 

Figure 16.4. Galactic Reaper Company. Competitive Advantage of Early Completion. 


Analysis. If you were to base your analysis on the total span of 5 periods, 
calculated by using the average time for each task, then you would not hire the 
special assistant. The reason is that a reduction from 5 to 4 periods implies 
the profit gain would be $10,000 (= $110,000 — $100,000) and this is less than 
the additional cost of $20,000. As you will see, this analysis is quite faulty. 

Such an approach makes two mistakes: first it assumes that the total time span 
calculated by looking at individual averages is a useful approximation to the 
expected total time span. Second, it overlooks the fact that the relevant criterion 
is expected incremental profit, and not incremental profit for the expected time 
span. 

By evaluating all the different possible events and their probabilities of 
occurrence, it can be shown that 

(1)    $P[T = 4] = \tfrac{2}{27}$    $P[T = 5] = \tfrac{8}{27}$    $P[T = 6] = \tfrac{14}{27}$    $P[T = 7] = \tfrac{3}{27}$, 

where $T$ denotes the total time span. Therefore, verify that 


(2)    $E[\text{incremental profit without new manager}] = 64$, 

and 

(3)    $E[\text{incremental profit with new manager}] = 99$. 
To keep the arithmetic uncluttered, we have rounded the computations to the 
nearest thousand dollars. Thus the gain in expected incremental profit from the 
new manager is $35,000 (= $99,000 - $64,000), which exceeds the additional 
cost of $20,000. Notice that the faulty analysis using the time average went astray 
because it was considerably overoptimistic in assessing the economic outcome when 
the new assistant was not hired. 
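
The distribution of the total time span and the two expected profits can be reproduced by enumerating the 27 equally likely outcomes. The sketch below assumes the precedence structure described for Fig. 16.2 (C and D follow A; E follows B and D; completion requires C and E); the function and variable names are hypothetical. 

```python
from fractions import Fraction as F
from itertools import product

# Durations in periods from Fig. 16.3; each listed value is equally likely.
t_A, t_D = 2, 1
t_B_values, t_C_values, t_E_values = (2, 3, 4), (2, 3, 4), (1, 2, 3)

# Incremental profit in $1000 from Fig. 16.4, indexed by total time span.
profit = {3: 120, 4: 110, 5: 100, 6: 50, 7: 0}

def total_span(t_B, t_C, t_E):
    """Length of a longest route through the network of Fig. 16.2."""
    return max(t_A + t_C, max(t_A + t_D, t_B) + t_E)

outcomes = list(product(t_B_values, t_C_values, t_E_values))
weight = F(1, len(outcomes))  # independence: 27 equally likely triples

dist = {}
for t_B, t_C, t_E in outcomes:
    T = total_span(t_B, t_C, t_E)
    dist[T] = dist.get(T, 0) + weight

# The manager shortens every span by one period.
without_manager = sum(p * profit[T] for T, p in dist.items())
with_manager = sum(p * profit[T - 1] for T, p in dist.items())

print(dict(sorted(dist.items())))
print(float(without_manager), float(with_manager))
```

Rounded to the nearest thousand dollars, the two expectations agree with the figures 64 and 99 quoted in the text. 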

The faulty analysis exhibited the following fallacy. 

Fallacy of Averages. Given an arbitrary nonlinear function $f(x_1, \ldots, x_n)$ of 
random variables $x_1, \ldots, x_n$, it is usually erroneous to assume 
$E[f(x_1, \ldots, x_n)] = f(E[x_1], \ldots, E[x_n])$. 

Although mathematical analysis of a particular nonlinear function may establish 
that its expected value is well approximated by the same function of the expected 
values, you should never blindly assume that it does. 
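
The Galactic Reaper network itself supplies a concrete instance of the fallacy, since the total time span is a nonlinear (maximum-type) function of the task times. The sketch below compares the span evaluated at the average task times with the average of the span; it assumes the network structure described for Fig. 16.2, and the names are hypothetical. 

```python
from itertools import product
from statistics import mean

def total_span(t_B, t_C, t_E, t_A=2, t_D=1):
    """Longest route: C and D follow A; E follows both B and D."""
    return max(t_A + t_C, max(t_A + t_D, t_B) + t_E)

values_B, values_C, values_E = (2, 3, 4), (2, 3, 4), (1, 2, 3)

# f(E[x]): the span computed from the average task times.
span_at_means = total_span(mean(values_B), mean(values_C), mean(values_E))

# E[f(x)]: the average span over the 27 equally likely outcomes.
mean_of_spans = mean(
    total_span(b, c, e) for b, c, e in product(values_B, values_C, values_E)
)

print(span_at_means, round(mean_of_spans, 2))
```

The expected span exceeds the span at the averages, so planning on the averages understates the typical completion time. 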


Uncertainty and information. What is the economic impact 
of uncertainty in the foregoing example? A way to give a quantified answer 
to this question is to evaluate the gain in expected profit that would occur if 
Galactic Reaper’s President were able to obtain a perfect prediction of the uncer¬ 
tain elements. In other words, the impact of uncertainty can be measured as the 
maximum amount the President would be willing to pay if, after the payment, he 
were able to learn the exact values of the random elements and consequently 
decide without error whether to hire the new manager. This figure is easily 
computed. 

Recall that the decision to hire the man is better only when T = 6 and 7, and 
the net profit will be $80,000 (= $100,000 - $20,000) and $30,000 (= $50,000 - 
$20,000), respectively. Then the expected net profit when the values of the 
random elements are known prior to the decision is 

(4)    $E[\text{net profit with perfect information}] = (110)\tfrac{2}{27} + (100)\tfrac{8}{27} + (80)\tfrac{14}{27} + (30)\tfrac{3}{27} = 83$, 

as you should check. Given the expectation in (3), the expected net profit with 
uncertainty is $79,000 (= $99,000 — $20,000). Therefore the gain from 
perfect information is $4,000 (= $83,000 — $79,000), taking account of 
the added cost of the new manager. This figure may be interpreted as measuring 
the impact or loss from uncertainty, for the President of Galactic Reaper would 
not pay over $4000 to have perfect foresight. 
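
The value of perfect information can be computed in a few lines once the distribution of the total span is in hand. In the sketch below the distribution is the one obtained by enumerating the 27 equally likely outcomes of Fig. 16.3; the function names are hypothetical, and amounts are in thousands of dollars. 

```python
from fractions import Fraction as F

# Distribution of the total span T (periods), from enumerating Fig. 16.3.
dist = {4: F(2, 27), 5: F(8, 27), 6: F(14, 27), 7: F(3, 27)}
profit = {3: 120, 4: 110, 5: 100, 6: 50, 7: 0}  # $1000, Fig. 16.4
manager_cost = 20

def net_if_hired(T):
    """Net profit when the manager cuts the span to T - 1 periods."""
    return profit[T - 1] - manager_cost

def net_if_not_hired(T):
    return profit[T]

# Under uncertainty, one action must be chosen before T is known.
hire = sum(p * net_if_hired(T) for T, p in dist.items())
do_not_hire = sum(p * net_if_not_hired(T) for T, p in dist.items())
best_under_uncertainty = max(hire, do_not_hire)

# With perfect information, the better action is chosen for each T.
with_information = sum(
    p * max(net_if_hired(T), net_if_not_hired(T)) for T, p in dist.items()
)

value_of_information = with_information - best_under_uncertainty
print(float(with_information), float(best_under_uncertainty),
      float(value_of_information))
```

The unrounded gain is 100/27, a little under the $4000 figure that results from rounding each expectation to the nearest thousand first. 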


16.4 TWO-STAGE LINEAR MODEL 

In this section we begin consideration of how to account for probabilistic 
uncertainty in the coefficients of a linear programming problem. As you know 
by now, linear programming has a multitude of applications. You may easily have 
gotten the impression after reading Chap. 2 that linear programming is a “model 
for all seasons.” For this reason alone, trying to extend the linear programming 
problem to encompass chance elements is a worthwhile goal. 

Our task in the next several sections is to devise ways of formulating a so-called 
stochastic linear programming model that yields ordinary linear 
programming problems as a result. Because of the great diversity of applications 
mentioned above, we cannot hope to succeed in this task unless we add some 
specific postulates about the underlying structure of the decision process. In 
particular, we have to formalize the evolution of “which decisions have to be made 
when what information is known about the previous decisions and the random 
variables.” 

A fairly general structure of this sort is contained in the advanced material of 
Sec. 16.7. That formulation permits you to model a long and involved history of 
decisions and information flow. You can glean many of the principal ideas, how¬ 
ever, by examining a scaled-down version with only two stages to the decision 
process. Consequently that is the problem we consider in this section. 

In particular, you will learn two things from investigating the two-stage model. 
First, you will see how to formulate a simple stochastic programming model to 
yield an equivalent ordinary linear programming problem. Second, you will see 
that such a formulation magnifies the size of the problem. You will be justified 
in concluding that the more general situation modeled in Sec. 16.7 is likely to be 
beyond practicality for most real linear programming applications. 

In Sec. 16.5, you will study a chance-constrained model, which is an alternative 
formulation for a stochastic programming problem. The approach avoids 
some of the drawbacks in the two-stage formulation below, but, as you will see, 
it gives rise to some limitations of its own. 

An easy case. Before presenting an example of a two-stage model, we 
acknowledge a fundamental result for what might be termed a simple one-stage 
problem. To ease the exposition, suppose a deterministic version of the model 
can be written in the canonical form: 

(1)    maximize $\sum_{j=1}^{n} c_j x_j$ 

subject to 

(2)    $\sum_{j=1}^{n} a_{ij} x_j = b_i$    for $i = 1, 2, \ldots, m$ 

(3)    $x_j \ge 0$    for $j = 1, 2, \ldots, n$. 


Now assume that the coefficients in the objective function really are random, 
and that all the levels of the variables have to be determined prior to learning the 
actual values for the random $c_j$. Such a situation might arise in a planning model 
where future market prices and labor costs are not known exactly at the time the 
plan is being developed. Since all the structural coefficients $a_{ij}$ and the 
right-hand-side coefficients $b_i$ are known with certainty, no difficulty arises in 
selecting feasible levels for the $x_j$. 
Postulating that the appropriate objective function is to maximize the expected 
value of the summation in (1), it is easy to establish the theorem below. 

Linear Certainty-Equivalence Theorem: Assume that all the $a_{ij}$ and $b_i$ in 
(2) are known exactly, but the $c_j$ in (1) are random variables independent of all 
$x_j$. If the levels of $x_j$, for $j = 1, \ldots, n$, must be set prior to knowing the exact 
values of $c_j$, then a solution to 

(4)    maximize $E\left[\sum_{j=1}^{n} c_j x_j\right]$ 

subject to (2) and (3), is given by levels for $x_j$ that satisfy 

(5)    maximize $\sum_{j=1}^{n} E[c_j]\, x_j$ 

subject to (2) and (3). 

Thus if the only random variables are the objective-function coefficients, and 
these are independent of the specific activity levels, then an optimal solution can 
be found from an equivalent deterministic linear program, where the corresponding 
expected values are used in the objective function. As you will see next, a linear 
problem with uncertainty is not solved so simply when there are other random 
elements, or when there are $x_j$ that can be set after learning the exact values for 
some of the random elements. 
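
The certainty-equivalence result rests on the fact that, for fixed activity levels, the expected objective is linear in the expected coefficients. The sketch below checks this on a tiny made-up instance with one constraint and two equally likely coefficient scenarios; all data and names are hypothetical, and the search over a grid of feasible points stands in for a proper linear programming solver. 

```python
from fractions import Fraction as F

# Hypothetical instance: two activities, constraint x1 + x2 = 4, x >= 0.
# Two equally likely scenarios for the objective coefficients (c1, c2).
scenarios = [((5, 1), F(1, 2)), ((1, 3), F(1, 2))]
expected_c = [sum(p * c[j] for c, p in scenarios) for j in range(2)]

def expected_objective(x):
    """E[sum_j c_j x_j], computed scenario by scenario."""
    return sum(p * sum(cj * xj for cj, xj in zip(c, x)) for c, p in scenarios)

def certainty_equivalent(x):
    """sum_j E[c_j] x_j, the deterministic objective."""
    return sum(ec * xj for ec, xj in zip(expected_c, x))

# Feasible points on a grid along the segment x1 + x2 = 4.
candidates = [(F(k, 4), 4 - F(k, 4)) for k in range(17)]

# The two objectives coincide at every feasible point ...
assert all(expected_objective(x) == certainty_equivalent(x) for x in candidates)

# ... so maximizing either one selects the same activity levels.
best = max(candidates, key=certainty_equivalent)
print(expected_c, best, certainty_equivalent(best))
```

Note that the best point under the expected coefficients need not be best in either individual scenario; it is best on the average. 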

Example. We start with a small example showing how a two-stage model 
can arise in a decision-making context. By studying the specifics of this illustra¬ 
tion, you will find it easier to understand the more general explanation at the end 
of this section. 

Each month, the Big Board Company, a wood-products manufacturer, processes 
a given tonnage supply of timber, $T$, into lumber and plywood. To keep the 
example simple, suppose Big Board makes only one premium quality grade of 
each of the two timber products. At the start of a month, the company must 
decide the levels of 

$x_1$ = the tons of timber supply allocated to lumber manufacturing 
$x_2$ = the tons of timber supply allocated to plywood manufacturing 
$x_3$ = the tons of timber not allocated to either manufacturing process. 

Assume that by the end of the month $x_1$ yields $a_1 x_1$ hundred board-feet of lumber, 
and $x_2$ yields $a_2 x_2$ thousand sheets of plywood. Let $D_1$ denote the maximum amount 
of lumber the company can sell at the end of the month, and $D_2$ the maximum 
amount of plywood. For each manufactured product ($j = 1$ and 2), let $r_j$ represent 
the market price Big Board receives per unit (that is, per hundred board-feet of 
lumber and per thousand sheets of plywood); hence, the amount $r_j(a_j x_j)$ is the 
total revenue from Product $j$. Let $e_j$ be the production cost of processing a ton of 
timber into Product $j$; therefore, $e_j x_j$ is the total manufacturing cost associated 
with Product $j$. Consequently, defining $c_j \equiv r_j a_j - e_j$, the quantity $c_j x_j$ 
represents the profit from Product $j$, where $j = 1, 2$. In addition the company receives 
$c_3 x_3$ from selling the unused timber on the open commodity market; even if Big 
Board decided to sell all its timber on the commodity market, it would not affect 
the value of $c_3$. 

If the values of $a_j$, $c_j$, and $D_i$ are all known with certainty, the decision problem 
can be characterized as an ordinary linear programming problem: 

(6)    maximize $c_1 x_1 + c_2 x_2 + c_3 x_3$ 

subject to 

(7)    $x_1 + x_2 + x_3 = T$    (total supply) 

(8)    $a_1 x_1 + s_1 = D_1$    (lumber demand) 

(9)    $a_2 x_2 + s_2 = D_2$    (plywood demand) 

(10)    all $x_j$ and $s_i$ nonnegative, 

where $s_i$ represents the unfilled demand for Product $i$. Study the simple structure 
of (6) through (10) and explain why a numerical solution is trivial once you are 
given values for $a_j$, $c_j$, and $D_i$. 

In a typical situation, the market prices for the products vary from week to 
week, depending on the supply and demand conditions. Therefore the Big Board 
Company cannot know the exact values of prices $r_1$ and $r_2$ (and thus $c_1$ and $c_2$) 
until several weeks after the company decides the levels of $x_1$, $x_2$, and $x_3$. 
Furthermore, both the yields $a_1$ and $a_2$ as well as the potential demands $D_1$ and $D_2$ are 
subject to random variation. Therefore $s_1$ and $s_2$ are actually determined after 
the values of $a_j$ and $D_i$ become known. What is more, if $a_j x_j$ exceeds the maximum 
demand, the company will have units left over. Thus you see that the structure 
(7) through (10) is not yet complete for the case where the $a_j$ and $D_i$ are random. 

To complete the description of the optimization model, assume the Big Board 
Company must dispose of any units left over, and has to sell them at a lower profit 
margin to do so. Consequently, add the terms $-t_1$ to (8), $-t_2$ to (9), and 
$-f_1 t_1 - f_2 t_2$ to (6), where $t_i \ge 0$ denotes excess units of Product $i$ and $-f_i t_i$ is the 
profit loss due to a disposal sale of the amount $t_i$. The loss coefficients $f_i$ are also 
random and unknown when the levels of $x_j$ must be chosen. The actual levels of $t_i$ 
are to be determined along with the $s_i$ after the values of all the random variables 
are learned. 

To summarize the approach, note that the timing sequence is: 

(i) First Stage. The company selects the levels of $x_1$, $x_2$, and $x_3$ knowing only 
the exact values of $T$ and $c_3$. 

(ii) Random Event. The specific values of the random elements $c_1$, $c_2$, $f_1$, $f_2$, $a_1$, 
$a_2$, $D_1$, and $D_2$ become known, and are independent of the levels of $x_1$, $x_2$, $x_3$. 

(iii) Second Stage. The company establishes the levels of $s_1$, $s_2$, $t_1$, and $t_2$. 

Given this information structure, Big Board selects those levels for $x_1$, $x_2$, $x_3$ that 
maximize expected profit. 
Without any further simplifying assumptions, even this small-scale example 
would be difficult to solve. So first postulate that for each Product $j = 1$ and 2, 
the yield coefficient $a_j$ and the market price $r_j$ are independent random variables. 
Therefore, you can write the expected value of the profit coefficient $c_j$ as 

(11)    $E[c_j] = E[r_j] \cdot E[a_j] - e_j$    for $j = 1$ and 2. 

Then make one more postulate about the nature of uncertainty, which will enable 
you to solve the problem as a standard linear program: assume there are only a 
finite number $Q$ of different possible sets of values for $(f_1, f_2, a_1, a_2, D_1, D_2)$. To 
illustrate with the above example, suppose $Q = 3$. That is, assume that only three 
sets of values can occur: 

        $(f_{11}, f_{12}, a_{11}, a_{12}, D_{11}, D_{12})$ with probability $p_1$, 

(12)    $(f_{21}, f_{22}, a_{21}, a_{22}, D_{21}, D_{22})$ with probability $p_2$, 

        $(f_{31}, f_{32}, a_{31}, a_{32}, D_{31}, D_{32})$ with probability $p_3$, 

where $p_1 + p_2 + p_3 = 1$. With this information, you can calculate 

        $E[a_j] = \sum_{q=1}^{3} p_q a_{qj}$, 

and given $E[r_j]$ and $e_j$, you can determine $E[c_j]$ from (11). Then the appropriate 
linear programming formulation for this two-stage model is 

(13)  maximize  $E[c_1]x_1 + E[c_2]x_2 + c_3x_3 - p_1[f_{11}t_{11} + f_{12}t_{12}] - p_2[f_{21}t_{21} + f_{22}t_{22}] - p_3[f_{31}t_{31} + f_{32}t_{32}]$ 

subject to 

(14)  $x_1 + x_2 + x_3 = T$  (total supply) 

(15)  $a_{11}x_1 + s_{11} - t_{11} = D_{11}$  (lumber demand for q = 1) 

(16)  $a_{12}x_2 + s_{12} - t_{12} = D_{12}$  (plywood demand for q = 1) 

(17)  $a_{21}x_1 + s_{21} - t_{21} = D_{21}$  (lumber demand for q = 2) 

(18)  $a_{22}x_2 + s_{22} - t_{22} = D_{22}$  (plywood demand for q = 2) 

(19)  $a_{31}x_1 + s_{31} - t_{31} = D_{31}$  (lumber demand for q = 3) 

(20)  $a_{32}x_2 + s_{32} - t_{32} = D_{32}$  (plywood demand for q = 3) 

(21)  all $x_j \ge 0$ and all $s_{qi} \ge 0$ and all $t_{qi} \ge 0$. 

Examine (12) through (21) to see how the two-stage decision process has been 
formulated. In particular, note that: 

(i) The deterministic timber supply constraint restricting the first-stage 
variables is included as (14). 

(ii) There are Q = 3 groups of constraints, one group for each possible set of 
values for the random elements; the three groups in the example are 
[(15), (16)], [(17), (18)], and [(19), (20)]. 

(iii) The first-stage variables appear in each of the three groups of constraints 
given in (ii). The coefficients of these variables are the specific values of the 
random elements corresponding to the index q, where q = 1, 2, 3. 

(iv) There is a set of second-stage variables $(s_{q1}, t_{q1})$ and $(s_{q2}, t_{q2})$ associated 
with each of the three groups of constraints given in (ii). Their levels are 
relevant and to be implemented if and when the corresponding values for 
the random elements actually do occur. 

(v) The objective function contains the unconditional expected values of the 
profit coefficients for the first-stage variables. 

(vi) The objective function weights the profit coefficients for the second-stage 
variables by the probability $p_q$ that the associated set of second-stage 
variables will be relevant. 
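
The block structure noted in (i) through (vi) can be made concrete with a short sketch. The Python below assembles the objective row and the constraint rows of a scenario-expanded program of the form (13) through (21). All numbers are hypothetical placeholders (the text gives none), and the function name `build_two_stage_lp` and the variable ordering are assumptions made only for this illustration. The sketch builds the data; it does not solve the program.

```python
from fractions import Fraction

# Hypothetical data, NOT from the text: expected profit coefficients,
# the known coefficient c3, the timber supply T, and Q = 3 scenarios.
EXP_C = (Fraction(9), Fraction(11))   # E[c1], E[c2]
C3 = Fraction(2)
T = Fraction(10)
SCENARIOS = [
    {"p": Fraction(1, 4), "f": (4, 6), "a": (2, 3), "D": (10, 12)},
    {"p": Fraction(1, 2), "f": (5, 7), "a": (3, 4), "D": (14, 16)},
    {"p": Fraction(1, 4), "f": (6, 8), "a": (4, 5), "D": (18, 20)},
]


def build_two_stage_lp(exp_c, c3, supply, scenarios):
    """Return (objective, rows, rhs, names) for the expanded program.

    Columns are x1, x2, x3 followed by (s_q1, t_q1, s_q2, t_q2) for each q.
    Every row is an equality constraint: row . variables = rhs.
    """
    names = ["x1", "x2", "x3"]
    for q in range(1, len(scenarios) + 1):
        names += [f"s{q}1", f"t{q}1", f"s{q}2", f"t{q}2"]
    col = {name: k for k, name in enumerate(names)}
    n = len(names)

    # Objective as in (13): expected first-stage profit minus the
    # probability-weighted disposal losses of each scenario.
    objective = [Fraction(0)] * n
    objective[col["x1"]] = exp_c[0]
    objective[col["x2"]] = exp_c[1]
    objective[col["x3"]] = c3

    rows, rhs = [], []

    # Supply constraint as in (14): first-stage variables only.
    row = [0] * n
    for v in ("x1", "x2", "x3"):
        row[col[v]] = 1
    rows.append(row)
    rhs.append(supply)

    # One pair of demand constraints per scenario, as in (15)-(20).
    for q, sc in enumerate(scenarios, start=1):
        for i in (1, 2):
            objective[col[f"t{q}{i}"]] = -sc["p"] * sc["f"][i - 1]
            row = [0] * n
            row[col[f"x{i}"]] = sc["a"][i - 1]   # a_qi * x_i
            row[col[f"s{q}{i}"]] = 1             # + s_qi (unfilled demand)
            row[col[f"t{q}{i}"]] = -1            # - t_qi (excess output)
            rows.append(row)
            rhs.append(sc["D"][i - 1])
    return objective, rows, rhs, names


objective, rows, rhs, names = build_two_stage_lp(EXP_C, C3, T, SCENARIOS)
print(len(names), "variables and", len(rows), "constraints")
```

With Q scenarios the sketch produces 3 + 4Q columns and 1 + 2Q rows, so the program grows linearly in the number of scenarios. The first-stage columns appear in every scenario block, while each scenario's own columns appear only in its own two rows.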

Solving (13) through (21) is not a trivial task, as compared to solving (6) through 
(10). To check your understanding, how would (13) through (21) be simplified 
if the values for $a_1$, $a_2$, and $D_2$ were not random (do not allow an excess of Product 
2)? 

It is very important that you perceive the true nature of the solution. You 
must decide right now the levels of the first-stage variables, and the model (13) 
through (21) provides the optimal levels. You do not need to set the levels of the 
second-stage variables until the uncertainties are resolved. Consequently, what 
you find now for the second-stage variables are optimal decision rules, that 
is, a strategy that indicates what levels you will choose for each and every possible 
outcome of the uncertain events, given the values you already selected for the first- 
stage variables. The fact that you have to determine rules for your future actions 
is what distinguishes a stochastic from a deterministic dynamic optimization 
model and is what makes the computational task much more difficult. 

Summary. We give here a general formulation of the so-called two-stage 
linear model. Using the notation of the deterministic version (1) through (3), 
assume that in the stochastic version: 

1. The value of each random element is independent of the levels of all $x_j$. 

2. The levels of $x_j$, for j = 1, 2, ..., k < n, must be fixed at the first stage 
before any exact values of the random elements are known. 

3. The constraints i = 1, 2, ..., g contain only the first-stage variables, and 
the associated $a_{ij}$ and $b_i$ are known with certainty. 

4. There always exist feasible levels for the remaining second-stage variables 
$x_j$, for j = k + 1, ..., n. These are to be established after all the random 
values are known. 

5. There are a finite number Q of possible sets of values for the $c_j$, where 
j = k + 1, ..., n, and for $a_{ij}$ and $b_i$, where i = g + 1, ..., m and where 
j = 1, ..., n. Denote these sets by $(c_{qj}, a_{qij}, b_{qi})$ and the associated 
probability of occurrence by $p_q$, for q = 1, 2, ..., Q. 
Then an optimal decision rule can be found by solving the linear program 

(22)  maximize  $\sum_{j=1}^{k} E[c_j]\,x_j + \sum_{q=1}^{Q} p_q \Big[ \sum_{j=k+1}^{n} c_{qj}\,x_{qj} \Big]$ 

subject to 

(23)  $\sum_{j=1}^{k} a_{ij}x_j = b_i$  for i = 1, 2, ..., g  (first stage) 

(24)  $\sum_{j=1}^{k} a_{qij}x_j + \sum_{j=k+1}^{n} a_{qij}x_{qj} = b_{qi}$  (second-stage decision rules) 
      for i = g + 1, ..., m and q = 1, 2, ..., Q 

(25)  all $x_j \ge 0$ and all $x_{qj} \ge 0$. 

[If any $c_j$ is known exactly, then use this value in (22). If any $b_i$ for i = g + 1, ..., 
m is known, then let $b_{qi}$ equal this value in (24), and do the same for any $a_{ij}$ that 
is known.] Notice that (24) contains (m − g)Q equations. In real applications 
where (m − g) can easily be between 25 and 50, the approach is impractical if 
Q is large. 

You can read Sec. 16.7 if you are curious to learn how a multistage stochastic 
linear programming problem can be transformed by an analogous line of reasoning 
into a very large ordinary linear programming problem. The advanced 
material below discusses when you can give easily computed bounds on the 
optimal value of the objective function in a stochastic programming problem. 
In the next section, you will see another formulation for stochastic linear programming 
that does not expand the size of an equivalent standard linear programming 
model in a two-stage decision process. 

► You can exploit the special structure of (23) and (24) in computing a numerical 
solution by employing the Decomposition Algorithm of Sec. 15.11 on the dual problem. 
The dual constraints associated with variables $x_j$ for j = 1, 2, ..., k comprise the 
master program, and for each q, the dual constraints associated with the variables $x_{qj}$, 
for j = k + 1, ..., n, comprise a subprogram. To guide your understanding, write the 
dual to (13) through (21) and identify the master and subprograms. 

The model in (22) through (25) assumes that the values of all the random elements 
are known by the time you have to decide the levels for the second-stage variables. 
With only negligible added complication you can expand the model to include the 
possibility that certain of the objective-function coefficients of the second-stage variables 
become known after the levels of these variables have been determined. Specifically, 
for j = h + 1, ..., n (where k ≤ h ≤ n), let $E[c_j \mid q]$ denote the conditional expected 
value of $c_j$ given that the random event associated with the index q occurs. Then let 
$c_{qj} = E[c_j \mid q]$ for j = h + 1, ..., n, in (22). 

Section 16.1 emphasized the importance of assessing the impact of uncertainty in a 
particular model. The development to follow will help you gain further insight into the 
effect of the random elements in the two-stage model. In some situations, the formulas 
also provide a useful way to measure whether an approximate solution is nearly optimal. 

The model (22) through (25) is sometimes called the here-and-now problem, because 
the levels of several variables must be established immediately, before the exact values 
of all the random elements are known. A convenient alternative mathematical 
representation of (22) through (25) is 

(i)  $\max_{x_1,\dots,x_k} E\Big[ \sum_{j=1}^{k} c_j x_j + \max_{x_{k+1},\dots,x_n} \sum_{j=k+1}^{n} c_j x_j \Big] \equiv E[\text{here and now}]$, 

where the levels $x_j \ge 0$, for j = 1, 2, ..., k, are set first and must satisfy 

(ii)  $\sum_{j=1}^{k} a_{ij}x_j = b_i$  for i = 1, 2, ..., g  (first stage), 

and subsequently the levels $x_j \ge 0$, for j = k + 1, ..., n, are set to satisfy 

(iii)  $\sum_{j=k+1}^{n} a_{ij}x_j = b_i - \sum_{j=1}^{k} a_{ij}x_j$  for i = g + 1, ..., m  (second stage) 

after the values of all the coefficients in (iii) are known. In (i) through (iii) and the 
models below, you can drop the assumption that there are only Q possible sets of values 
for the random elements. Assume, however, that there is always a feasible solution to (iii). 
A contrasting model, sometimes called the wait-and-see problem, arises when the value of 
every random variable is known before the levels of any $x_j$ are set. Denote this model as 

(iv)  $E\Big[ \max_{x_1,\dots,x_n} \sum_{j=1}^{n} c_j x_j \Big] \equiv E[\text{wait and see}]$, 

where the levels of all $x_j \ge 0$ are determined simultaneously so as to satisfy both (ii) 
and (iii). In the event there are only Q possible sets of values for the random elements, 
the value of (iv) can be calculated by finding an optimal solution to each of the Q 
linear programming problems and then weighting the objective-function value by the 
associated probability $p_q$. In this case, each linear programming problem has the same 
size and structure as the deterministic version (1) through (3). Since being able to wait 
and see gives you helpful information, the value of E[wait and see] is an upper bound for 
E[here and now]. 

Consider next an optimal solution to a deterministic version of the model in which 
each random element is replaced by its expectation; denote this situation as the 
average-value problem. Let $\bar{x}_j \ge 0$, for j = 1, 2, ..., n, be a solution to 

(v)  maximize  $\sum_{j=1}^{n} E[c_j]\,x_j \equiv E[\text{average value}]$ 

subject to (ii) and 

(vi)  $\sum_{j=1}^{n} E[a_{ij}]\,x_j = E[b_i]$  for i = g + 1, ..., m. 

Notice (v), (ii), and (vi) comprise an ordinary linear programming problem with the 
same size and structure as the deterministic version (1) through (3). Given the previous 
assumption that there is always a solution to (iii), there will always be a feasible solution 
to (ii) and (vi), as we establish below. Also we demonstrate that E[average value] is an 
upper bound for E[wait and see]. 

Finally, suppose that in the two-stage model, you set $x_j = \bar{x}_j$, for j = 1, 2, ..., k, 
found by (v), (ii), and (vi); this is a feasible action since these levels satisfy (ii). Set the 
other variables $x_j$, for j = k + 1, ..., n, as previously in (iii). Denote this situation as 
the first-stage, average-value approximation approach. The expected value of the objective 
function for this strategy can be written as 

(vii)  $E\Big[ \sum_{j=1}^{k} c_j \bar{x}_j + \max_{x_{k+1},\dots,x_n} \sum_{j=k+1}^{n} c_j x_j \Big] \equiv E[\text{first-stage, average-value approximation}]$. 

If there are Q possible sets of values for the random elements, the value of (vii) can be 
computed by solving each of Q linear programming problems, using only the constraints 
(iii), and then weighting each resultant objective-function value by the associated 
probability $p_q$. In some applications, the solution to each smaller problem is trivial. 

It can be proved that the objective function values for the above problems do exhibit 
the relations 

(viii)  E[first-stage, average-value approximation] ≤ E[here and now] 
                                                    ≤ E[wait and see] 
                                                    ≤ E[average value]. 

The validity of the first inequality can be argued by noting that the first-stage, average- 
value approximation is a particular one of the many feasible solutions considered in the 
here-and-now problem; consequently, it cannot be better than an optimal here-and-now 
solution. 

The second inequality reflects the fact that the expected value of the objective function 
may be improved if all the values of the random elements are known before the levels of 
$x_j$ are set. The difference (E[wait and see] − E[here and now]) may be called the cost of 
uncertainty. The quantity represents the maximum amount to pay if, as a result, you then 
receive a perfect prediction of the random elements before setting the levels of all the $x_j$. 

The final inequality provides an easily computed upper bound for the other three 
expected values. It can be established as follows. Calculate the "expected value" of the 
$x_j$, for j = 1, 2, ..., n, as determined by the optimal decision rules for the here-and-now 
problem. This solution is feasible in the average-value problem. Further, from (i) this 
solution gives the optimal E[here and now] as the value of the objective function in (v), 
and hence the optimal E[average value] must be at least as large. 

In the event that E[first-stage, average-value approximation] is easy to calculate and that 
its value does not differ much from E[average value], then the strategy of letting the first- 
stage variables be set at level $\bar{x}_j$, for j = 1, 2, ..., k, is a near-optimal solution to the 
here-and-now problem. 
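
To see the chain (viii) on numbers, the following Python sketch evaluates all four quantities for a deliberately tiny model. Everything in it is a hypothetical illustration rather than an example from the text: one first-stage variable x with 0 ≤ x ≤ T, a known unit profit C, a known unit disposal loss F, and a random demand D that takes three equally likely values. The second stage is trivial here, since the excess max(0, x − D) is determined once D is known. Because every breakpoint of the piecewise-linear profit is an integer in this data, searching the integer grid 0, 1, ..., T gives the exact optimum of each problem.

```python
from fractions import Fraction

# Hypothetical data (assumptions for illustration only).
C, F, T = 3, 12, 4                      # unit profit, unit disposal loss, capacity
DEMANDS = {1: Fraction(1, 3), 3: Fraction(1, 3), 5: Fraction(1, 3)}
GRID = range(T + 1)                     # candidate first-stage levels


def profit(x, d):
    """Profit when x is produced and demand turns out to be d."""
    return C * x - F * max(0, x - d)


def expected_profit(x):
    """Expected profit of fixing x before the demand is known."""
    return sum(p * profit(x, d) for d, p in DEMANDS.items())


# Here and now: choose x first, then average over the demand outcomes.
here_and_now = max(expected_profit(x) for x in GRID)

# Wait and see: choose the best x separately for each outcome, then average.
wait_and_see = sum(p * max(profit(x, d) for x in GRID)
                   for d, p in DEMANDS.items())

# Average value: replace the random demand by its expectation.
mean_demand = sum(p * d for d, p in DEMANDS.items())
x_bar = max(GRID, key=lambda x: profit(x, mean_demand))
average_value = profit(x_bar, mean_demand)

# First-stage, average-value approximation: use x_bar under true uncertainty.
first_stage_average_value = expected_profit(x_bar)

print(first_stage_average_value, here_and_now, wait_and_see, average_value)
```

In this sketch all three inequalities are strict, and the cost of uncertainty, E[wait and see] − E[here and now], can be read directly from the printed values. The gap between the first and last quantities is wide, so by the criterion stated above the average-value level `x_bar` would not be accepted as near-optimal for this data.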

In some situations it is possible to find another approximate solution to the here-and- 
now problem. Assume that all $c_j$, for j = k + 1, ..., n, and $b_i$, for i = g + 1, ..., m, 
have smallest values, denoted by $c_j^*$ and $b_i^*$, and that all $a_{ij}$, for i = g + 1, ..., m and 
j = 1, 2, ..., n, have largest values denoted by $a_{ij}^*$. Finally, assume there exist levels 
$x_j^* \ge 0$, for j = 1, 2, ..., n, that solve what is termed the most stringent problem: 

(ix)  maximize  $\sum_{j=1}^{k} c_j x_j + \sum_{j=k+1}^{n} c_j^* x_j \equiv C[\text{most stringent problem}]$ 

subject to (ii) and 

(x)  $\sum_{j=1}^{n} a_{ij}^* x_j = b_i^*$  for i = g + 1, ..., m. 

Note that (ix), (ii), and (x) is a linear programming problem with the same size and 
structure as the deterministic version. Further, the levels $x_j^*$, for j = 1, 2, ..., k, are 
feasible for the here-and-now problem. Note, however, that C[most stringent problem] 
understates the expected value of the objective function when $x_j^*$, for j = 1, 2, ..., k, are 
used in the here-and-now problem, since (ix) does not take into account that the $x_j$, 
for j = k + 1, ..., n, will be chosen optimally after the random elements are known. 
But it can be proved that 

(xi)  C[most stringent problem] ≤ E[here and now], 

so if C[most stringent problem] and E[average value] are nearly the same, then the levels 
$x_j^*$, for j = 1, 2, ..., k, are nearly optimal for the here-and-now problem. ◄ 

16.5 CHANCE-CONSTRAINED MODEL 

As you reflect on the solution technique for the two-stage stochastic linear 
programming model given in the preceding section, you will conclude that one 
of its major drawbacks is the resultant size of the equivalent standard linear 
program. The expanded dimensions are due to the explicit provision for complete 
decision rules to be applied at the second stage. If you make a few more simplifying 
assumptions, then you can employ another linear programming formulation that 
has the same number of constraints as the original stochastic model. The approach 
is called chance-constrained programming (named by Professors A. 
Charnes and W. W. Cooper, who have pioneered the technique). We motivate 
the approach by re-examining the illustration in the preceding section. 

Example. Reconsider the case of the timber products manufacturer, the Big 
Board Company. We further simplify the problem by assuming that the two yield 
coefficients $a_1$ and $a_2$ are known exactly. And instead of including the explicit 
assumption that the company disposes of excess production, we suppose that 
several action alternatives are open but we do not specify the economic consequences 
of these. Thus the only random elements are the market prices $r_j$ and 
the demand limitations $D_j$. Now the expected value of a profit coefficient is simply 
$E[c_j] = E[r_j]\,a_j - e_j$ for each Product j = 1 and 2. 

The objective function can be written as 

(1)  maximize  $E[c_1]x_1 + E[c_2]x_2 + c_3x_3$. 

Note that (1) contains only the first-stage variables, because we are not specifying 
the action alternatives in the event production exceeds demand. We also impose 
the previous supply and nonnegativity constraints 

(2)  $x_1 + x_2 + x_3 = T$  (total supply) 

(3)  all $x_j \ge 0$. 

Finally, we must place restrictions to limit production, because otherwise the 
model, in maximizing expected profit, is likely to indicate a solution that will 
exceed actual demand. Suppose, then, that the President of Big Board stipulates 
that with a probability of at least $\beta_i$ he wants to be able to sell all the output 
produced, namely $a_i x_i$, for i = 1, 2. His goal can be expressed mathematically as 

(4)  $P[a_1 x_1 \le D_1] \ge \beta_1$  (lumber demand) 

(5)  $P[a_2 x_2 \le D_2] \ge \beta_2$  (plywood demand). 

Restrictions (4) and (5) are called chance-constraints, because they impose 
restrictions on probabilities. These restrictions imply an equivalent pair of ordinary 
linear inequalities: 

(6)  $a_1 x_1 \le B_1$  (deterministic equivalent for lumber) 

(7)  $a_2 x_2 \le B_2$  (deterministic equivalent for plywood). 

In other words, the linear programming model comprised of the objective function 
(1), the supply and nonnegativity constraints (2) and (3), and the deterministic 
equivalent constraints (6) and (7) yields an optimal solution 
to the chance-constrained problem. 

The appropriate value for each $B_i$ in the deterministic equivalent is the 
largest number such that $P[D_i \ge B_i] \ge \beta_i$; the number $B_i$ is called the $(1 - \beta_i)$ 
fractile of the probability distribution of $D_i$. Note that the probability distribution 
is the marginal distribution for $D_i$. This means that if you start with a joint probability 
distribution for $D_1$ and $D_2$, you must, as a side calculation, derive the two 
marginal distributions. 

A numerical example will illustrate how to determine the $B_i$ from a marginal 
distribution. Suppose the marginal probability distribution for $D_1$ is 

(8)  $P[D_1 = 1] = .2$   $P[D_1 = 3] = .4$   $P[D_1 = 5] = .3$   $P[D_1 = 10] = .1$, 

giving the graph of $P[D_1 \ge B_1]$ shown in Fig. 16.5. Then verify that $B_1 = 1$ if 
$.8 < \beta_1 \le 1.0$, and $B_1 = 3$ if $.4 < \beta_1 \le .8$, etc. 
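
The fractile lookup just described is easy to mechanize for a discrete distribution. The sketch below is an illustration only: the function name `fractile_bound` is an assumption, and the third demand value in (8) is read as 5 from a damaged scan, so treat that entry as uncertain. The function scans the possible demand values from largest to smallest, accumulates the tail probability P[D ≥ value], and returns the first value whose tail probability reaches β.

```python
from fractions import Fraction

# Marginal distribution (8) for D_1, as {value: probability}.
# The value 5 is an assumed reading of a garbled entry in the source.
D1 = {
    1: Fraction(2, 10),
    3: Fraction(4, 10),
    5: Fraction(3, 10),
    10: Fraction(1, 10),
}


def fractile_bound(dist, beta):
    """Largest B with P[D >= B] >= beta, or None if no such B exists.

    For a discrete distribution the largest such B is always one of the
    possible values, so only those need to be examined.
    """
    tail = Fraction(0)
    for value in sorted(dist, reverse=True):
        tail += dist[value]          # tail now equals P[D >= value]
        if tail >= beta:
            return value
    return None


for beta in (Fraction(9, 10), Fraction(8, 10), Fraction(1, 2)):
    print(beta, fractile_bound(D1, beta))
```

The two ranges quoted in the text can be checked directly: any β above .8 gives B = 1, and any β above .4 and up to .8 gives B = 3. Exact fractions are used so that the boundary case β = .8 is not disturbed by floating-point rounding.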



Figure 16.5. Big Board 
Company Example. 


Observe that the linear programming model (1), (2), (3), (6), and (7) is as 
trivial to solve as the deterministic version (6) through (10) in the previous section. 


The approach. With the Big Board Company Example as background, we 
give a general statement of the technique. Let $x_j$, for j = 1, 2, ..., k, refer to the 
first-stage variables. We do not explicitly represent the second-stage variables. 
Assume that all the coefficients $a_{ij}$ are known exactly. And, finally, postulate that 
each random element is statistically independent of the levels for the first-stage 
variables. 

The chance-constrained model is to find levels $x_j$ that satisfy 

(9)  maximize  $\sum_{j=1}^{k} E[c_j]\,x_j$ 

subject to 

(10)  $\sum_{j=1}^{k} a_{ij}x_j = b_i$  for i = 1, 2, ..., g  (first stage) 

(11)  $P\Big[ \sum_{j=1}^{k} a_{ij}x_j \le b_i \Big] \ge \beta_i$  for i = g + 1, ..., m  (chance-constraints) 

(12)  all $x_j \ge 0$. 

You interpret (11) as constraining the unconditional probability to be no smaller 
than $\beta_i$, where $0 < \beta_i < 1$, that the actual value for $b_i$ is at least as large as 
$\sum_{j=1}^{k} a_{ij}x_j$. 

Assuming there are levels $x_j$ satisfying (10) through (12), the above formulation 
can be solved as a standard linear program with objective function (9), constraints 
(10), (12), and instead of the chance-constraints (11), the linear constraints: 

(13)  $\sum_{j=1}^{k} a_{ij}x_j \le B_i$  for i = g + 1, ..., m  (deterministic equivalents), 

where $B_i$ is the largest number satisfying 

(14)  $P[b_i \ge B_i] \ge \beta_i$ 

or, equivalently, 

(15)  $P[b_i < B_i] \le 1 - \beta_i$   [$B_i$: $(1 - \beta_i)$ fractile]. 


Discussion. You can compare the chance-constrained approach with the 
earlier two-stage approach. The chance-constrained model has two desirable 
properties. First, it leads to an equivalent linear program that has the same size 
and structure as a deterministic version of the model. Consequently, the computational 
burden of the stochastic version is no greater after the proper right- 
hand-side values have been determined. Second, the only information required 
about each uncertain element $b_i$ is the $(1 - \beta_i)$ fractile for the unconditional 
distribution of the right-hand-side coefficient. These properties compare favorably 
with the larger size of the linear program of the previous section and the restrictive 
assumption that there are only Q possible sets of values for the random elements. 

The principal weakness of the chance-constrained model is that it only indirectly 
evaluates the economic consequence of violating a constraint. In the example of 
the Big Board Company, the chance-constrained approach indicates no differential 
penalty between a small and large amount of excess production. Put 
somewhat differently, in most situations, specifying the correct values for $\beta_i$ should 
be part of the optimization problem. (An additional conceptual limitation is that 
the approach arbitrarily excludes randomized strategies that could be better, as 
was pointed out in the advanced material in Sec. 16.2. Also, as you will discover 
if you read the next section, a particular optimal solution for one formulation may 
never be optimal in an analogous formulation of the other. So you cannot be 
assured that there exist probabilities $\beta_i$ that in a chance-constrained model would 
yield a given solution that is optimal for a two-stage formulation.) 

Therefore, when faced with a choice between the two approaches for modeling 
a two-stage decision process, you will have to compare the serious limitation on 
the problem size imposed by the two-stage model of the previous section against 
the restricted meaning of optimality implied by the chance-constrained version. 
When you come to a multistage model, you will find the conceptual difficulties 
in a chance-constrained approach compound tremendously. The applicability 
of the approach in these instances is a question still being explored by theoreticians 
and practitioners. 

The remainder of the chapter treats a special example of a two-stage model and 
other formulations of stochastic programming models. The material is of interest 
if you want to round out your understanding of how to structure stochastic programming 
models into equivalent deterministic models. In any event, the preceding 
introduction to determining optimal strategies under uncertainty has given 
you the necessary preparation to consider dynamic programming models with 
probabilistic elements, explained in Chap. 17. 


► In some applications it may be appropriate to have one or more joint chance- 
constraints instead of the individual constraints (11). Usually these versions are 
equivalent to nonlinear programming models and can be solved by the techniques in 
Chap. 15. To illustrate what is involved, suppose (11) is replaced by the single probability 
restriction 

(i)  $P\Big[ \sum_{j=1}^{k} a_{ij}x_j \le b_i \text{ for } i = g + 1, \dots, m \Big] \ge \beta$  where $0 < \beta < 1$  (joint chance-constraint). 

Notice that here only one $\beta$ need be specified, so the resultant task of sensitivity testing 
is relatively easy. Let the marginal distribution functions be denoted as $F_i(b) = P[b_i \le b]$, 
and assume that each $F_i$ is continuous. Also assume the joint distribution function for 
the $b_i$ is simply $\prod_{i=g+1}^{m} F_i$, where the symbol $\prod$ denotes the "product" operation. Thus 
the $b_{g+1}, \dots, b_m$ are statistically independent. Finally, let $G_i = 1 - F_i$. Then an 
equivalent deterministic model is (9), (10), (12), and 

(ii)  $-y_i + \sum_{j=1}^{k} a_{ij}x_j = 0$  for i = g + 1, ..., m  (fractile definition) 

(iii)  $\prod_{i=g+1}^{m} G_i(y_i) \ge \beta$  (equivalent joint-constraint), 

where the $y_i$ are unconstrained in sign. Note that (iii) is a nonlinear constraint. Only 
rarely will the $G_i$ be such that (iii) is concave, as is required for most nonlinear algorithms. 
However, a transformed version 

(iv)  $\sum_{i=g+1}^{m} \ln G_i(y_i) \ge \ln \beta$  (transformed equivalent constraint) 

is frequently concave; for example, when the density for $b_i$ is given by a Normal, gamma, 
or uniform distribution. Techniques such as those in Chap. 15 can be applied to solve 
(9), (10), (12), (ii), and (iv). 

Consider the situation where the values of $b_i$ are known exactly but the $a_{ij}$ are random 
elements. In particular, suppose that (11) is replaced by 

(v)  $P\Big[ \sum_{j=1}^{k} a_{ij}x_j \le b_i \Big] \ge \beta_i$  for i = g + 1, ..., m  (random coefficients). 

If you can assume that for each i, the random elements $a_{ij}$, for j = 1, 2, ..., k, have a 
joint Normal distribution with means $\mu_{ij}$, for j = 1, 2, ..., k, and covariances $\sigma_{ijh}$, for 
j and h = 1, 2, ..., k, then a deterministic nonlinear equivalent for (v) is 

(vi)  $\sum_{j=1}^{k} \mu_{ij}x_j - B_i \Big( \sum_{j=1}^{k} \sum_{h=1}^{k} \sigma_{ijh}\,x_j x_h \Big)^{1/2} \le b_i$  for i = g + 1, ..., m 
      (deterministic equivalent), 

where $B_i$ is the $(1 - \beta_i)$ fractile of a unit Normal distribution. If $\beta_i \ge \tfrac{1}{2}$ so that $B_i \le 0$, 
then (vi) is concave and the model (9), (10), (12), and (vi) can be solved by techniques 
such as those in Chap. 15. ◄ 
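
As a rough numerical companion to the Normal-coefficient case, the sketch below evaluates the left-hand side of a deterministic equivalent of the form "mean minus fractile times standard deviation" for one constraint. The symbols for the means and covariances, the function name `chance_lhs`, and all numbers are assumptions for illustration; the sign convention follows the definition of $B_i$ as the $(1 - \beta_i)$ fractile of a unit Normal distribution, so that $B_i \le 0$ when $\beta_i \ge 1/2$. The unit Normal fractile comes from `statistics.NormalDist` in the Python standard library.

```python
from math import sqrt
from statistics import NormalDist


def chance_lhs(mu, cov, x, beta):
    """Left-hand side of: sum(mu_j x_j) - B * sqrt(x' Cov x) <= b.

    mu   : means of the random coefficients a_j
    cov  : covariance matrix of the a_j (list of lists)
    x    : candidate levels of the first-stage variables
    beta : required probability that sum(a_j x_j) <= b
    """
    fractile = NormalDist().inv_cdf(1.0 - beta)   # B, the (1 - beta) fractile
    mean = sum(m * xj for m, xj in zip(mu, x))
    k = len(x)
    variance = sum(cov[j][h] * x[j] * x[h] for j in range(k) for h in range(k))
    return mean - fractile * sqrt(variance)


# Hypothetical data: two independent coefficients.
mu = [2.0, 3.0]
cov = [[1.0, 0.0],
       [0.0, 0.25]]
x = [1.0, 2.0]

for beta in (0.5, 0.9, 0.975):
    print(beta, round(chance_lhs(mu, cov, x, beta), 4))
```

At β = 0.5 the fractile is zero and the constraint reduces to a restriction on the mean alone. Raising β adds a safety margin proportional to the standard deviation of the random left-hand side, which is why the equivalent constraint is nonlinear in the $x_j$.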


* 16.6 TRANSPORTATION NETWORK EXAMPLE 

Recall from Chap. 6 that the transportation (or distribution) problem is a very 
important network model, mathematically described as 

(1)  minimize  $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij}x_{ij}$ 

subject to 

(2)  $\sum_{j=1}^{n} x_{ij} = S_i$  for i = 1, 2, ..., m  (supply) 

(3)  $\sum_{i=1}^{m} x_{ij} = D_j$  for j = 1, 2, ..., n  (demand) 

(4)  $x_{ij} \ge 0$  for each i and j, 

where you may interpret $S_i$ as the available amount of supply for shipment at the 
ith source and $D_j$ as the required amount of demand at the jth destination, and 
where the model has been structured so that total supply equals total demand 
($\sum_i S_i = \sum_j D_j$). In Chap. 6, we assumed that the values for $S_i$ and $D_j$ were known 
exactly, and in Chap. 7 you saw how to apply the simplex method to obtain 
numerical solutions. Here we consider the problem where the demand requirements 
$D_j$ are uncertain. You will see how the appropriate two-stage model is not 
much more complicated than the deterministic version (1) through (4). 

Assume the time sequence of events is 

(i) First Stage. All the levels of $x_{ij}$ must be established, with knowledge of exact 
values for every $c_{ij}$ and possibly a few $D_j$. 

(ii) Random Event. The particular values of the random elements $D_j$ then become 
known. The values of $D_j$ are independent of the levels of the $x_{ij}$. 

(iii) Second Stage. The resultant amount of over- or underfulfillment of demand 
is established. 

Note that (iii) really implies that the model (1) through (4) has been augmented 
by n second-stage variables: 

(5)  $u_j = D_j - \sum_{i=1}^{m} x_{ij}$  for j = 1, 2, ..., n, 

where the $u_j$ are unconstrained in sign. To make the first-stage problem nontrivial, 
it is also necessary to add the effect of $u_j$ into the objective function, since otherwise, 
an optimal solution is to ship, for each i, the total $S_i$ to that Destination j where 
$c_{ij}$ is a minimum. Let $c_j(u_j)$ denote the cost or penalty associated with the resultant 
level $u_j$. Realize that each function $c_j(u_j)$ need not be linear; for example, you 
may want to use a quadratic penalty function. Then assume that the objective 
function is the sum of the transportation costs and the expected value of the 
penalty costs: 

(6)  minimize  $\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij}x_{ij} + \sum_{j=1}^{n} E[c_j(u_j)]$ 

subject to (2) and (4). Each expectation in (6) is over the corresponding marginal 
probability distribution of $D_j$. 

Note an important characteristic of this particular two-stage model: there is no 
optimization problem at the second stage. The levels of the second-stage variables 
Uj are uniquely determined according to (5) by the levels of the first-stage variables 
and the values of the random elements. This special structure can be fully exploited 
so that you do not have to deal with an equivalent deterministic problem of the 
sort suggested by the two-stage formulation given in Sec. 16.4. 

By your making two further postulates, the two-stage stochastic model, (2), 
(4), (5), and (6), becomes mathematically equivalent to an ordinary transportation 
problem. First, assume that for every j, the only possible values for $D_j$ are 
integers. Define, for each j, 

(7)  $x_j = \sum_{i=1}^{m} x_{ij}$  (total shipment to Destination j) 

and 

(8)  $C_j(x_j) = E[c_j(D_j - x_j)]$  (expected penalty at Destination j). 

An illustration of how to calculate (8) is given in the example below. Second, 
assume that for every Destination j, the expected penalty cost function $C_j(x_j)$ is 
convex: 

(9)  $C_j(x_j + 1) - C_j(x_j) \ge C_j(x_j) - C_j(x_j - 1)$  for all integers $x_j$. 

A sufficient condition for $C_j(x_j)$ to be convex is that the penalty function $c_j(u_j)$ is 
itself convex. The example below will illustrate how integer-value and convexity 
assumptions are sufficient to permit you to view the model as an expanded transportation 
problem. 

So that you are not misled, we caution that there are much more efficient 
computational techniques for obtaining numerical solutions than solving the 
problem in the fully expanded form. (Most of these techniques are applications 
of the Generalized Programming Algorithm in Sec. 15.10.) Hence, in studying the 
example here, you should give primary attention to understanding why the 
introduction of uncertain elements presents only a minor conceptual complication 
for this particular two-stage model. You can leave the streamlined algorithms to 
the technical specialists. 

[Figure 16.6. Transportation Example (Optimal Deterministic Solution). Supply: 6, 1, 10. Minimal Total Cost = 100.] 

Example. Consider the deterministic example given in Fig. 16.6 (which is the 
same as Fig. 7.2, p. 217); the unique optimal solution is also shown. Suppose now 
that the values of $D_3$ and $D_4$ are uncertain, and, specifically, assume that their 
probability distributions are 

(10)  $P[D_3 = 1] = P[D_3 = 3] = P[D_3 = 5] = \tfrac{1}{3}$  so that $E[D_3] = 3$ 

(11)  $P[D_4 = 0] = P[D_4 = 1] = P[D_4 = 3] = P[D_4 = 4] = \tfrac{1}{4}$  so that $E[D_4] = 2$. 

Thus the expected demands are the same as the previous deterministic demand; 
however, although $E[D_4] = 2$, an actual demand value ($D_4 = 2$) never occurs. 
Assume for j = 3, 4, there is no economic penalty for oversupply ($u_j \le 0$) at 
the jth destination, but that each unit of undersupply incurs a loss of $f_j$: 

(12)  $c_j(D_j - x_j) = 0$ for $D_j \le x_j$, and $c_j(D_j - x_j) = f_j\,(D_j - x_j)$ for $D_j > x_j$  (penalty function). 

Verify that $c_j(u_j)$, where $u_j = D_j - x_j$, is convex. Thus the expected cost is 

(13)  $C_j(x_j) = \sum_{D > x_j} f_j\,(D - x_j)\,P[D_j = D]$  (expected penalty function). 

To illustrate, for $x_3 = 1$, 

(14)  $C_3(1) = \sum_{D = 3, 5} f_3\,(D - 1)\,P[D_3 = D] = f_3[(3 - 1)\tfrac{1}{3} + (5 - 1)\tfrac{1}{3}] = 2f_3$, 

and for $x_4 = 3$, 

(15)  $C_4(3) = \sum_{D = 4} f_4\,(D - 3)\,P[D_4 = D] = f_4(4 - 3)\tfrac{1}{4} = \tfrac{1}{4}f_4$. 

All the $C_j(x_j)$ are tabled in Fig. 16.7. To test your understanding, verify the values 
for $C_3(3)$ and $C_4(0)$. Also explain verbally why $C_j(0) = f_j E[D_j]$ and why $C_j(x_j)$ 
decreases to 0 as $x_j$ increases to the largest possible value for $D_j$. Check that each 
$C_j(x_j)$ is convex according to (9); the values $C_j(x_j) - C_j(x_j + 1)$, shown in Fig. 
16.7, can be used to assist you. 



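  x_j            C_3(x_3)     C_3(x_3) - C_3(x_3 + 1)     C_4(x_4)     C_4(x_4) - C_4(x_4 + 1) 
  0              3 f_3        1 f_3                       2 f_4        3/4 f_4 
  1              2 f_3        2/3 f_3                     5/4 f_4      1/2 f_4 
  2              4/3 f_3      2/3 f_3                     3/4 f_4      1/2 f_4 
  3              2/3 f_3      1/3 f_3                     1/4 f_4      1/4 f_4 
  4              1/3 f_3      1/3 f_3                     0            0 
  5 and larger   0            0                           0            0 

Figure 16.7. Expected Cost of Undersupply. 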
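
The entries of Fig. 16.7 follow mechanically from (13), so a few lines of Python can regenerate them. The sketch below works in units of $f_j$ (that is, it computes $C_j(x_j)/f_j$) using the distributions (10) and (11); the function names `expected_penalty` and `penalty_table` are assumptions made for this illustration.

```python
from fractions import Fraction

# Demand distributions (10) and (11), as {value: probability}.
D3 = {1: Fraction(1, 3), 3: Fraction(1, 3), 5: Fraction(1, 3)}
D4 = {d: Fraction(1, 4) for d in (0, 1, 3, 4)}


def expected_penalty(dist, x):
    """C_j(x) / f_j from (13): expected units of undersupply at shipment x."""
    return sum(p * (d - x) for d, p in dist.items() if d > x)


def penalty_table(dist):
    """Return (C, increments) with increments[x] = C[x] - C[x + 1]."""
    top = max(dist)                                   # largest possible demand
    c = [expected_penalty(dist, x) for x in range(top + 2)]
    increments = [c[x] - c[x + 1] for x in range(top + 1)]
    return c, increments


C3, INC3 = penalty_table(D3)
C4, INC4 = penalty_table(D4)

# Convexity condition (9) is equivalent to non-increasing increments.
convex3 = all(INC3[x] <= INC3[x - 1] for x in range(1, len(INC3)))
convex4 = all(INC4[x] <= INC4[x - 1] for x in range(1, len(INC4)))

print([str(v) for v in C3], convex3)
print([str(v) for v in C4], convex4)
```

The values `C3[1]` and `C4[3]` reproduce (14) and (15), and `C3[0]` and `C4[0]` equal the expected demands 3 and 2, in line with the identity $C_j(0) = f_j E[D_j]$.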

To transform the stochastic model into an expanded transportation problem, 
think of each possible unit of demand comprised in $D_3$ and $D_4$ as a destination all 
by itself. In the example, there are five destinations for $D_3$, since the maximum 
possible value for $D_3$ is 5; similarly, there are four destinations for $D_4$. Each of 
these destinations has a demand requirement of one unit. This transformation 
means that the sum of the demand requirements is now 21 (= 7 + 5 + 5 + 4). 
Since the total supply available in Fig. 16.6 is 17 (= 6 + 1 + 10), it also is 
necessary to add a fourth (fictitious) supply point with $S_4 = 4$ (= 21 − 17). So a 
unit of supply from $S_4$ really represents an unfilled demand. Assume that the 
demand requirements $D_1$ and $D_2$ must be satisfied, and $S_4$ consequently cannot 
ship to these two destinations. The resultant transportation tableau is shown in 
Fig. 16.8. 

For i = 1, 2, 3, the $c_{ij}$ in Fig. 16.8 correspond to those in Fig. 16.6, and are 
equal in each of the columns associated with the same $D_j$. The $c_{ij}$ for i = 4 require 
some explanation. These are the incremental values $C_j(x_j) - C_j(x_j + 1)$ as shown 
in Fig. 16.7. Observe that the sum of the incremental values, starting at the bottom 
of Fig. 16.7 and moving upward, equals $C_j(x_j)$. Because $C_j(x_j)$ is decreasing and 
convex, the coefficients of $f_j$ are successively smaller as you progress from left 
to right. (Notice that where two coefficients are the same, a plus sign is placed on 
the leftmost so that you may consider it to be "slightly" larger.) It follows that for 
each $D_j$ an optimal solution will allocate $S_4$ to the columns starting on the right 
and moving progressively to the left. Therefore, in an optimal solution, the sum 
of the costs associated with allocating $S_4$ (unfilled demand) to the columns associated 
with each $D_j$ will equal the value of $C_j(x_j)$. 
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For example, suppose three units of •S '4 are allocated to columns 5, 6 , and 7 
associated with Z),. Then Xj = 2 (explain why), and 

$C_3(2) = \tfrac{2}{3} f_3 + \tfrac{1}{3} f_3 + \tfrac{1}{3} f_3 = \tfrac{4}{3} f_3$

in agreement with Fig. 16.7. (Since $S_4 = 4$, you know that $x_3 \ge 1$ and, therefore,
the first entry for $D_3$ in the fourth row, having a cost $1 \cdot f_3$, can be eliminated from
consideration. Suppose $S_4 = 3$; then $x_3 \ge 2$ and, therefore, the first two entries
for $D_3$ in the fourth row can be eliminated. A further simplification would be to
combine the corresponding columns and let the associated demand requirement
for the composite column be 2.)

If the $C_j(x_j)$ were not convex, there would be no guarantee that an optimal
solution would allocate units of unfilled demand, represented by $S_4$, starting at
the highest level of possible demand and working progressively downward. In
summary, the minimum total cost solution to Fig. 16.8, for given values of $f_3$
and $f_4$, will correspond to (6), and can be found by applying the transportation
algorithm in Sec. 7.3.
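
The role of convexity in the fourth-row coefficients can be checked numerically. The sketch below is only an illustration under stated assumptions: it takes the expected cost of undersupply to be a linear penalty, C(x) = f · E[max(D − x, 0)], and it uses a made-up three-point demand distribution rather than the data of Fig. 16.7. The function name `incremental_undersupply_costs` is hypothetical.

```python
from fractions import Fraction


def incremental_undersupply_costs(dist, penalty=1):
    """Return [C(x) - C(x + 1) for x = 0, ..., max demand - 1].

    C(x) = penalty * E[max(D - x, 0)] is the assumed expected cost of
    undersupply when x units are shipped; `dist` maps each demand value
    to its probability.
    """
    def expected_shortage(x):
        return sum(p * max(d - x, 0) for d, p in dist.items())

    return [penalty * (expected_shortage(x) - expected_shortage(x + 1))
            for x in range(max(dist))]


# Assumed distribution (not read from Fig. 16.7): D = 1, 3, or 5, each w.p. 1/3.
dist = {1: Fraction(1, 3), 3: Fraction(1, 3), 5: Fraction(1, 3)}
increments = incremental_undersupply_costs(dist)

# Coefficients of the penalty for the five unit columns, left to right.
print([str(v) for v in increments])

# Shipping x = 2 real units leaves the three rightmost columns to the
# fictitious supply point; their coefficients add up to C(2) / penalty.
print(sum(increments[2:]))
```

Because the increments never rise from left to right, a cost-minimizing solution fills the rightmost unit columns with fictitious supply first, which is what makes the column sums reproduce C(x).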

Sensitivity analysis. Having established an enlarged transportation
problem that is mathematically equivalent to the two-stage model, you are ready
to assess the impact of uncertainty. In particular, you can explore the sensitivity
of the solution to the values of $f_3$ and $f_4$. Since total available supply is 17
(= $S_1 + S_2 + S_3$), and demand of 12 (= $D_1 + D_2$) must be satisfied, there are 5
(= 17 − 12) units of supply to allocate in total to $D_3$ and $D_4$. This means that
the only possible solutions are

(17)  $x_3 = 1, 2, \ldots, 5$  and  $x_4 = 5 - x_3$.

Given each of these values for $x_3$ and $x_4$, the corresponding optimal solutions for
all the $x_{ij}$ are shown in Fig. 16.9 (these can be found by solving each associated
deterministic problem, such as in Fig. 16.6). Of course, which of these specific
allocations is truly optimal for the expected cost objective function (6) depends on
the particular values of $f_3$ and $f_4$. Actually, the optimal allocation will depend on
the value of $f_3$ relative to $f_4$ and on their absolute magnitude. For three values of
$f_3/f_4$, Fig. 16.9 also shows the penalty-cost values for which each possible solution
is optimal.

[Figure 16.9. Optimal Allocations, With Given Values of $(x_3, x_4)$ and $(f_3, f_4)$.
Columns: $x_3$; $x_4$; optimal allocation ($x_{23}$, $x_{24}$, $x_{34}$); total expected cost;
ratios $f_3/f_4$. Footnote: and $x_{12} = 5$, $x_{13} = 1$, $x_{31} = 7$.]

Observe that:

(i) If the ratio $f_3/f_4$ is large, such as 3, then more units are allocated to the
third destination as the penalty costs increase.

(ii) If the ratio $f_3/f_4$ is small, such as 1/3, then the minimal amount is allocated
to the third destination no matter what the magnitude of the penalty costs.

(iii) There exist values for $f_3$ and $f_4$ that can make each of the possible
allocations optimal.


► You can measure the economic effect of uncertainty by comparing the minimal
expected cost from (6) against what the expected cost would be if you could obtain a
perfect forecast of $D_3$ and $D_4$ prior to making the allocation. In order to make the latter
calculation, assume that the penalty cost is large enough so that it is always preferable to fill demands
whenever possible, and that all five units are shipped to the third and fourth destinations,
even when $(D_3 + D_4)$ is less than five. Then it can be shown that


(i) 


E 


total transportation 
and penalty cost with 
perfect information 


lOOj -f for ^ — 3 

95 “1“ tt/ for = 1 and 


and, consequently, 




(ii) 


expected cost of uncertainty = 


""1 + t/s 


—6 + 1 A/ 


for = 3 
JA 

tot fy-=\ 

JA 


for a sufficiently large penalty cost. The quantities in (ii) represent the maximum amount you should
be willing to pay if, as a result, you obtain a perfect prediction. The cost of uncertainty
is linear in the penalty cost; for sufficiently large values, this cost is greatest for $f_3/f_4 = 1/3$, since at this ratio
the least amount of supply is allocated to $D_3$.


Chance-constrained approach. You will find it enlightening to
compare the foregoing two-stage model with a chance-constrained version, which
consists of the deterministic objective function (1), the supply constraints (2),
the demand constraints (3) for $j = 1, 2$, the nonnegativity conditions (4), and the
chance-constraints

(18)  $P[D_3 \le x_3] \ge \beta_3$  and  $P[D_4 \le x_4] \ge \beta_4$  (chance-constraints),

where the total shipment to Destination $j$ is defined by (7). Thus (18) requires
that for $j = 3, 4$ the probability of meeting demand $D_j$ should be at least $\beta_j$,
which is presumably large. [Note that in (18) the protection is against
underfulfillment of demand, whereas in (4) and (5) of Sec. 16.5 the protection was
against overfulfillment.]

Given values for $\beta_3$ and $\beta_4$, the chance-constrained model can be solved by
letting the demand constraints be

(19)  $\sum_{i=1}^{m} x_{ij} \ge B_j$  for $j = 3$ and 4  (deterministic equivalents),

where $B_j$ is the smallest number satisfying

(20)  $P[D_j \le B_j] \ge \beta_j$.

The rest of the model consists of (1), (2), and (3) for $j = 1$ and 2.

Once again, the only possible distributions of the five units available for the third
and fourth destinations are given by (17), and the associated values for $P[D_j \le x_j]$
appear in Fig. 16.10. There are two important conclusions to draw from Fig.
16.10. First, there may not be a feasible solution if the values for $\beta_3$ and $\beta_4$ are
both too large. For example, if $\beta_3 = 2/3$, then there is no feasible solution for $\beta_4 > 1/2$.
Second, the allocations $(x_3 = 2, x_4 = 3)$ and $(x_3 = 4, x_4 = 1)$ are never optimal
for any values of $\beta_3$ and $\beta_4$. The allocation $(x_3 = 1, x_4 = 4)$ strictly dominates
$(x_3 = 2, x_4 = 3)$, since the corresponding probabilities of meeting $D_j$, for $j = 3$,
4, are at least as good and the total transportation cost is less. Explain why $(x_3 = 4,
x_4 = 1)$ would never be optimal.
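
The dominance argument can be checked by brute force. The following sketch is an illustration only: the five rows are the costs and probabilities as read from Fig. 16.10 (treat them as assumptions if your copy of the figure differs), and the helper `best_allocation` is a hypothetical name. It scans a grid of protection levels and records which allocations are ever the cheapest feasible choice.

```python
from fractions import Fraction as F

# Rows as read from Fig. 16.10:
# (x3, x4, transportation cost, P[D3 <= x3], P[D4 <= x4]).
ROWS = [
    (1, 4, 89, F(1, 3), F(1)),
    (2, 3, 94, F(1, 3), F(3, 4)),
    (3, 2, 100, F(2, 3), F(1, 2)),
    (4, 1, 106, F(2, 3), F(1, 2)),
    (5, 0, 112, F(1), F(1, 4)),
]


def best_allocation(beta3, beta4):
    """Cheapest (x3, x4, cost) meeting both chance-constraints, or None."""
    feasible = [r for r in ROWS if r[3] >= beta3 and r[4] >= beta4]
    if not feasible:
        return None
    x3, x4, cost, _, _ = min(feasible, key=lambda r: r[2])
    return (x3, x4, cost)


# Try every pair of protection levels on a grid of twelfths.
levels = [F(k, 12) for k in range(1, 13)]
ever_optimal = set()
for b3 in levels:
    for b4 in levels:
        choice = best_allocation(b3, b4)
        if choice is not None:
            ever_optimal.add(choice[:2])

print(sorted(ever_optimal))
print(best_allocation(F(2, 3), F(3, 4)))
```

A grid of twelfths is enough here because every probability in the table is a multiple of 1/12, so no finer protection level can change the feasible set.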

This divergence between the previous two-stage and the chance-constrained 
approaches points out that there is a fundamental difference between the two 
models: the set of possible optimal solutions can be distinct. Do you think it is 
reasonable to rule out the two above-mentioned allocations? Why? 


$x_3$   $x_4$   Total Transportation Cost   $P[D_3 \le x_3]$   $P[D_4 \le x_4]$
  1       4               89                      1/3                 1
  2       3               94                      1/3                3/4
  3       2              100                      2/3                1/2
  4       1              106                      2/3                1/2
  5       0              112                       1                 1/4

Figure 16.10. Probabilities of Meeting Demand Requirements, With Given
Values of $(x_3, x_4)$.


* 16.7 MULTISTAGE LINEAR MODEL 

The mathematical logic that permitted you to transform a two-stage stochastic
linear programming model with uncertain elements to an ordinary linear
programming problem can be applied just as well to multistage models. As before,
you must assume there are only a finite number $Q$ of possible sets of values for
the random elements. In a real multistage situation, $Q$ is likely to be so large that
it would be impractical to solve the resultant linear programming problem;
therefore, you should study the analysis below with the idea of gaining further
insight into the problems of decision-making under uncertainty. Only infrequently
will you find the model of practical use, however.

Let the deterministic version of the problem be described as

(1)  maximize  $\sum_{j=1}^{n} c_j x_j$

subject to

(2)  $\sum_{j=1}^{n} a_{ij} x_j = b_i$  for $i = 1, 2, \ldots, m$

(3)  $x_j \ge 0$  for $j = 1, 2, \ldots, n$.

Assume for the stochastic version that the $c_j$, $a_{ij}$, and $b_i$ can be random.
Specifically, suppose there are $Q$ possible sets of values for these elements. Denote these
sets of values by $(c_{qj}, a_{qij}, b_{qi})$, and the associated probability by $p_q$. We give a
three-step procedure for formulating the appropriate linear programming problem.
The procedure is completely general, so that it will work for any multistage
model. You will often be able to take some shortcuts in applying the approach,
however, when considering a particular model.

The idea is to start with a formulation that assumes you know which of the $Q$
possible outcomes has occurred before you have to make any decisions. If such
a formulation were a true representation of the problem, then you would have $Q$
sets of values or decision rules for the variables, the composite of these representing
a complete strategy. The next step is to recognize that in fact you will not know
some of the random elements when you have to decide on the levels of particular
variables. Therefore, you have to revise the initial formulation by inserting
constraints to indicate that the levels for these particular variables must be identical
in the appropriate sets of decision rules. To illustrate, each first-stage variable must
have the same level in all $Q$ sets of decision rules. The final step is to tidy up the
appearance of the model after applying the second step.

The method in detail is as follows.

Step 1. Write the problem that would be appropriate if you could learn the
exact values of all the uncertain elements prior to deciding the levels for the $x_j$:

(4)  maximize  $\sum_{q=1}^{Q} p_q \sum_{j=1}^{n} c_{qj} x_{qj}$

subject to

(5)  $\sum_{j=1}^{n} a_{qij} x_{qj} = b_{qi}$  for $i = 1, 2, \ldots, m$ and $q = 1, 2, \ldots, Q$,

(6)  all $x_{qj} \ge 0$,

where $x_{qj}$, for $j = 1, 2, \ldots, n$, are the levels associated with outcome $(c_{qj}, a_{qij}, b_{qi})$.

Step 2. Account for the actual sequence of information specified in the
particular multistage model by equating the appropriate $x_{qj}$. For example, if the
level of $x_1$ must be determined before any of the random elements are known,
then add to the constraints (5) and (6) the restrictions $x_{11} = x_{21} = \cdots = x_{Q1}$.

Step 3. Use the restrictions in Step 2 to simplify the form of the resultant
problem. For the example in Step 2, put $x_{11}$ wherever $x_{q1}$, for $q = 2, 3, \ldots, Q$,
appears; then collect terms and eliminate any redundant equations.

It is important that you realize that the optimal values for $x_{qj}$ in (4), (5), and
(6) are precisely the decisions to be taken if and when the values $(c_{qj}, a_{qij}, b_{qi})$ for
the random elements actually occur. Consequently, the analysis provides a
complete prescription of the actions to be taken for every possible eventuality. Rather
than elaborate further on the details of these steps so as to remove any ambiguities,
we present an example that will serve as well in showing you how to apply the
technique.

Production scheduling example. Consider a firm that must schedule
its production level $x_t$ for $t = 1, 2, 3$. Let $D_t$ be demand occurring in Period $t$,
and assume that any part of this demand not satisfied in Period $t$ is lost forever.
Let $i_t$ be the inventory level at the end of Period $t$; assume that initial inventory
is 0 and inventory at the end of Period 3 is worthless. Let $c_t$ be the unit profit
associated with producing an item in Period $t$ and selling it at any time during the
horizon; assume $c_t = r - e_t$, where $r$ is the unit selling price and $e_t$ the unit
production cost in Period $t$. Let $h_t$ be the unit cost of holding an item in inventory
at the end of Period $t$. Then the deterministic version of this model can be stated as

(7)  maximize  $\sum_{t=1}^{3} c_t x_t - \sum_{t=1}^{2} h_t i_t - (h_3 + r) i_3$

subject to

(8)   $x_1 - i_1 + s_1 = D_1$  (inventory balance, period 1)

(9)   $i_1 + x_2 - i_2 + s_2 = D_2$  (inventory balance, period 2)

(10)  $i_2 + x_3 - i_3 + s_3 = D_3$  (inventory balance, period 3)

(11)  all $x_t, i_t, s_t \ge 0$,

where $s_t$ is the level of lost sales in Period $t$.

The model is quite similar to that in Sec. 8.3 (see Fig. 8.6), but there are two
points of difference requiring comment. First, the variables $s_t$ allow for unfilled
demand. [How would you modify (8) and (9) to permit backlogging of unfilled
demand?] Second, the objective function is "maximize profit." Since the selling
price $r$ is included in $c_t$ (the coefficient for a production variable), the coefficient of
$i_3$ contains $r$ to indicate that revenue is not in fact realized on items left over at
the end of Period 3. For given values of $D_t$, the model is trivial to solve: assuming
all $c_t > 0$, fill demand in Period $t$ by production in that Period $k$, $k \le t$, where the
value of $c_k$ minus accumulated inventory cost is maximal. You hardly need a
model to obtain this common sense answer.
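
The common sense rule just described is easy to state in code. The sketch below is illustrative only: the unit profits and holding costs are made-up numbers, and `best_production_period` is a hypothetical helper. It assumes an item made in Period k and sold in Period t is held at the end of Periods k through t − 1, so its net value is the unit profit of Period k minus those holding costs.

```python
def best_production_period(t, c, h):
    """Return (k, net): the period k <= t whose production best serves Period t.

    c[k - 1] is the unit profit for Period k and h[k - 1] the unit holding
    cost charged at the end of Period k (periods are numbered from 1).
    """
    best_k, best_net = None, None
    for k in range(1, t + 1):
        # Held at the end of Periods k, ..., t - 1 before being sold in t.
        net = c[k - 1] - sum(h[k - 1:t - 1])
        if best_net is None or net > best_net:
            best_k, best_net = k, net
    return best_k, best_net


# Hypothetical data for a three-period horizon.
c = [5, 3, 4]
h = [1, 1, 1]
plan = {t: best_production_period(t, c, h) for t in (1, 2, 3)}
print(plan)
```

With these numbers, Period 2 demand is best met from Period 1 production, while Period 3 demand is best met by producing in Period 3 itself.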
But now suppose that the problem contains elements of uncertainty. Specifically,
let the values of $D_t$ be random. What are optimal levels for $x_t$? To answer
this question, you have to know more about the probability law that generates
the values for $D_t$, and know more about the sequence in which the individual levels
$x_t$ are set and demand information becomes available. Assume that the values for
$D_t$ are independent of $x_t$. Further suppose that $D_1$ has two possible values. For
each value of $D_1$, the demand $D_2$ may be either one of two values; in other words
there are four possible values of $D_2$. Similarly, there are two possible values of
$D_3$ for each possible value of $D_1$ and $D_2$; therefore $D_3$ may be any one of eight
possible values. In short, the $D_t$ are dependent random variables, and there are
$Q = 8$ possible sets of values for $(D_1, D_2, D_3)$, which are denoted by $(D_{q1}, D_{q2},
D_{q3})$, for $q = 1, 2, \ldots, 8$, and are summarized in Fig. 16.11.

[Figure 16.11. Possible Demand Sequences. A tree of the eight sequences
$(D_{q1}, D_{q2}, D_{q3})$ with probabilities $p_1, \ldots, p_8$: in Period 1, $D_{11} = D_{21} = D_{31} = D_{41}$
and $D_{51} = D_{61} = D_{71} = D_{81}$; in Period 2, $D_{12} = D_{22}$, $D_{32} = D_{42}$, $D_{52} = D_{62}$,
and $D_{72} = D_{82}$.]

To see how the sequence of information and decisions affects the formulation of
the model, consider four different situations:


Perfect Information. Assume that the specific values of all the random elements are
known prior to setting the levels of any of the decision variables. Then the model
described in Step 1 above gives the appropriate linear programming problem:

(12)  maximize  $\sum_{q=1}^{8} p_q \left[ \sum_{t=1}^{3} c_t x_{qt} - \sum_{t=1}^{2} h_t i_{qt} - (h_3 + r) i_{q3} \right]$

subject to

(13)  $x_{q1} - i_{q1} + s_{q1} = D_{q1}$

(14)  $i_{q1} + x_{q2} - i_{q2} + s_{q2} = D_{q2}$

(15)  $i_{q2} + x_{q3} - i_{q3} + s_{q3} = D_{q3}$

(16)  all $x_{qt}, i_{qt}, s_{qt} \ge 0$

for $q = 1, 2, \ldots, 8$.

All together, there are 72 variables and 24 equality constraints. [Of course, to
obtain a numerical solution to this model, you can factor (12) through (16) into
$Q = 8$ separate linear programs. Each of these can be solved trivially since it has
the same structure as the deterministic model (7) through (11). To calculate
(12) you would weight the resultant optimal value of the objective function for
each small problem by the associated $p_q$.]

Learn Demand — Then Produce. Suppose before setting the level of Period $t$
production, you know the exact value of $D_t$ and previous demands. Then in accord
with Step 2 above, you must add the restrictions shown in Fig. 16.12. For example,
if $D_1 = D_{11} = \cdots = D_{41}$, then the levels of first-period production, ending
inventory, and lost sales will be the same for $q = 1, 2, 3, 4$. Similarly if
$D_2 = D_{32} = D_{42}$, then the levels of second-period production, inventory, and lost
sales will be the same for $q = 3, 4$.

To carry out Step 3, it is helpful to use a single symbol each time two or more
quantities are equal. For example, use $x_{11}$ wherever $x_{21}$, $x_{31}$, and $x_{41}$ appear, and use
$i_{11}$, $s_{11}$, and $D_{11}$ analogously. You can see
immediately that the constraints (13) for $q = 1, 2, 3, 4$ are identical, and therefore
only one of these needs to be written. Thus the simplified form of the problem
contains the variables

(17)  $x_{qt}, i_{qt}, s_{qt}$  for  $q = 1, 5$ if $t = 1$;  $q = 1, 3, 5, 7$ if $t = 2$;  $q = 1, 2, \ldots, 8$ if $t = 3$,

and consists of constraints (13) for $q = 1, 5$, (14) for $q = 1, 3, 5, 7$, (15) for
$q = 1, 2, \ldots, 8$, and (16). All together, there are 42 variables and 14 equality
constraints in the reduced model. Write the simplified form of the objective
function, and verify, for example, that the coefficients of $x_{11}$ and $x_{51}$ are $(p_1 + \cdots
+ p_4) c_1$ and $(p_5 + \cdots + p_8) c_1$, respectively.

Produce — Then Learn Demand. Suppose before setting the level of Period $t$
production, you know only the exact values of previous demands. The additional
restrictions according to Step 2 are shown in Fig. 16.13. Notice that all the $x_{q1}$ must be
identical, since the production level in Period 1 is set prior to any exact knowledge
of demand. Also observe that the equality restrictions on $i_{q1}$, $s_{q1}$, and $x_{q2}$ occur for
the same values of $q$ because the levels of these variables are established with the
same amount of information, namely, the exact value of $D_1$.

Figure 16.12. Additional Restrictions for "Learn Demand — Then Produce."

  Period 1 variables $(x_{q1}, i_{q1}, s_{q1})$:
    $x_{11} = x_{21} = x_{31} = x_{41}$,  $i_{11} = i_{21} = i_{31} = i_{41}$,  $s_{11} = s_{21} = s_{31} = s_{41}$
    $x_{51} = x_{61} = x_{71} = x_{81}$,  $i_{51} = i_{61} = i_{71} = i_{81}$,  $s_{51} = s_{61} = s_{71} = s_{81}$

  Period 2 variables $(x_{q2}, i_{q2}, s_{q2})$:
    $x_{12} = x_{22}$,  $i_{12} = i_{22}$,  $s_{12} = s_{22}$
    $x_{32} = x_{42}$,  $i_{32} = i_{42}$,  $s_{32} = s_{42}$
    $x_{52} = x_{62}$,  $i_{52} = i_{62}$,  $s_{52} = s_{62}$
    $x_{72} = x_{82}$,  $i_{72} = i_{82}$,  $s_{72} = s_{82}$

[Figure 16.13. Additional Restrictions for "Produce — Then Learn Demand."]

After performing Step 3, the simplified form of the problem contains the variables

(18)  $x_{11}$;  $i_{q1}, s_{q1}, x_{q2}$ for $q = 1, 5$;  $i_{q2}, s_{q2}, x_{q3}$ for $q = 1, 3, 5, 7$;  and
      $i_{q3}, s_{q3}$ for $q = 1, 2, \ldots, 8$,

and the constraints (13) for $q = 1, 5$, (14) for $q = 1, 3, 5, 7$, (15) for $q = 1, 2, \ldots,
8$, and (16). All together, there are 35 variables and 14 equality constraints in the
reduced model. Write the appropriate objective function, and verify that the
coefficient of $x_{11}$ is just $c_1$.

Full Commitment. Finally, suppose all the production levels must be set before
you know the exact values for any $D_t$. Verify that the simplified form of the model
contains the variables

(19)  $x_{11}, x_{12}, x_{13}$;  $i_{q1}, s_{q1}$ for $q = 1, 5$;  $i_{q2}, s_{q2}$ for $q = 1, 3, 5, 7$;  and
      $i_{q3}, s_{q3}$ for $q = 1, 2, \ldots, 8$,

and the same set of constraints as in the case "Produce — Then Learn Demand."
Thus there are 31 variables and 14 equality constraints.
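
The variable and constraint counts quoted for the four situations follow from one observation: a variable needs one copy for each distinct demand history that is known when its level is set. The sketch below is only a counting aid under that reading of Steps 2 and 3; the binary encoding of the demand tree and the names `copies`, `count_variables`, and `CASES` are hypothetical.

```python
from itertools import product

# Eight demand sequences: each period branches two ways, as in the demand tree.
SCENARIOS = list(product((0, 1), repeat=3))


def copies(known):
    """Distinct decision rules needed when `known` demands have been observed."""
    return len({s[:known] for s in SCENARIOS})


def count_variables(info):
    """Sum the copies of each variable over Periods 1, 2, 3."""
    return sum(copies(known(t)) for known in info.values() for t in (1, 2, 3))


# For each case: how many demands are known when x_t, i_t, and s_t are fixed.
CASES = {
    "perfect information": {"x": lambda t: 3, "i": lambda t: 3, "s": lambda t: 3},
    "learn demand, then produce": {"x": lambda t: t, "i": lambda t: t, "s": lambda t: t},
    "produce, then learn demand": {"x": lambda t: t - 1, "i": lambda t: t, "s": lambda t: t},
    "full commitment": {"x": lambda t: 0, "i": lambda t: t, "s": lambda t: t},
}

counts = {name: count_variables(info) for name, info in CASES.items()}

# One balance equation per distinct copy of the ending inventory i_t.
constraints = {name: sum(copies(info["i"](t)) for t in (1, 2, 3))
               for name, info in CASES.items()}

for name in CASES:
    print(name, counts[name], constraints[name])
```

The same bookkeeping extends to longer horizons or wider branching by changing `SCENARIOS`, which also shows how quickly the model grows with the number of outcomes.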

In going from the first case to the last, there is progressively less information
available as the production levels are established. Consequently, the successive
optimal values for the objective function will usually decrease (and will never
increase). (The supporting mathematical argument is that each successive problem
adds more equality constraints and therefore further restricts the possible levels of
the variables.) The "Perfect Information" case represents the best you could do
with a perfect predictor of the random elements. For any of the other situations,
the difference between the optimal objective function value and the best you can
do from the "Perfect Information" case measures the cost of uncertainty.

Other approaches. In some situations, like the above production scheduling
example, you can use other techniques of model formulation for the multistage
problem. One such approach is adding chance-constraints, as was done in Secs.
16.5 and 16.6. When there is a finite number $Q$ of possible sets of values for the
random elements, the chance-constrained approach may involve a smaller-size
linear programming problem than does the above technique. But the complexity
of keeping distinct the variables and constraints associated with the particular
sets of information about values of the random elements is inherent in the problem;
no correct method can avoid it. When the random variables in a period are
completely independent of the variables in other periods, then a dynamic
programming technique often can be applied, as will be illustrated in the next chapter.

* 16.8 QUADRATIC CRITERION FUNCTION 
(linear decision rule) 

The following multistage model with uncertain elements represents a situation
where you can make optimal decisions while you only know the expected value of
the random elements. The time sequence of decisions and emerging information
about the values of the random variables can be as complex as in the preceding
section. However, assume the constraints are of the form

(1)  $x_i + \sum_{j=m+1}^{n} a_{ij} x_j = b_i$  for $i = 1, 2, \ldots, m$,

where $x_j$, for $j = 1, 2, \ldots, n$, are unconstrained in sign, all $a_{ij}$ are known exactly, but
the $b_i$ are random with known expectation. As the example below will demonstrate,
there are situations where the resultant optimal policy will yield all $x_j \ge 0$; there
also are situations where the $x_j$ represent deviations about given target values, so
that negative levels for $x_j$ are meaningful.

Further, suppose that the objective function can be written as minimizing the
expected value of a quadratic form

(2)  $\sum_{j=1}^{n} \sum_{k=1}^{n} c_{jk} x_j x_k + \sum_{j=1}^{n} c_j x_j$,

where the quadratic part of (2) is strictly positive for any levels $x_j$, for $j = 1, 2, \ldots, n$,
except all $x_j = 0$. Assume all the $c_{jk}$ are known; the $c_j$ may be random.

As usual, define the first-stage decisions as the levels of those variables that must
be set before knowing the exact values of any of the random elements. It can be
proved that the optimal levels of the first-stage variables are found by acting as if
the model were deterministic:

Step 1. Insert the unconditional expectation for each $c_j$ and $b_i$ in (1) and (2).

Step 2. Substitute the $x_i$, for $i = 1, 2, \ldots, m$, from (1) into (2).

Step 3. Collect terms in $x_j$ for $j = m + 1, \ldots, n$.

Step 4. Set equal to 0 the partial derivatives of the quadratic form with respect
to $x_j$, for $j = m + 1, \ldots, n$.

Step 5. Solve the resultant $(n - m)$ linear equations with respect to $x_j$, where
$j = m + 1, \ldots, n$.

Step 6. Determine the associated values of $x_i$, for $i = 1, 2, \ldots, m$, in (1).

An important implication of these steps is that the optimal levels of the
first-stage variables depend linearly on the $E[c_j]$ and $E[b_i]$. Consequently this problem
is sometimes referred to as the linear decision rule model. After the levels
of the first-stage variables have been set according to the above, wait until the
next values of the random elements become known, and repeat the procedure.
This entails renaming as first-stage variables what were previously second-stage
variables, etc., and possibly recalculating the expectations for the $c_j$ and $b_i$ if the
new information indicates that the future expectations should be revised. The
following, completely hypothetical, example illustrates the approach.

Employment smoothing example. The manager of the Greasy Spoon
Restaurant must determine the number of waitresses to have on duty during
the rush hours each Day $t$, for $t = 1, 2, \ldots, T$. It is costly to change this number
from day to day, because of such factors as employee morale and limited
availability of experienced waitresses. Having either too many or too few waitresses
is also expensive, because of the extra wages or the loss in revenue from customers
who will not wait in a crowded line. However, it is not possible to know exactly
how many waitresses are needed because of daily fluctuations in the number of
customers and the amount of service they require. Let $D_t$ denote the number of
waitress-service hours that would be required by the restaurant's potential
customers during the rush period on Day $t$. Let $y_t$ denote the number of such hours
scheduled ahead of time by the manager. Let $f_t$ be the level of fluctuation in the
scheduled hours between Day $t - 1$ and Day $t$, and let $e_t$ be the level of error
between the actually scheduled and the required number of hours:

(3)  $f_t = y_{t-1} - y_t$  for $t = 1, 2, \ldots, T$  (employment fluctuation)

(4)  $e_t = y_t - D_t$  for $t = 1, 2, \ldots, T$  (demand fluctuation),

where $y_0$ is the employment level right before Day 1. In this example, the variables
$f_t$ and $e_t$ correspond to $x_i$, for $i = 1, 2, \ldots, m$, and the variables $y_t$ to $x_j$, for
$j = m + 1, \ldots, n$ in (1).

Assume the objective function is to minimize the expected value of

(5)  $\sum_{t=1}^{T} (f_t^2 + c e_t^2)$  (quadratic criterion)

subject to (3) and (4), where $c \ge 0$ is a given constant. Note if $c = 0$, then an
optimal solution is $y_t = y_0$ for $t = 1, 2, \ldots, T$; if $c$ is arbitrarily large, then
$y_t = E[D_t]$ is optimal. The first-stage decision is the level of $y_1$.

According to Steps 1 and 2 in the procedure above, insert $E[D_t]$ for $D_t$ in (4),
substitute the $f_t$ and $e_t$ in (5), yielding

(6)  $C = \sum_{t=1}^{T} \left[ (y_{t-1} - y_t)^2 + c (y_t - E[D_t])^2 \right]$.

Then collecting terms and setting $\partial C / \partial y_t = 0$, for $t = 1, 2, \ldots, T$, you obtain $T$
linear equations in $T$ unknowns:

(7)  $(2 + c) y_1 - y_2 = c E[D_1] + y_0$

(8)  $-y_{t-1} + (2 + c) y_t - y_{t+1} = c E[D_t]$  for $t = 2, \ldots, T - 1$

(9)  $-y_{T-1} + (1 + c) y_T = c E[D_T]$.

You can solve (7) through (9) for given values of $c$, $E[D_t]$, and $y_0$. The resultant
value of $y_1$ is the optimal level for the first-stage decision, and depends linearly
on the $E[D_t]$.
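
Setting the derivative of the quadratic cost with respect to each scheduled level to zero gives a tridiagonal linear system: coefficient 2 + c on the diagonal (1 + c in the last row), −1 on both off-diagonals, and right-hand side c·E[D_t], with y_0 added in the first row. The sketch below solves that system by forward elimination and back substitution (the Thomas algorithm). It is an illustration with made-up demand expectations; `solve_schedule` is a hypothetical name.

```python
def solve_schedule(c, expected_demand, y0):
    """Solve the T first-order conditions for the scheduled hours y_1..y_T.

    Row t reads -y[t-1] + (2 + c) * y[t] - y[t+1] = c * E[D_t]; the first row
    carries y0 on the right-hand side and the last row has diagonal 1 + c.
    """
    T = len(expected_demand)
    diag = [2.0 + c] * T
    diag[-1] = 1.0 + c
    rhs = [c * d for d in expected_demand]
    rhs[0] += y0

    # Forward elimination of the sub-diagonal entries (all equal to -1).
    for t in range(1, T):
        w = -1.0 / diag[t - 1]
        diag[t] += w              # diag[t] - w * (super-diagonal entry -1)
        rhs[t] -= w * rhs[t - 1]

    # Back substitution.
    y = [0.0] * T
    y[-1] = rhs[-1] / diag[-1]
    for t in range(T - 2, -1, -1):
        y[t] = (rhs[t] + y[t + 1]) / diag[t]
    return y


# Hypothetical expectations of required hours for a four-day horizon.
demand = [10.0, 12.0, 8.0, 11.0]
schedule = solve_schedule(1.0, demand, 9.0)
print([round(v, 3) for v in schedule])
```

Two quick sanity checks match the limiting cases in the text: with c = 0 the solution stays at y_0 every day, and with a very large c it tracks the expected demands.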

Suppose the planning horizon really is unbounded, and the level of $y_1$ is to be
set so that it is optimal for any arbitrarily large $T$. Then it can be shown that the
solution of (7) through (9) for $y_1$, as $T \to \infty$, is

(10)  $y_1 = \alpha (y_0 + c E[D_1]) + \sum_{t=2}^{\infty} \alpha^t c E[D_t]$  (unbounded horizon),

assuming the summation on the right converges, where

(11)  $\alpha = \tfrac{1}{2} \left[ (2 + c) - \sqrt{c^2 + 4c} \right]$,

so that $0 < \alpha \le 1$ for $0 \le c < \infty$, with $\alpha < 1$ whenever $c > 0$. Once again, note
that $y_1$ in (10) is a linear function of the $E[D_t]$.

After observing an actual value for $D_1$, the manager may revise his expectations
about subsequent $D_t$, so that on the second day, the calculation (10) would be
repeated with $E[D_1]$ set equal to the new expectation of service requirements for
the second day, etc. However, if $E[D_t] = D^*$, for all $t$, regardless of previous
values for $D_t$, that is, if the expectation of requirements remains stationary, then
the values for all future $y_t$ are given by

(12)  $y_t = \alpha y_{t-1} + (1 - \alpha) D^*$  (stationary demand),

for $0 < c < \infty$, and $y_t \to D^*$ as $t \to \infty$.
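
The stationary rule can be traced numerically. The sketch below rests on one stated assumption: the smoothing weight is the root in (0, 1] of a² − (2 + c)a + 1 = 0, which is what a rule of the form y_t = a·y_{t−1} + constant must satisfy to be consistent with the interior first-order condition −y_{t−1} + (2 + c)y_t − y_{t+1} = c·E[D_t]. The function names and the numbers are hypothetical.

```python
import math


def smoothing_weight(c):
    """Smaller root of a**2 - (2 + c) * a + 1 = 0, which lies in (0, 1]."""
    return ((2.0 + c) - math.sqrt(c * (c + 4.0))) / 2.0


def stationary_path(c, y0, d_star, days):
    """Iterate y_t = a * y_{t-1} + (1 - a) * d_star for the given number of days."""
    a = smoothing_weight(c)
    path, y = [], y0
    for _ in range(days):
        y = a * y + (1.0 - a) * d_star
        path.append(y)
    return path


# Hypothetical case: start at 5 scheduled hours, stationary requirement of 10.
alpha = smoothing_weight(1.0)
path = stationary_path(1.0, 5.0, 10.0, 60)
print(round(alpha, 6), [round(v, 3) for v in path[:5]])
```

A larger c gives a smaller weight on yesterday's level, so the schedule moves toward the stationary requirement more quickly; at c = 0 the weight is 1 and the schedule never moves.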
REVIEW EXERCISES


1 In each case below, state what you think might be both the sources and the impact of 
uncertainty on the management decisions at issue. Also discuss whether performing 
sensitivity analysis on a deterministic model is likely to uncover the impact of 
uncertainty. 

(a) Selecting sites for new warehouses that will serve existing retail outlets. 

(b) Eliminating a product. 

(c) Introducing a new product. 

(d) Choosing the number of elevators to install in a new office building. 

(e) Designing delivery routes for postmen. 

(f) Determining the number of emergency operating rooms in a hospital. 

(g) Floating a new bond issue. (Discuss the situation for a corporation and for a 
municipal government.) 

(h) Determining the frequency of scheduling truck deliveries of gasoline to a major 
oil company’s gas stations. 

(i) Determining the size of a corporation’s bank balance. 

(j) Choosing the number of crews of telephone repairmen. 

(k) Selecting the capacity of a dam. 

(l) Determining the seating capacity of a supersonic airplane. 


2 In each part below, calculate the expected value of the random variable $D$, the
expected value of $D^2$, the expected value of $f(D) = (-1)^D D/(D + 1)$, and $E[f(D|y)]$
in (8) of Sec. 16.2, where $c = 1$, $h = 2$, $p = 5$, and $y = 0, 1, \ldots, 6$.

(a) $P[D = d] = 1/7$ for $d = 0, 1, \ldots, 6$.

(b) $P[D = d] = 1/6$ for $d = 1, 2, \ldots, 6$.

(c) $P[D = d] = 1/6$ for $d = 0, 1, \ldots, 5$.

(d) $P[D = d] = 1/4$ for $d = 0, 2, 4, 6$.

(e) $P[D = d] = 1/3$ for $d = 1, 3, 5$.

(f) P[D = e/] = for (/ = 1, 2, 3, 4. 

(g) P[D = d] = for (/ = 2, 3, 4, 5. 

(h) $P[D = 2j] = \binom{3}{j} (.5)^j (.5)^{3-j}$ for $j = 0, 1, 2, 3$.

(i) P[D = 2j] = forj = 0, 1, 2, 3. 

(j) p[D = 2 j] = forj = o, i, 2 , 3 . 


3 In each of the three parts below, derive an expression analogous to (8) for the indi¬ 
cated function f{D\y), 

(a) 



(b) 

f{D\y) = 
where


^(jv) = 


0 

7^4- cy 


iovy = 0 
for^ > 0. 


(c) f{p\y) = c{y) + h(^y — (/) + />(^, d)^ where 




0 

K + cy 


for^ = 0 

for^ > 0, 


ihriy-d) iiO <y-d^I 

h{y-d) = \h h-{y - d) Uy-d>I 
lO otherwise, 


p{y. d) = 



ifO <d-y^S 

\i d — y > S 
otherwise. 


4 Yum-Yum Company (Sec. 16.2)

(a) Suppose the annual profit from expanding the present facility when the share of
market equals 30% is $90 + e$, where $e > 0$. What is the largest value of $e$ such that
building a new plant remains the optimal decision? Make the same sensitivity
analysis for the profit associated with a 35% market share. With a 40% market
share.

(b) Suppose the annual profit from building a new plant when the share of market
equals 30% is $50 - e$, where $e > 0$. What is the largest value of $e$ such that
building a new plant remains the optimal decision? Make the same sensitivity
analysis for the profit associated with a 35% market share. With a 40% market
share.

(c) Let $p_1$ be the assessed probability that the market share is 30% and $p_2$ that it is
35%, where $p_1 + p_2 = 1$. How large can $p_1$ be such that building a new plant
remains the optimal decision?

*(d) Let $p_1$, $p_2$, $p_3$ be the assessed probabilities that the market share equals 30%, 35%,
and 40%, respectively. Note $p_3 = 1 - p_1 - p_2$. Construct a diagram showing all
the values for $p_1$ and $p_2$ such that building a new plant is the optimal decision.
(Hint: write an inequality between the expected profits for the two decisions that
expresses that building a new plant is optimal, and then simplify. Draw the resultant
solution space diagram for $p_1$ and $p_2$.)

(e) Suppose the assessed probabilities are realistic but, by paying the amount $K$,
the President can learn the actual share of market. In other words, he can find
out what the share of market will be before having to make his decision. What is
the largest value of $K$ that still makes such perfect information worth purchasing?

5 Consider the production policy (10) and the demand distribution (11) in Sec, 16.2. 

(a) Assuming entering inventory in the current period is 2, explain why production 
in the next period will be 0 with probability Explain why production will be 0 
with probability J in the period after next. What is the probability that production 
will be 0 three periods from now? 

(b) Suppose, instead, that jD( = 2 with probability J and = 3 with probability J. 
Revise the probabilities in part (a) accordingly. 
(c) Suppose, instead, that the policy is revised such that



for i = 0, 1, 2 
otherwise 


and Dt = 2 with probability J and Dt = 3 with probability J. Determine the 
probabilities that production will be 0 in the next period, the period after next, 
and three periods from now, assuming entering inventory in the current period is 2. 

(d) Suppose each time production occurs there is a setup cost $K = 4$ and a production
cost per item $c = 1$. There is also a holding cost $h = 2$ per item left over at the
end of the period. Use the information in (12) and (13) to determine the expected
cost per period over an unbounded horizon.

(e) Suppose that the policy in (10) is revised as shown below so that occasionally
demand exceeds inventory on hand, in which case the excess demand is backlogged
until the following period,

$x_\infty(-2) = x_\infty(-1) = x_\infty(0) = 3$,  $x_\infty(1) = 0$;

thus, if entering inventory is 1, no production occurs, and the demand backlogged
is 1 if $D_t = 2$, or 2 if $D_t = 3$, represented by the negative inventory levels −1
and −2, respectively. Revise the steady-state probabilities in (12) and (13)
accordingly. Justify your answer.


*6 Consider the Toute de Suite Bakery case, described in the special material in Sec. 16.2.

(a) What is an optimal solution if the manager wants the probability of meeting the 
total daily demand to be at least { ? At least i ? 

(b) What is the optimal solution if he wants to sell at least ^ a cake a day, on the 
average? If he wants to sell at least one cake a day, on the average? 


7 Consider a three-period revenue stream $(R_1, R_2, R_3)$, and assume that each $R_t$ equals
either 0 or 1. Let the possibility (1,0,0) occur with probability 0, (0, 1,0) with 
probability (0, 1, 1) with probability ■^, (1, 1, 0) with probability and each 
other possibility with probability i. Let d = 

(a) Determine the expected present value of the stream using the joint probability
distribution for (R_1, R_2, R_3); that is, calculate the value R_1 + αR_2 + α²R_3 for
each possible (R_1, R_2, R_3), weight the value by the associated probability, and
sum over all possibilities.

(b) Determine E[R_i] for i = 1, 2, 3. Calculate (15) in Sec. 16.2 and compare your
answer with that in part (a).
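
The comparison asked for in this exercise can be sketched in Python. The joint distribution and the discount factor below are hypothetical values chosen only for illustration (they are not the data of the exercise), and the function names are likewise assumptions. The sketch computes the expected present value directly from the joint distribution and again from the marginal expectations; the two agree because expectation is linear, even though the hypothetical R_i are correlated.

```python
from fractions import Fraction

# Hypothetical discount factor (an assumption, not the exercise's value).
ALPHA = Fraction(9, 10)

# Hypothetical joint distribution over (R1, R2, R3); the periods are correlated.
JOINT = {
    (0, 0, 0): Fraction(1, 10),
    (1, 0, 0): Fraction(2, 10),
    (0, 1, 0): Fraction(1, 10),
    (0, 0, 1): Fraction(1, 10),
    (1, 1, 0): Fraction(2, 10),
    (1, 0, 1): Fraction(0),
    (0, 1, 1): Fraction(1, 10),
    (1, 1, 1): Fraction(2, 10),
}


def present_value(stream, alpha):
    """Return R1 + alpha*R2 + alpha^2*R3 for one stream of values."""
    return sum(alpha ** t * r for t, r in enumerate(stream))


def expected_pv_from_joint(joint, alpha):
    """Weight each stream's present value by its joint probability."""
    return sum(p * present_value(stream, alpha) for stream, p in joint.items())


def marginal_means(joint):
    """Return [E[R1], E[R2], E[R3]] computed from the joint distribution."""
    periods = len(next(iter(joint)))
    return [sum(p * stream[t] for stream, p in joint.items())
            for t in range(periods)]


def expected_pv_from_means(joint, alpha):
    """Discount the marginal expectations instead of the individual streams."""
    return present_value(marginal_means(joint), alpha)


if __name__ == "__main__":
    print(expected_pv_from_joint(JOINT, ALPHA))
    print(expected_pv_from_means(JOINT, ALPHA))
```

Exact fractions are used so that the two results can be compared for equality without rounding error.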

8 Consider each example in exercise 1. Suggest how you might obtain the required 
probability distributions; refer to the four approaches listed in Sec. 16.2. 


9 Galactic Reaper Company (Sec. 16.3). Consider the data in Figs. 16.2 and 16.3. 

(a) Verify the probabilities shown in (1). 

(b) Check that the time span is 5 periods if each random event takes its average 
value. Calculate the actual expected time span. 

(c) Should the new manager be hired if there is only a J probability of his reducing 
the total time span by 1 period? Show the calculations justifying your answer. 

(d) Suppose the manager has a fifty-fifty chance of either reducing the total time
span by 2 periods or not reducing it at all. Assume the incremental profit from 
a total time span of 2 periods is 130. Should the manager be hired?

(e) Suppose the President decides not to hire the new manager, but considers the 
option of paying a fee F to ensure that Task B takes exactly 3 periods. What is 
the largest value for F that makes this option worthwhile? 

(f) Perform the analysis in part (e), but assume that the fee F ensures that Task C 
takes 3 periods. 

(g) Perform the analysis in part (e), but assume that the fee F ensures that Task E 
takes 2 periods. 

(h) Perform the analysis in part (e), but assume that both Tasks B and C each take 
exactly 3 periods. 

(i) What are the largest values for the fees in parts (e), (f), (g), and (h) if the 
President decides to hire the new manager? 

*(j) Suppose Task A can take 1 or 2 periods with a fifty-fifty chance, and similarly, 
Task B can take 3 or 4 periods. Task C can take 3 or 4 periods, and Task E can
take 2 or 3 periods; Task D still requires 1 period. Analyze whether the new 
manager should be hired. Given the optimal decision, calculate the gain from 
perfect information. 

10 Devise an argument establishing the Linear Certainty-Equivalence Theorem in
Sec. 16.4. (Illustrate your argument with the problem maximize c_1 x_1 + c_2 x_2 subject to
2x_1 + 3x_2 ≤ 6, where there is a fifty-fifty chance that c_1 equals 0 or 1, and similarly,
there is an independent fifty-fifty chance that c_2 equals 0 or 1.)

11 Big Board Company (Sec. 16.4) 

(a) Explain why the structure ( 6 ) through (10) is so simple that a numerical solution 

is trivial once you are given values for Cj, Cj, and A. (Illustrate your answer using 
C\ = 10, C 2 — 15, ^3 = 3, T = 10, flj = 3, ^2 ~ ~ ^2 “ 50.) 

(b) Show how (13) through (21) are altered if Q = 4. Give a formula showing how
the numbers of equations and variables depend on Q.

(c) Show how (13) through (21) are altered if there is a capacity constraint on x_3,
namely, x_3 ≤ K, where K is a given constant. How does your answer change if K
is a random variable to be included in the characterization (12)? Assume that
if X3 exceeds K, there is a penalty charge JsiF — X3). 

(d) Show how (13) through (21) are simplified if the values for ai, < 22 , and Dq are not 
random (do not allow an excess of Product 2). 

(e) Explain how the formulation in (12) through (21) changes if c_j and a_j are correlated.
(Continue to assume that there are Q = 3 different possible sets of values
for all the random variables.)

(f) Show how (12) through (21) are altered if the production costs e_j are random
variables. Assume that the e_j are not known until after the first-stage decisions
are made. (Continue to assume that there are Q = 3 different possible sets of
values for all the random variables.)

(g) Suppose that the actual values of f_1 and f_2 become known after the levels of the
second-stage variables are fixed. Let E[f_i | q] denote the expected value of f_i given
that possibility q occurs, where q = 1, 2, ..., Q. Show how the objective function
(13) is altered.

12 Big Board Company (Sec. 16.4). Suppose the President of Big Board can pay a fee F
to learn which of the Q possible values for the random variables will actually occur.

(a) Suggest how to calculate the largest value for F that still makes it worthwhile for 
the President to purchase this perfect information. 

*(b) Suppose the fee F purchases only perfect information about the yield a_1, and that
a_1 is statistically independent of the other random variables (knowledge of a_1 is
of no help in predicting the values of the other random factors). Explain how to
modify your answer in part (a).

*13 Big Board Company (Sec. 16.4). The special structure of this case permits an alternative
formulation containing a single constraint (the total supply restriction) and a nonlinear
objective function. Observe in (15) through (20) that there really is no optimization
performed at the second stage; given a_qi x_i and D_qi, feasibility dictates whether s_qi or
t_qi is positive. Let u_i(x_i | a_i, f_i, D_i) represent the loss of profit associated with the
decision x_i, given specific values for the random variables a_i, f_i, and D_i.

(a) For i = 1, 2, write the appropriate formula for u_i(x_i | a_i, f_i, D_i). (Hint: the value
of the function depends on whether a_i x_i is "less than or equal to" or "greater than"
D_i.)

(b) Let U_i(x_i) be the expected value of u_i(x_i | a_i, f_i, D_i). Write a formula indicating
how to calculate U_i(x_i) given the Q possible sets of values for the random variables.

(c) Show the complete nonlinear programming optimization model characterizing the 
problem. 

14 Consider the general two-stage linear model in Sec. 16.4. In each part below, explain 
how the stated assumption is embodied in the formulation (22) through (25). 

(a) The value of each random element is independent of the levels of x_j.

(b) The levels of x_j, for j = 1, 2, ..., k, must be fixed at the first stage before any
exact values of the random elements are known.

(c) The levels of the second-stage variables are established after all the random values
are known. Is assuming that there always exist feasible levels for the second-stage
variables equivalent to assuming that (23), (24), and (25) have a feasible solution?
Explain.

(d) There is a finite number Q of possible sets of values for the random variables.
Does the formulation imply the assumption that the random variables are mutually
independent? Explain. If not, does such an assumption simplify the formulation?
Explain.

(e) State why it is permissible to use an exact value that is known for any c_j, b_i, or
a_ij in (22) through (25). What simplifications occur if any b_i or a_ij is known,
where i = 1, ..., m? (Be specific.)

*15 (a) Write the dual linear program to (13) through (21) in Sec. 16.4.

(b) Write the dual linear program to (22) through (25) in Sec. 16.4. 

16 Consider the two-stage problem: maximize 6x_1 + 8x_2 − 12x_3 − 3x_4 subject to
a_1 x_1 + a_2 x_2 + x_3 − x_4 = D, where every x_j ≥ 0, where the random quantities are
(a_1, a_2, D) and x_3 and x_4 are the second-stage variables. Suppose the only two possible
sets of values for the random variables are (1, 2, 4), occurring with probability and
(2, 1, 10) with probability §.

(a) Formulate this two-stage problem in an equivalent linear programming model
and find the optimal decision rule. Give an economic interpretation to your
answer.

*(b) Suggest an alternative characterization that involves maximizing a nonlinear
function subject only to the explicit constraints x_1 ≥ 0 and x_2 ≥ 0. (Hint: exploit
the fact that if you are given levels for x_1 and x_2 and specific values for a_1, a_2, D,
then feasibility dictates whether x_3 or x_4 is positive. Derive a nonlinear formula
showing the expected value contribution to the objective function stemming from
the second stage when the first-stage decisions are x_1 and x_2.)

*(c) Using the ideas developed in the advanced material at the end of Sec. 16.4,
calculate E[wait and see], E[average value], E[first-stage, average-value approximation],
E[most stringent problem], and the cost of uncertainty.

17 Big Board Company (Sec. 16.5). Consider the chance-constrained version (1) through
(5). Suppose E[c_1] = 10, E[c_2] = 15, c_3 = 3, T = 10, a_1 = 3, a_2 = Assume that
D_1 and D_2 each have the marginal probability distribution given in (8). What is an
optimal solution if

(a) β_1 = β_2 = .9?

(b) β_1 = β_2 = .95?

(c) β_1 = β_2 = .5?

(d) β_1 = .9, β_2 = .5?

(e) β_1 = .5, β_2 = .9?

(f) β_1 = β_2 = .3?

(g) β_1 = β_2 = 0?

18 Big Board Company (Sec. 16.5). Answer the questions in exercise 17, except let the
marginal probability distribution for each of the products be P[D = d] = for
d = 1, 2, ..., 10.

*19 (a) Let c_j(u_j) be a convex function, where u_j is integer-valued. Show that C_j(x_j)
= E[c_j(D_j − x_j)] is a convex function of x_j. (Assume that there are a finite
number of possible values for the random variable D_j.)

(b) Explain why the objective function (6) in Sec. 16.6 is appropriate even when
demands D_j are correlated.

Exercises 20 through 24 refer to the transportation model and the example in 
Sec. 16.6. 

*20 (a) Verify the values for C_3(x_3) and C_4(x_4) in Fig. 16.7. Also verify the column
C_3(x_3) − C_3(x_3 + 1). Check that each C_j(x_j) is convex according to (9).

(b) Explain why C_j(0) = f_j E[D_j], and why C_j(x_j) decreases to 0 as x_j increases to
the largest possible value for D_j.

(c) Find an optimal solution for f_3 = 15 and f_4 = 5; check your answer with that
given in Fig. 16.9.

*21 Assume, instead of (11), that P[D_4 = d] = 1/5 for d = 0, 1, 2, 3, 4. Accordingly, revise
C_4(x_4) in Fig. 16.7, rewrite the Expanded Transportation Tableau in Fig. 16.8, and
find an optimal solution for f_3 = 15 and f_4 = 5.

*22 (a) Verify the solution x_3 = 2 and x_4 = 3 is optimal for the associated values of f_3
and f_4 stated in Fig. 16.9.

(b) Do the same for the solution x_3 = 3 and x_4 = 2.

*23 (a) Verify (i) and (ii) in the special material.

(b) Derive expressions analogous to (i) and (ii) in terms of^i. 

*24 Consider the chance-constrained version.

(a) Verify the total transportation cost given in Fig. 16.10 for the allocation x_3 = 2
and x_4 = 3. Do the same for x_3 = 3 and x_4 = 2.

(b) Explain why x_3 = 4 and x_4 = 1 is never an optimal allocation in this approach.

(c) Discuss the fundamental difference between this approach and the two-stage
approach to the stochastic transportation problem. Do you think it reasonable in
the example to rule out the allocation (x_3 = 2, x_4 = 3) and (x_3 = 4, x_4 = 1)?
Explain your reasoning. 

*25 Consider the production scheduling example in Sec. 16.7. 

(a) Explain why (12) through (16) characterizes the case of Perfect Information, and justify
the statement that there are 72 variables and 24 equality constraints. Explain
why (12) through (16) can be factored into Q separate linear programs.

(b) Verify in the case of Learn Demand—Then Produce that the simplified form 
contains only the variables in (17), and justify the statement that there are 42 
variables and 14 equality constraints. Exhibit in detail the simplified form of the 
objective function. 

(c) Verify in the case of Produce—Then Learn Demand that the simplified form 
contains only the variables in (18), and justify the statement that there are 35 
variables and 14 equality constraints. Exhibit in detail the appropriate objective 
function. 

(d) Verify in the case of Full Commitment that the simplified form contains only 
the variables in (19), and justify the statement that there are 31 variables and 14 
equality constraints. 

(e) Suppose the horizon extends one more period, so that t = 1, 2, 3, 4; assume D_4
is generated analogous to the other D_t so that, all in all, Q = 16. Determine the
number of variables and equations in each of the four cases.

*26 Consider the production scheduling example in Sec. 16.7. 

(a) Explain how, if at all, the four cases are altered when the demands D_t are mutually
independent. (Be sure to determine whether there are fewer variables and equations
in each formulation.)

(b) Explain in detail the equivalent linear programming model appropriate to the
following case. In Period t you know both Di and before deciding produc¬ 
tion Xi. (Be sure to state the number of variables and equations in the model and 
exhibit the objective function.) 

*27 Consider the production scheduling example in Sec. 16.7.

(a) Suppose each profit coefficient c_t in (7) is a random variable, and you learn its
value at the same time you learn D_t. Also assume that there are Q = 8 possible
sets of values for (D_1, D_2, D_3; c_1, c_2, c_3), and that the D_t are generated as in Fig.
16.11. Explain how each of the four cases is appropriately altered.

(b) Suppose demand left unfilled in Period 1 is backlogged to Period 2, and demand
left unfilled in Period 2 is backlogged to Period 3. Let d_t be the penalty cost per
unit of unfilled demand at the end of Period t. Any demand left unfilled at the
end of Period 3 is lost. All the other stipulations in the model remain unchanged.
Explain how each of the four cases is appropriately altered. (Be sure to state the
number of variables and equations in each model, and exhibit the objective
function.)

*28 Consider the employment smoothing example in Sec. 16.8.

(a) Verify that setting ∂C/∂y_t = 0 for t = 1, 2, ..., T in (6) yields the equations in
(7), (8), and (9).

(b) Verify that 0 < α ≤ 1 for 0 ≤ c < ∞, where α is given by (11).

(c) Verify that y_t → D* as t → ∞ in (12).

♦29 Consider the employment smoothing example in Sec. 16.8. Let ^ c = 

In each case below, find the optimal 

(a) Suppose T = 4 and E[D_t] = 16 for every t.

(b) Suppose T = 4 and E[D_t] = 4t for t = 1, 2, 3, 4.

(c) Suppose T = 4 and E[D_t] = 20 − 4t for t = 1, 2, 3, 4.

(d) Suppose T = 4 and E[D_1] = E[D_3] = 4 and E[D_2] = E[D_4] = 16.

(e) Suppose T = 4 and E[D_1] = E[D_3] = 16 and E[D_2] = E[D_4] = 4.

(f) Suppose the horizon is unbounded and E[D_t] = D* = 16 for all t. Also show the
optimal y_t for t = 2, 3, 4.


*30 Answer the questions in exercise 29, assuming that

(a) c = 

(b) e = f. 


31 Explain your understanding of the following terms: 


impact of uncertainty
contingency plan, strategy, decision rule
expected value
uncertain actions
induced randomness
*randomized strategies (or decision rules)
Fallacy of Averages
gain from (or value of) perfect information, cost of uncertainty
stochastic linear programming model
two-stage linear model
Linear Certainty-Equivalence Theorem
*here-and-now problem
*wait-and-see problem
*average-value problem
*first-stage, average-value approximation
*most stringent problem
chance-constrained programming
deterministic equivalent constraint
*joint chance-constraint
*multistage linear model
*quadratic criterion function
*linear decision rule.


FORMULATION AND COMPUTATIONAL EXERCISES 

32 Galactic Reaper Company (Sec. 16.3). Suppose in Fig. 16.2 that the arc between Nodes 1
and 2 is drawn in the opposite direction. Analyze the same decision problem, assuming
that the incremental profit is equal to 0 from a total time span of seven or more
periods. (Be sure to indicate the optimal decision and also compute the gain from
perfect information, as defined at the end of the section.)

*33 Critical Path Scheduling. Consider the example in Sec. 6.6 and its linear programming
characterization (1) through (6). Assume that several of the completion times are
random, as given by the data in Fig. 16.3. Suppose you want to determine job starting-time
values y_CD, y_E, and y_F such that the probability is at least .5 that each of the
inequalities (2) through (6) is satisfied (for example, you want P[ ≥ t_C + y_CD] ≥ .5).
These starting-time values are to be determined prior to knowing the exact values of
any of the random completion times. Give a verbal interpretation of the chance-constraints.

(a) Formulate an equivalent linear programming model. Also exhibit the correspond¬ 
ing dual problem. 

(b) Solve the primal and dual problems in part (a). Indicate how the solution varies 
if the stated probability is .25, instead of .5. Is .75, instead of .5. 

34 One-Potato, Two-Potato Problem (Sec. 1.6). Typically the actual yields of agricultural
raw materials are subject to random variation. Assume that the yields for Source 1
are either (.2, .2, .3), as shown in Fig. 1.1, or (.25, .25, .35), or (.18, .15, .29), where
each of these possibilities occurs with equal likelihood. Also assume that the yields
for Source 2 are either (.3, .1, .3), as shown in Fig. 1.1, or (.35, .1, .25), where the
first possibility occurs with probability | and the second with probability j. Assume
that French fries manufactured in excess of the stated purchase limitation can be sold
directly to institutional buyers at the relative profit of F per unit weight, and similarly,
excess hash browns at H and flakes at L per unit weight.

(a) Write a two-stage linear model that selects purchase quantities to maximize 
expected profit. 

*(b) Write the corresponding dual linear program. 

35 Knox Mix Company (Sec. 2.2) 

(a) Suppose the unit profit of each activity is composed of a revenue factor less a cost 
component, where cost is partly random. Specifically, let the unit revenue for 
Processes 1 and 2 be 15, and for Processes 3 and 4 be 20. Assume that the cost 
components are 

Process 1: 8 + c     Process 2: 8 + 2c     where P[c = 0] = J   P[c = 1] = |
Process 3: 6 + d     Process 4: 6 + 3d

where P[d = 0] = i   P[d = 1] = i   P[d = 2] = i.

Display a model for selecting the production levels to maximize expected profit. 

(b) Suppose the availability of labor W is random, where 

P[W = 10] = -^     P[W = 12] = y     P[W = 15] = -|-

Assume that if the production schedule requires more man-weeks than are
available, then the additional requirement is met by scheduling overtime, which
costs c per man-week required. (The production levels are established at the start
of the week and any overtime is incurred at the end of the week.) Display a model
for selecting production levels to maximize expected profit.

(c) Consider the situation in part (b), except suppose that the company seeks a
schedule that stays within the available amount of labor with probability of at
least .8. (The dollar cost of overtime is not explicitly taken into account.) Display
an appropriate linear optimization model. Show how to alter the formulation if
the probability is .9, instead of .8.

36 Hion Hog Farm Problem (Sec. 2.3). After reading a recent research report issued by the 
Department of Agriculture, the production manager feels that the minimal amount r 
of Nutritional Ingredient B is either 225, with probability 4, or 250, with probability 
1 ^, or 300, with probability J . Formulate the appropriate linear optimization model 
that ensures his hogs receive at least the minimal required amount of B with probability
of at least .25. With probability of at least .50. With probability of at least .75.
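
A chance-constraint of this kind, in which only the right-hand side is random, can be replaced by an ordinary inequality. If the amount supplied must cover a random requirement r with probability at least β, it suffices to cover the smallest value v for which P[r ≤ v] ≥ β. The Python sketch below illustrates that conversion. The distribution shown is hypothetical (it is not the data of the exercise), and the function name is an assumption made for illustration.

```python
from fractions import Fraction


def deterministic_rhs(distribution, beta):
    """Return the smallest value v with P[r <= v] >= beta.

    `distribution` maps each possible value of the random requirement r
    to its probability. A constraint "supplied >= r with probability >= beta"
    is then equivalent to the ordinary constraint "supplied >= v".
    """
    if not 0 < beta <= 1:
        raise ValueError("beta must satisfy 0 < beta <= 1")
    cumulative = Fraction(0)
    for value in sorted(distribution):
        cumulative += distribution[value]
        if cumulative >= beta:
            return value
    raise ValueError("probabilities sum to less than beta")


# Hypothetical requirement distribution (an assumption for illustration only).
REQUIREMENT = {
    100: Fraction(3, 10),
    120: Fraction(1, 2),
    150: Fraction(1, 5),
}

if __name__ == "__main__":
    for beta in (Fraction(1, 4), Fraction(1, 2), Fraction(9, 10)):
        print(beta, deterministic_rhs(REQUIREMENT, beta))
```

Raising β can only move the deterministic right-hand side upward, so a stricter probability limit never enlarges the feasible region.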

37 Trim Problem (exercise 25 of Chap. 2). Suppose that Fine-Webb produces in anticipation
of customer orders for 22-inch width rolls. Assume that such demand turns out
to be 90 + D, where D has the probability distribution P[D = d] = d/190 for
d = 1, 2, ..., 19. Formulate a linear programming model that minimizes trim costs
subject to the previous demand constraints on the rolls of 20-inch and 12-inch
widths, and the restriction that all demand for 22-inch width rolls be satisfied with
probability of at least .5. At least .8. At least .9.

38 Blip-Bleep Problem (Sec. 6.4). Suppose the manufacturer is concerned about the ability
of certain subcontractors to meet the required specifications for the electronic components.
Let p_ij be the manufacturer's probability estimate that each Component i
produced by Subcontractor j will meet standards, and let K_i be the loss incurred by the
manufacturer when any units of Component i are produced below standard. Formulate
an optimization model that minimizes expected total cost. Indicate whether
the formulation is still a standard assignment problem.

39 Shortest-Route Model (Sec. 6.5). Consider the example in Fig. 6.8. Suppose that the arc
costs are random and that traversing each arc requires one period. Assume that during
each of the first three periods of the horizon the arc cost from Node i to Node j is
either c_ij, with probability p_ij, or d_ij, with probability 1 − p_ij. After the third period,
the arc cost is either e_ij, with probability q_ij, or f_ij, with probability 1 − q_ij. Assume
that each arc cost for each period is completely independent of every other arc cost.
(Thus, for example, one possible sequence of arc costs from Node 3 to Node 4 over
seven periods is c_34, d_34, c_34, f_34, f_34, e_34, f_34.) A complete route must be selected prior
to knowing the exact values of any of the arc costs, and the objective is to minimize
expected cost in going from Node 8 (source) to Node 1 (sink). Show how to formulate
the problem as an equivalent deterministic shortest-route model. (Draw the network
diagram and indicate the arc costs.)

40 Consider a linear programming model containing the constraint 

where a_11 is the random variable and has the probability distribution P[a_11 = a] = .1
for a = 1, 2, ..., 10.

(a) Show how to convert this chance-constraint into an ordinary linear inequality.

(b) Answer part (a), except let the probability limit be .75, instead of .25. Be .8,
instead of .25.

(c) Rework part (a), letting P[a_11 = a] = .1 for a = 6, 7, ..., 15. For a =
−4, −3, ..., 5.

41 Great Auto Works Problem (Sec. 2.6). Let T = 3, and assume that the sales requirement
S_t in each period is random, where

/>[5, = 10] = ^ = 20] = ^ i>[5, = 16] = 3- 

P[5,= 15]=-|: i>[5, = 25] = -i P[5,= 18] = -^, 

and the requirements are completely independent of each other. If the actual requirement
exceeds the available supply (at the end of the period), the unfilled demand is
lost at a cost of per unit in Period t.

(a) Assume there is “perfect information” at the beginning of the horizon, analogous 
to the case of Perfect Information in Sec. 16.7. Write the appropriate linear model 
in full, and indicate the total number of variables and equations. 

(b) Analyze how the formulation in part (a) is altered if the exact values of St and 
the previous sales requirements are known before setting the levels of the Period t 
decision variables, analogous to the case of Learn Demand — Then Produce in Sec. 
16.7. 

(c) Analyze how the formulation in part (a) is altered if only the exact values of the 
previous sales requirements are known before setting the levels of the Period t 
decision variables, analogous to the case of Produce — Then Learn Demand in Sec. 
16.7. 

(d) Analyze how the formulation in part (a) is altered if the levels of all the decision 
variables must be set before knowing any of the exact values for the sales require¬ 
ments, analogous to the case of Full Commitment in Sec. 16.7. 


MIND-EXPANDING EXERCISES 

42 Neyman-Pearson Lemma. Consider a random variable V that can assume a value of
either 1, 2, ..., n. Suppose you wish to find out which of two possible probability
distributions is giving rise to V by taking a single sample observation. Let one of the
possible distributions be denoted by P, containing the probabilities p_j > 0, and the
other by Q, containing the probabilities q_j > 0. Explain why it is always possible to
assign index values j = 1, 2, ..., n to the observation values 1, 2, ..., n, in such
a way that

p_1/q_1 ≤ p_2/q_2 ≤ ... ≤ p_n/q_n,

so that the index j = 1 refers to the observation value having the smallest ratio and the
index j = n to the observation having the largest ratio. Given that the sample yields
the observation indexed as j, let x_j be the probability that you decide the distribution
is P (and 1 − x_j that it is Q).

(a) Formulate a linear programming model that maximizes the probability of deciding
the distribution is P when indeed it is P, subject to the constraint that the
probability of deciding it is P when in fact it is Q does not exceed where
0 < < 1. Also write the dual problem.

(b) Show that an optimal solution has the form that x_j = 0 for j < J and x_j = 1 for
j > J, for some critical J.

*43 Consider the transportation example in Sec. 16.6. Suppose in addition that the S_i
are also random, and that their exact values are not known until after the x_ij are
determined. Let P[S_i = s] denote the probability that the value of S_i is s. (The level
of x_ij is to be viewed as a commitment to ship x_ij units from the ith source to the jth
destination.) If x_i1 + x_i2 + ... + x_in exceeds the actual value of S_i, the difference
must be supplied by another means at a cost c_j per unit. Discuss in detail how you
can alter the formulation in Sec. 16.6 to obtain an equivalent enlarged deterministic
transportation problem.

*44 Warehouse Problem (exercise 36 of Chap. 5). Suppose the selling price and purchase cost
are random variables. Specifically, consider the random variable r, which has probability
distribution ^^(r) in Period t, where r = 1, 2, ..., 10. Then assume that the
selling price and purchase cost in Period t are related to r by the functions p_t(r) and
c_t(r), and that these values are known at the start of Period t, prior to deciding the
number of gallons to be sold or purchased. Formulate an equivalent multistage linear 
programming problem that yields a strategy maximizing expected profit. (Be sure to 
define your variables in such a way as to exhibit their dependence on r.) State the 
numbers of variables and constraints in the linear model. 

*45 Caterer Problem (exercise 48 of Chap. 6). Suppose that p is the probability of losing a
bundle of napkins sent for normal laundering. The catering firm learns about a loss
at the beginning of the period when the laundry is to be returned, and when there is
time to purchase new napkins if necessary. Formulate an equivalent linear programming
model. State the numbers of variables and constraints in the linear model.

*46 Expando Manufacturing Company (exercise 36 of Chap. 2). Suppose the company anticipates
the possibility of a strike in the construction industry. Let p_j, for j = 0, 1, ..., 6,
denote the probability that a strike occurs during the jth period, where p_0 represents the probability
of no strike at all. Assume that if a strike occurs during Period j, then any expansion 
by Method 2 undertaken in the previous period or any expansion undertaken in 
Period j extends for an additional period. In case of a strike, the company is willing 
to alter its objective function to include production capacity that would have become 
available by the end of the sixth period were it not for the strike. 

(a) Formulate an equivalent linear programming model assuming that the entire 
expansion plan must be made at the beginning of Period 1, prior to knowing 
when a strike occurs (if at all). State the numbers of variables and constraints in 
the model. 

(b) Formulate an equivalent linear programming model assuming that the company
knows prior to each period whether a strike will occur in that period. (Assume
that if a strike occurs in Period k, it does not recur in any subsequent period.) 
State the numbers of variables and constraints in the model. 

*47 Employment Scheduling. Consider the example of the Spitzen-Pollish Company in Sec.
6.7. Suppose the number of required crews R_k in Period k is a random variable.
Specifically, let P[R_k = R] = p_k(R), where R = 1, 2, ..., 10, and assume that the
R_k are mutually independent. In each part below, show how to formulate an equivalent
linear programming model, given the specified assumption about when the actual
values of R_k become known. Be sure to define all the symbols you use and indicate the
numbers of equations and variables.

(a) Assume that R_k becomes known at the start of Period k, prior to making decisions
x_{k,k+1}, ..., x_{k,n}.

(b) Assume that R_k becomes known at the start of Period k − 1, prior to making the

decisions ... ,xic-\,n- (Hence, Ri and R^ arc known at the start of the 

horizon, R^ and R^ at the start of Period 2, etc.) 

*48 (a) Consider a nonlinear programming problem that can be solved by an algorithm 
of the sort described in Chaps. 14 and 15. Suppose that the underlying application 
lends itself to a two-stage description, analogous to that given in Sec. 16.4. Discuss 
whether the approach used in Sec. 16.4 generalizes to the nonlinear problem. 
State whether you can construct an enlarged equivalent nonlinear problem 
solvable by the same sort of algorithm that was applicable in the deterministic 
case. 

(b) Do the same for a multistage stochastic nonlinear problem and the approach used 
in Sec. 16.7. 

(c) Do the same for two-stage and multistage stochastic integer programming problems
of the sort described in Chap. 13 and the approaches used in Secs. 16.4 and
16.7.

CHAPTER 17



Probabilistic Dynamic 
Programming Models 


17.1 INTRODUCTION 

In the preceding chapter you studied the impact of uncertainty on optimal
decision processes. Proper structuring of a model containing random elements
was emphasized, especially taking into account the intermingled sequence of
decisions and emergent information about the exact values of the random elements.
Most of the analysis was in the context of linear models, and you saw that
the stochastic generalizations frequently lead to expanded linear models that are
too large to solve in practice.

This chapter partially reverses the emphasis. Each model below is a specific 
application of probabilistic dynamic programming. The examples will show that 
stochastic versions of many dynamic programming models are not much more 
difficult to solve than their deterministic counterparts. 

Except for the advanced material in Sec. 17.7, each illustration in this chapter
is for a finite horizon or finite stage dynamic programming problem. (Optimization
over an unbounded horizon is treated in Chap. 18.) This chapter focuses
on how to include random phenomena in multistage problems. We assume that
you are already familiar with formulating deterministic dynamic programming
problems, and suggest that you take a few minutes to review Chaps. 8 and 10 to
refresh your memory on the concepts used in dynamic programming, and in particular,
on the notions of state and stage variables, the appearance of a recursion
formula, and the numerical procedures for computing solutions from such recursions.

We explain probabilistic dynamic programming in a manner similar to the
approach in Chaps. 8 and 10, namely, by presenting several illustrative examples.
The models get progressively more complicated-looking, and a few represent
stochastic versions of deterministic models that you previously studied in Chaps.
8 and 10. We do not dwell in this chapter on the details of computing answers
to specific problems, because in practice, the computational task is so lengthy
that you would use an electronic computer and a program written by a technical
expert. Instead, we concentrate on problem formulation and on certain insights
into what happens when you introduce probabilistic elements.

To keep perspective on assessing the impact of uncertainty while reading the 
material below, be sure you answer these questions: 

1. What is an optimal policy for a deterministic version of the model? 

2. How much information about the probability distribution does an optimal 
solution require? 

3. How does the dynamic programming algorithm for the stochastic model
differ from that for the deterministic version?


17.2 DISTRIBUTION OF EFFORT EXAMPLE 


We begin with an example in which the introduction of probabilistic elements 
has an almost trivial impact on the way the appropriate dynamic recursion is 
formulated. The model is a stochastic version of what was called the Distribution 
of Effort Problem in Chap. 10. To save you from flipping back to Sec. 10.2 where
the particular example was first discussed, we repeat the statement of the
deterministic problem here.

The owner of Shopping Basket Markets has one week's supply, N, of eggs to
distribute among his s stores. From past experience he knows that if he allocates
$y_j$ eggs to Store j, his profit will be $R_j(y_j)$. He wants to find a distribution of eggs
among his Shopping Basket stores that maximizes overall profit.

The mathematical characterization of the problem is given as

(1) maximize $\sum_{j=1}^{s} R_j(y_j)$

subject to

(2) $\sum_{j=1}^{s} y_j = N$ (available number of eggs)

(3) $y_j = 0, 1, 2, \ldots$ for each j (distribute only whole eggs).

In Sec. 10.2, we let

(4) $g_j(n)$ = profit when n eggs are distributed optimally to Store 1,
Store 2, ..., Store j.

You saw that (1) through (3) could be solved as a multistage problem by employing
the dynamic programming recursion

(5) $g_j(n) = \max_{y_j} \, [R_j(y_j) + g_{j-1}(n - y_j)]$ for j = 1, 2, ..., s

(6) $g_0(n) = 0$ for j = 0,

where n = 0, 1, ..., N and the maximization is over only nonnegative integer
values of $y_j$ that satisfy $y_j \le n$. The optimal value of (1) is given by $g_s(N)$.

The above formulation proceeded under the assumption that the profit from
distributing eggs to Store j is known with certainty. Suppose, however, that
profit depends not only on $y_j$ but also on the actual demand at Store j, which can
be described as a random variable independent of the levels of all $y_k$ and known
exactly only after the $y_j$ are all set. Specifically, let

(7) $r_j(d \mid y)$ = profit at Store j when demand actually equals d, given that
Store j has been allocated y eggs

(8) $p_j(d)$ = probability that demand actually equals d at Store j.

For example, the profit function in (7) might be described by

(9) $r_j(d \mid y) = \begin{cases} r_j d & \text{for } d \le y \quad \text{(supply satisfies demand)} \\ r_j y & \text{for } d > y \quad \text{(demand exceeds supply).} \end{cases}$

In this case, the number $r_j$ represents the profit at Store j for each egg sold. When
the supply is large enough to satisfy all demand d, the total profit is $r_j d$; but
when actual demand exceeds supply, then only the y eggs are sold, so total profit
is $r_j y$.

The expected profit from distributing $y_j$ eggs to Store j is

(10) $R_j(y_j) = \sum_{d} r_j(d \mid y_j)\, p_j(d)$,

where the letter d underneath the summation sign designates that the sum is to
be taken over every possible value of demand. Using the above definition for
$R_j(y_j)$, a solution to (1) through (3) represents an optimal allocation given the
criterion of maximizing expected total profit, and an optimal solution can still be 
obtained by employing (4) through (6). Therefore, once the expected profit 
functions in (10) have been calculated, the stochastic model is no more complicated 
to solve than the deterministic version. 
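
The two steps just described, computing the expected profit functions in (10) and then
running the recursion (5) and (6), can be sketched in a few lines of Python. This is only
an illustrative sketch: the function names `expected_profit` and `distribute`, the
dictionary representation of the demand distribution, and the sample data are assumptions
made for the example, and the per-egg profit form (9) is assumed for every store.

```python
def expected_profit(r, demand_pmf, y):
    """Expected profit R_j(y) from (10), assuming the profit form (9).

    r          -- profit per egg sold at the store (r_j)
    demand_pmf -- dict mapping demand d to its probability p_j(d)
    y          -- eggs allocated to the store
    """
    # Under (9) the store sells min(d, y) eggs, earning r on each.
    return sum(r * min(d, y) * prob for d, prob in demand_pmf.items())


def distribute(total, stores):
    """Recursion (5)-(6) applied to expected profits.

    total  -- number of eggs available (N)
    stores -- list of (r_j, demand_pmf_j) pairs, one per store
    Returns (maximum expected total profit, allocation per store).
    """
    g_prev = [0.0] * (total + 1)          # g_0(n) = 0 for every n
    choices = []                          # choices[j][n] = best y_j given n eggs
    for r, pmf in stores:
        R = [expected_profit(r, pmf, y) for y in range(total + 1)]
        g, best = [], []
        for n in range(total + 1):
            # Maximize over nonnegative integer y_j with y_j <= n.
            y_best = max(range(n + 1), key=lambda y: R[y] + g_prev[n - y])
            best.append(y_best)
            g.append(R[y_best] + g_prev[n - y_best])
        choices.append(best)
        g_prev = g

    # Trace the optimal allocation back from the last store to the first.
    n, allocation = total, []
    for best in reversed(choices):
        allocation.append(best[n])
        n -= best[n]
    allocation.reverse()
    return g_prev[total], allocation


if __name__ == "__main__":
    # Hypothetical data: two stores and four eggs.
    stores = [(2.0, {1: 0.5, 3: 0.5}), (1.0, {2: 1.0})]
    print(distribute(4, stores))
```

Notice that the demand distributions enter only through `expected_profit`; the recursion
itself is the same one used for the deterministic problem.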


17.3 NEW PRODUCT INTRODUCTION—DECISION TREE 

Hopefully, the preceding section has restored your self-confidence in working 
with dynamic programming recursions and you are ready to go on to a more 
challenging problem. In Chap. 16, you learned a fundamental proposition about 
finding optimal solutions to a dynamic decision-making problem with stochastic 
elements: you can only find specific optimal values for the first-stage decisions, and 
must determine optimal decision rules that are to be applied in future 
periods. Such rules indicate what decision is optimal based on the actual history 
of the system at the time the particular choice is made. 

The “Cool It” Baby Air Conditioner Company is developing a major design 
improvement for its window-installation models. The design is far enough along 
for the change to be made right away. The company prefers to wait, however,
until additional research and testing have been completed, because some minor
difficulties still beset the manufacturing process. On the other hand, the President
of Cool It knows that if he delays too long, one or more of his competitors will
surely announce a similar product improvement and his share of sales will consequently
be smaller. What he must do, then, is balance the profit improvement
from waiting until the "bugs" in his production process are worked out against
the possible loss of sales to his competitors.

Consider the following mathematical representation of this situation. Suppose
the manufacturer wants to decide the optimal month in which to announce the
new improvement. Assume that he considers the latest possible month to be T,
and that he definitely will introduce the change at any time his competition
advertises a similar improvement. Let

$r_t$ = profit when, in Month t, Cool It announces the improvement in advance
of competition

$g_t$ = profit when, in Month t, Cool It and any competitor simultaneously
introduce the change

$h_t$ = profit when, in Month t, Cool It announces the improvement after
competition has already done so,

where it is plausible (although not really essential) to postulate that $r_t > g_t > h_t$
and $r_t$, $g_t$, and $h_t$ increase as time goes on, that is, with larger values of t.

Assume that if no improvement is announced before Month t, then Cool It
and its competitors make their Month t decisions independently, without knowledge
of each other's current decisions. Accordingly, suppose that the President of
Cool It believes $p_t$ represents the probability that his competition will announce
an improvement in Month t, given that no one of his competitors has introduced
the change up to that time, where he feels that $p_T = 1$. By assumption, then, the
manufacturer postulates that these values for the conditional probabilities $p_t$
do not depend on the particular strategy he decides to use. The President predicts,
however, that if Cool It introduces the improvement in Month t in advance of
competition, then the competitors assuredly will make a similar introduction
in the next period. Consequently, he calculates the values for $r_t$ to reflect such
competitive reaction.

The underlying decision process can be described by a decision tree
as shown in Fig. 17.1, where we let T = 3. The labels M and C at each node refer
to the action taken by the manufacturer and his competitors. The tree terminates
at Month t unless both the manufacturer and his competitors decide to delay the
introduction.

Study the tree to see how it graphically displays the multiperiod decision process.
The bottom of the tree represents the first-stage or immediate decision the
manufacturer must make. If he chooses to introduce the improvement, his decision
process terminates. But if he decides to delay, then he may or may not have an
opportunity to delay again at Period 2, depending on the outcome of the probabilistic
element representing the competitors' behavior. Similar remarks hold for his other
decision points on the tree.

Figure 17.1. Cool It Company's Decision Tree for New Product Introduction (T = 3).
Note: M = manufacturer's decision (Cool It Company); C = competitors' decisions.

Assume that the President wants a strategy that maximizes expected profit.
As usual in a dynamic programming problem, you can easily find the optimal
decision in case the process continues to the final period, Period T. At that time,
you need only compare the expected profit of his introducing the improvement
with the expected profit of a further delay. Explain why these two quantities are
$p_T g_T$ and $p_T h_T$, respectively. Since by assumption $p_T = 1$ and $g_T > h_T$, the better
decision is to introduce the improvement if the decision process lasts until the final
period. The foregoing computations can be written as

(1) $f_T = \text{maximum} \begin{cases} p_T g_T & \text{: introduce} \\ p_T h_T & \text{: do not introduce,} \end{cases}$

where we have let $f_T$ denote the expected profit of an optimal decision at Period T.

Next consider what happens at an earlier period, Period t. If the President of
Cool It chooses to introduce the improvement, his expected profit consists of $g_t$,
weighted by the probability $p_t$ that his competition also makes the same announcement,
plus $r_t$, weighted by the probability $(1 - p_t)$ that Cool It "jumps the gun"
on its competition. But if the President chooses not to introduce the improvement,
then his expected profit consists of $h_t$, weighted by the probability $p_t$ that his
competition makes the announcement, plus the value of delaying, weighted by the
probability $(1 - p_t)$ that his competition also delays. What is the value of delaying?
It is simply the expected profit from an optimal policy given that the decision
process lasts until at least Period t + 1; let $f_{t+1}$ denote this quantity. Then $f_t$
is the larger of the expected profits for the two possible decisions at Period t.
This is all nicely summarized by the recursive calculation, for t = T - 1, ..., 1,

(2) $f_t = \text{maximum} \begin{cases} p_t g_t + (1 - p_t)\, r_t & \text{: introduce} \\ p_t h_t + (1 - p_t)\, f_{t+1} & \text{: do not introduce.} \end{cases}$

To compute an optimal decision rule for future periods and the optimal action
at Period 1, start the calculations at (1), and perform the arithmetic indicated in
(2).

The decision yielding $f_1$ is optimal for Month 1. If it is "do not introduce" and
the competitors actually make no announcement, then the decision yielding $f_2$
is optimal in Period 2. The decisions in subsequent months are found in a similar
manner, and depend on the actual behavior of the competition. Explain the
so-called backward induction process (1) and (2) in relation to the decision
tree diagram. (In what way does this process resemble finding the best route in
an acyclic network?)
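
As a concrete illustration of the backward induction in (1) and (2), the following Python
sketch computes the expected profits and the decision rule for a three-month horizon.
The function name `cool_it_policy` and all numerical values are hypothetical; they are
chosen only so that delaying is optimal in Month 1, and they are not taken from the text.

```python
def cool_it_policy(r, g, h, p):
    """Backward induction for recursions (1) and (2).

    r, g, h -- profits r_t, g_t, h_t for Months 1..T (lists of length T)
    p       -- conditional probabilities p_t; the model assumes p_T = 1
    Returns (f, action): f[t-1] is the expected profit of an optimal decision
    if the process lasts until Month t, and action[t-1] is that decision.
    """
    T = len(p)
    assert p[-1] == 1, "the model assumes the competition surely moves in Month T"
    f = [0.0] * (T + 1)      # f[T] is a placeholder; it is multiplied by (1 - p_T) = 0
    action = [None] * T
    for t in range(T - 1, -1, -1):            # index t corresponds to Month t + 1
        introduce = p[t] * g[t] + (1 - p[t]) * r[t]
        delay = p[t] * h[t] + (1 - p[t]) * f[t + 1]
        if introduce >= delay:
            f[t], action[t] = introduce, "introduce"
        else:
            f[t], action[t] = delay, "do not introduce"
    return f[:T], action


if __name__ == "__main__":
    # Hypothetical three-month data.
    f, action = cool_it_policy(
        r=[10, 16, 20], g=[6, 7, 8], h=[3, 4, 5], p=[0.2, 0.5, 1.0]
    )
    for month, (value, act) in enumerate(zip(f, action), start=1):
        print(f"Month {month}: {act} (expected profit {value:.2f})")
```

Because $p_T = 1$, the general expression in (2) reduces to (1) at the final month, which is
why a single loop suffices. The returned actions form a decision rule: the Month 2 entry
is used only if Cool It delayed in Month 1 and the competition made no announcement.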


►A critical assumption in the above model is that the values of $p_t$ are indeed independent
of the manufacturer's own strategy. To see why, consider the following situation. Suppose
there is a single competitor whose profit figures are identical to the manufacturer's.
Then if the competitor could learn that the manufacturer planned to introduce the
improvement in Period s, the competitor would thus announce his own change in
Period s - 1 (if $r_{s-1} > g_s$). But if the manufacturer suspected this action by his
competitor, he would then let $p_{s-1} = 1$, and would change his initial decision accordingly.
As you quickly recognize, this line of reasoning can lead to a sequence of revisions on
the part of the manufacturer and his competitor. The question arises whether strategies
exist for the manufacturer and the competitor that are in equilibrium, that is, which
would not be revised if either one learned the decision of the other. And even if there are
such equilibrium strategies, are they optimal in any sense? Such topics fall in the realm
of game theory, and are treated in texts on that subject.


17.4 ELEMENTARY INVENTORY MODEL 

The Cool It Company example was fairly simple because the state of the system
at any Period t was really an indication of whether the decision process had
lasted that long, in other words, whether either the manufacturer or his competitors
had announced the improvement in prior periods. The state variable in this
next illustration accords with what you have seen in previously presented
deterministic problems.

The illustration is a probabilistic version of the Dependable Manufacturing
Company production and inventory example contained in Sec. 8.4. (An extensive
treatment of inventory models with stochastic demands will be given in Chap.
19 and Appendix II. The emphasis there will be on deriving the form of an optimal
policy, and then exploiting this knowledge to calculate numerical answers.)

In the previous deterministic case, we made the following assumptions about the
data for Dependable Manufacturing Company's problem:

(1) demand D = 3, production $x \le 5$, ending inventory $j \le 4$

(2) production and inventory costs $C(x, j) = C(x) + hj$,

where x and j are nonnegative integers and

(3) C(0) = 0, C(1) = 15, C(2) = 17, C(3) = 19, C(4) = 21, C(5) = 23, and h = 1,

for all periods. Thus the model's parameters are stationary over time.

The state variable is entering inventory, designated by the symbol i, where
i = 0, 1, ..., 4. The appropriate recursion is

(4) $f_n(i) = \min_{x} \, [C(x) + 1 \cdot (i + x - 3) + f_{n-1}(i + x - 3)]$ for n = 2, 3, ...,

with

(5) $f_1(i) = C(3 - i)$ for i = 0, 1, 2, 3,

where in (4), i = 0, 1, ..., 4, and the minimization is over only nonnegative
integer values in the range $3 - i \le x \le \text{minimum}(5, 7 - i)$.

Suppose now that the demands are independently and identically distributed
random variables, where

(6) $P[D = 2] = \tfrac{1}{2}$, $P[D = 4] = \tfrac{1}{2}$, so that $E[D] = 3$ for all periods.

To make the analysis of this stochastic version comparable to the deterministic
model, a few additional assumptions need to be specified. First, assume that
inventory at the end of the horizon has no salvage value and incurs no holding
cost (in the deterministic case the final inventory level equalled 0). Second, assume
that production x is made large enough so that a stockout never occurs, implying
the restriction

(7) entering inventory + production $\ge 4$.

Since demand may be as small as 2 and ending inventory cannot be larger than
4, the constraint

(8) entering inventory + production $\le 6$

must also be satisfied.

The most important observation is that the state variable in this stochastic
version continues to be the level of entering inventory. Think about why. Since
the random demands are completely independent, the only thing that matters
about the previous history when there are n periods remaining is the level of
inventory currently available. Assuming that the objective function is to minimize
expected cost over the horizon, the symbol $f_n(i)$ can now be interpreted as the
expected cost of operating an optimal policy when entering inventory is at level i
and there are n periods remaining in the horizon.

The case of n = 1 is easy to compute:

(9) $f_1(i) = C(4 - i)$ for i = 0, 1, ..., 4.

Explain why the assumptions justify that the optimal production level x is 4 - i
when n = 1.

For larger values of n, the expected cost of an optimal policy, given the level i,
can be reasoned as follows. First, you have to add in the production cost C(x)
for producing x. Second, you have to include the expected holding cost on inventory
at the end of the period, where ending inventory is the quantity $(i + x - D)$. And
finally, you have to account for the cost that will be incurred in later periods. But
this last amount is simply the expected value $E[f_{n-1}(i + x - D)]$, where D has the
probability distribution in (6). State why. Thus the appropriate dynamic programming
recursion for the stochastic model is, for n = 2, 3, ...,

(10) $f_n(i) = \min_{x} \, \{ C(x) + 1 \cdot [\tfrac{1}{2}(i + x - 2) + \tfrac{1}{2}(i + x - 4)] + \tfrac{1}{2} f_{n-1}(i + x - 2) + \tfrac{1}{2} f_{n-1}(i + x - 4) \}$

$\qquad\;\; = \min_{x} \, \{ C(x) + 1 \cdot [i + x - 3] + \tfrac{1}{2} [f_{n-1}(i + x - 2) + f_{n-1}(i + x - 4)] \}$,

where i = 0, 1, ..., 4, and the minimization is over only nonnegative integer
values in the range $4 - i \le x \le \text{minimum}(5, 6 - i)$.

Note that the major difference between the recursion for the deterministic
problem (4) and for the stochastic problem (10) is that in the latter an extra
amount of arithmetic must be performed to compute $E[f_{n-1}(i + x - D)]$.

In Fig. 17.2 optimal production decisions $x_n(i)$ for n = 1, 2, ..., 5 are shown
along with $f_n(i)$. It can be proved that the optimal policy for n = 3 is also optimal
for all n > 3.
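
The recursion (9) and (10) is short enough to compute directly. The Python sketch below is
one possible implementation; the function name `solve` and the dictionary layout are
choices made for this illustration, and the value C(0) = 0 is assumed for the cost of
producing nothing.

```python
# Data for the stochastic Dependable Manufacturing example.
C = {0: 0, 1: 15, 2: 17, 3: 19, 4: 21, 5: 23}   # production cost; C(0) = 0 is assumed
H = 1                                            # holding cost per unit of ending inventory
DEMAND = {2: 0.5, 4: 0.5}                        # distribution (6)


def solve(horizon):
    """Return (f, x_opt) where f[n][i] is the minimum expected cost and
    x_opt[n][i] the optimal production with n periods left and entering inventory i."""
    # Equation (9): in the last period produce just enough to reach a total of 4.
    f = {1: {i: float(C[4 - i]) for i in range(5)}}
    x_opt = {1: {i: 4 - i for i in range(5)}}
    for n in range(2, horizon + 1):
        f[n], x_opt[n] = {}, {}
        for i in range(5):
            best = None
            # Constraints (7) and (8) and the production limit of 5.
            for x in range(4 - i, min(5, 6 - i) + 1):
                cost = C[x] + sum(
                    prob * (H * (i + x - d) + f[n - 1][i + x - d])
                    for d, prob in DEMAND.items()
                )
                if best is None or cost < best[0]:
                    best = (cost, x)
            f[n][i], x_opt[n][i] = best
    return f, x_opt


if __name__ == "__main__":
    f, x_opt = solve(5)
    for n in range(1, 6):
        row = ", ".join(f"i={i}: x={x_opt[n][i]}, f={f[n][i]:.2f}" for i in range(5))
        print(f"n={n}: {row}")
```

The inner sum is the extra arithmetic referred to above: it averages the holding cost and
the cost-to-go over the two possible demands instead of evaluating them at a single value.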

A comparison of the results in Fig. 17.2 with those for the deterministic version
in Fig. 8.12 (p. 270) indicates:

1. For any initial inventory, i, and horizon, n, the expected cost from an optimal
policy is higher when demands are stochastic.

2. Given i, the optimal decision $x_n(i)$ never decreases as n gets larger when
demands are stochastic.

3. The optimal policy for an extended horizon is obtained as soon as n = 3
when demands are stochastic, as compared to n = 18 in the deterministic
version.

Of course, these conclusions arise from the numerical values of the specific
example, but the illustration is evidence of the general observation that the
solution characteristics for deterministic and stochastic versions of a
model can be quite disparate.


Figure 17.2. Dependable Manufacturing Company Example. Stochastic
Version of Inventory Model.
[Table of optimal production $x_n(i)$ and expected cost $f_n(i)$ by entering inventory i
and horizon n. The only entries surviving in this copy are a production level of 0
paired with expected costs 20, 37.75, 54.87, and 72.62.]


17.5 OPTIMAL BATCH SIZE MODEL 

The Voltex Company, a manufacturer of industrial tape recorders, purchases 
from a supplier some very expensive and specially made electronic components. 
The company does not maintain an inventory of these components, since Voltex 
produces only to customer order and the specifications vary widely from customer 
to customer. Because of the exceptionally high quality standards required for the 
tape recording equipment, components often fail from a short circuit during the 
performance inspection phase. In this event, Voltex can return the components 
to the supplier for a full refund. In order to avoid the expense and inconvenience 
of having to delay production until more components arrive from the supplier, 
Voltex initially orders a larger batch than it actually needs. But if Voltex orders 
too large a batch and consequently has excess components, it has to pay a cost 
penalty. (In some instances, the supplier will repurchase the excess items at a 
lower price than Voltex paid. But in other instances, the supplier will not buy 
back the component because it is so special that it is worthless to anyone else.) 

The manufacturer’s decision problem is to choose an optimum batch size. 
If the production manager at Voltex knew exactly how many components would 
fail for each batch size, his decision problem would be simple. But he does not— 
and that makes the analysis nontrivial. 

Let

c = cost per electronic component purchased from the supplier

v = salvage value per component in excess of required amount
(where $0 \le v \le c$)

K = expense of reordering if additional components are required

$p_x(j)$ = probability that j out of a batch of x components fail
during the inspection process [where $p_x(x) < 1$].

As you think about the way an actual decision process of this sort would evolve
over time, you will discover how to formulate the problem as a multistage
optimization model. Suppose N undamaged components are required, and you place an
order for x. If j turn out to be defective, but $j \le x - N$, then you have met
your requirement, and x - N - j good components are left over as salvage.
But if j > x - N, then you still need N - x + j more. [For example, if to receive
10 good components, you order 15 and there are 6 defectives, then you still need
1 (= 10 - 15 + 6) more component.]

So when j > x - N, you have to reorder. Your decision problem at this point
will be of the same character as before, except now you need fewer components
(unless all that you ordered before turned out to be defective). In any event,
the only information you need to have about the outcome of the previous batch
is how many good components you still require. Thus, the appropriate state
variable is the number of undamaged components required.

Assume that an optimal policy is one that minimizes the expected total cost to
meet the requirements. Let n denote the amount required, and consider what the
total expected cost is if you order x. First, you must add in the actual purchase
cost less the expected cost of defective components from the batch, since these are
returned for credit. Second, in case there were so few defectives that you have
met your requirements and have some good ones left over, you must reduce the
overall expected cost by the expected salvage value. Third, in case there were so
many defectives that you have to reorder, you must also add in the reorder cost
K, weighted by the probability of having to reorder, and likewise the expected
cost because you need more components.

Consequently, letting f(n) denote the minimum expected cost when n working
components are required, verify that the values for f(n) must satisfy

(1) $f(n) = \min_{x} \Big\{ c \Big[ x - \sum_{j=0}^{x} j\, p_x(j) \Big] - v \sum_{j=0}^{x-n} (x - n - j)\, p_x(j) + \sum_{j=x-n+1}^{x} [K + f(n - x + j)]\, p_x(j) \Big\}$ for n = 1, 2, 3, ....

An x(n) that yields f(n) is an optimal batch size. The set of x(n), for n = 1, 2, 3,
..., is an optimal decision rule. (Explain how you would employ the rule.)

If N components are required, then f(n) is successively computed for n = 1, 2,
..., N. But note that (1) has to be manipulated slightly because the term f(n)
also appears on the right-hand side of (1) for j = x in the third summation.
[Explain what random event causes f(n) to appear on both sides of (1).] Accordingly,
(1) must be simplified to yield a form suited for computational solution.
The algebra is straightforward, and leads to the formula

(2) $f(n) = \min_{x} \, [1 - p_x(x)]^{-1} \Big\{ c \Big[ x - \sum_{j=0}^{x} j\, p_x(j) \Big] - v \sum_{j=0}^{x-n} (x - n - j)\, p_x(j) + K \sum_{j=x-n+1}^{x} p_x(j) + \sum_{j=x-n+1}^{x-1} f(n - x + j)\, p_x(j) \Big\}$,

where you ignore the final summation on the right for n = 1.

Numerical example. To illustrate (2), suppose

(3) c = 10, v = 0

and $p_x(j)$ is described by a binomial probability law

(4) $p_x(j) = \binom{x}{j}\, p^{j} (1 - p)^{x-j}$ for j = 0, 1, ..., x,

where p is the probability of a single component failing. Then Fig. 17.3 shows the
optimal batch size amounts x(n), when n components are required, and when the
failure probability p = 1/4 and 1/2, and the reorder cost K = 50 and 1000. Of course,
x(n) increases as the values of n, p, and K increase. Additional information on the
way x(n) varies with n and K is given in Fig. 17.4, where p = 1/2.

Figure 17.3. Voltex Company Batch Ordering Example (c = 10, v = 0).
Note: K = reorder cost; c = purchase cost; v = salvage value;
p = failure probability; n = number required.

Figure 17.4. Voltex Company Optimal Batch Size x(n) (c = 10, v = 0, p = 1/2).

                     K
    n      50     500    5000    50,000
    1       3       6       9        13
    2       6       9      13        17
    3       8      12      16        20
    5      12      17      22        27
   10      23      30      36        41

Note: K = reorder cost; c = purchase cost; v = salvage value;
p = failure probability; n = number required.

If the batch order size is x, then the expected number of usable components is
(1 - p)x. Therefore when $x = \bar{x} \equiv n/(1 - p)$, the expected number of usable
components equals the requirement n. In the example, the value x(n) is always larger
than $\bar{x}$. The difference between these two values, which can be termed the
safety component, is tabulated in Fig. 17.5. Notice that the difference increases
with n; however, the ratio of the difference to $\bar{x}$ decreases with n. (Explain
why such behavior is plausible.)

Figure 17.5. Voltex Company Safety Component of Optimal Batch Size (c = 10, v = 0).
Note: $\bar{x} = n/(1 - p)$; K = reorder cost; c = purchase cost; v = salvage value;
p = failure probability; n = number required.

One way to assess the economic impact of uncertainty is to compare the value of
f(n) with cn, which is the minimum cost (to Voltex) of obtaining n usable components.
The difference [f(n) - cn] can be interpreted as the most the manufacturer should be
willing to pay if the supplier can provide components that always work. This quantity
is evaluated in Fig. 17.3 for the numerical example. Note that [f(n) - cn] increases
with larger values for n, the reorder cost K, and the failure probability p, but as you
can verify, the value [f(n) - cn]/n decreases with n. (Explain why this behavior is
reasonable.)
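
The computational form (2) with the binomial law (4) can be evaluated with a short
program. The Python sketch below is illustrative only: the function names, the search
limit `x_max`, and the decision to let the search start at x = 1 (treating the
probability of a negative number of failures as zero) are assumptions of this example.

```python
from math import comb


def binom_pmf(x, j, p):
    """Binomial law (4): probability that j of x components fail."""
    return comb(x, j) * p**j * (1 - p) ** (x - j)


def batch_sizes(N, c, v, K, p, x_max=60):
    """Evaluate formula (2) for n = 1, ..., N (requires p < 1).

    Returns (f, x_opt): f[n] is the minimum expected cost and x_opt[n] an optimal
    batch size when n good components are still required (index 0 is unused).
    x_max is an assumed upper limit on the batch sizes searched.
    """
    f = [0.0] * (N + 1)
    x_opt = [0] * (N + 1)
    for n in range(1, N + 1):
        best = None
        for x in range(1, x_max + 1):
            pmf = [binom_pmf(x, j, p) for j in range(x + 1)]
            # Purchase cost less credit for returned defectives.
            cost = c * (x - sum(j * pmf[j] for j in range(x + 1)))
            # Expected salvage value of surplus good components (empty if x < n).
            cost -= v * sum((x - n - j) * pmf[j] for j in range(x - n + 1))
            # Reordering is needed when j >= x - n + 1.
            lo = max(0, x - n + 1)
            cost += K * sum(pmf[j] for j in range(lo, x + 1))
            # Cost-to-go for j < x; the j = x term has been moved to the left side.
            cost += sum(f[n - x + j] * pmf[j] for j in range(lo, x))
            cost /= 1 - pmf[x]
            if best is None or cost < best[0]:
                best = (cost, x)
        f[n], x_opt[n] = best
    return f, x_opt


if __name__ == "__main__":
    f, x_opt = batch_sizes(N=3, c=10, v=0, K=50, p=0.5)
    for n in range(1, 4):
        print(f"n={n}: order x={x_opt[n]}, expected cost f={f[n]:.2f}, "
              f"f - cn = {f[n] - 10 * n:.2f}")
```

With c = 10, v = 0, and p = 1/2 the computed batch sizes for small n can be compared with
the entries of Fig. 17.4, and the printed difference f(n) - cn is the quantity discussed in
the preceding paragraph.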


*17.6 SALES FORECASTING PROBLEM 

Each example so far had a one-dimensional state variable, and the random 
elements were completely independent. In the illustration below, we show how 
correlated random variables can be introduced at the expense of enlarging the 
dimension of the state variable. 

Consider the following simplified version of an actual situation. During the first
N weeks of a season, the Buss-Stout Company, a manufacturer of ladies' sweaters,
has some flexibility in adjusting the level of total production for each item in its
line. In the fashion-wear industry, as in many another, a manufacturer cannot
predict with certainty the total amount of orders it will receive for each item.
Since the company's salesmen call on the retail trade throughout the N weeks,
new bookings arrive continually during the season.

The manufacturing expenses include not only labor and materials costs but also 
costs due to adjusting the level of total production. The latter expense arises from 
the disruptions associated with substantially varying the size of the production line 
at the factory; it is particularly costly to increase the total production level late in 
the season. If the Buss-Stout Company sets the level for an item too high, it
accordingly has goods left over at the end of the season, and must dispose of them at a
loss. If the company schedules the level too low, it may lose potential sales revenue. 
And the company incurs a penalty cost if it tries to adjust the production schedule 
upward too much as it receives better information each week about the amount 
of total orders. In addition to the impact of the penalty cost, the variation in 
production level is also constrained to be within certain bounds, depending on the 
production level in the previous period as well as on how many periods in the 
season have elapsed. 

The manufacturer’s own season runs several months ahead of its customers’ 
season. Buss-Stout produces the fall line early in the year, for example. In general, 
the company completes its production before the retailers’ season begins, and sales 
revenue depends only on how many orders in total the company is able to fill by 
the time it ceases production. Consequently, Buss-Stout’s scheduling problem is 
essentially due to the uncertain elements, because if it could know the amount of 
total orders exactly, the company would produce at an even rate throughout its 
season. 

This scheduling problem for each item can be formulated by an optimization
model. Let

$D_t$ = accumulated bookings of the item at the beginning of Week t.

Since there are cancellations as well as new orders each week, $D_{t+1}$ may be smaller
than $D_t$. Assume that $D_{t+1}$ depends only on $D_t$ and is independent of earlier
bookings and the production level. Let

$p_t(D \mid d)$ = conditional probability that accumulated bookings $D_{t+1}$ will equal
D, given that accumulated bookings $D_t$ equal d, for t = 1, 2, ..., N - 1

$p_N(D \mid d)$ = conditional probability that total bookings for the season will be D,
given that accumulated bookings $D_N$ equal d.

The manufacturer bases the values of these probabilities on booking statistics
from previous seasons. (Suggest how you might use past data to obtain such
probabilities.)

Also let

r = dollar return from an item sold during the season

s = dollar return from an item disposed after the end of the season

$c_t(X \mid x)$ = cost of setting the total production level at X in Week t, given that
the level was set at x in the previous week,

where r and s include the direct labor and materials cost of manufacturing a unit,
and s < 0. Assume that $L_t(x)$ and $U_t(x)$ are the lower and upper limits on X,
representing the range of feasible levels given that x is the scheduled level just
prior to Week t.

A decision rule then consists of a strategy indicating a level for current production
X given values for previous production x and $D_t$; assume an optimal rule is
one that maximizes expected return. An optimal strategy may be found by solving
the dynamic programming recursion:

(1) $f_N(x, d) = \max_{L_N(x) \le X \le U_N(x)} \Big\{ \sum_{D \le X} [rD + s(X - D)]\, p_N(D \mid d) + rX \sum_{D > X} p_N(D \mid d) - c_N(X \mid x) \Big\}$

(2) $f_t(x, d) = \max_{L_t(x) \le X \le U_t(x)} \Big\{ -c_t(X \mid x) + \sum_{D} f_{t+1}(X, D)\, p_t(D \mid d) \Big\}$ for t = N - 1, ..., 1,

where the inequality D > X under the second summation sign in (1) indicates
that the sum is over every possible value of end-of-the-season accumulated
bookings that is greater than X, and the D under the summation sign in (2)
indicates that the sum is over every possible value for accumulated bookings
$D_{t+1}$.

The value of $f_1(X^*, D^*)$ represents the maximum expected return, assuming
that the preseason production level is set at $X^*$ and bookings-to-date at the start
of Week 1 are $D^*$. Notice that the state variable in (2) is two-dimensional; the
first component reflects the previous decision and the second indicates the realized
value of a random event.

To test your understanding of the formulation, write a definition for $f_N(x, d)$
and $f_t(x, d)$. Explain why the terms on the right of (1) represent the sum of expected
returns, depending on whether production exceeds total bookings or vice versa,
and the penalty cost of changing the production level. State how this model actually
uses sales forecast data. Indicate how the formulation (1) and (2) would differ
if $D_{t+1}$ depended on both $D_t$ and $D_{t-1}$.
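
To make the two-dimensional state concrete, here is a minimal Python sketch of
recursions (1) and (2) over small finite grids of production levels and bookings. The
function name `solve_bookings`, the callable arguments, and the sample data are all
hypothetical; the sketch also assumes that every feasible range contains at least one
grid level and that the transition probabilities for t < N place bookings on the grid.

```python
def solve_bookings(N, levels, bookings, r, s, c, p, L, U):
    """Backward recursion (1)-(2) for the production-level model.

    N        -- number of decision weeks
    levels   -- candidate production levels (grid for x and X)
    bookings -- possible values of accumulated bookings (grid for d)
    r, s     -- return per item sold in season / disposed of afterward
    c(t, X, x) -- cost c_t(X|x) of moving the level from x to X in Week t
    p(t, d)  -- dict {D: probability}: p_t(D|d); for t = N, D is total season bookings
    L(t, x), U(t, x) -- lower and upper limits on X in Week t
    Returns (f, rule), both indexed as f[t][(x, d)].
    """
    f, rule = {}, {}
    for t in range(N, 0, -1):
        f[t], rule[t] = {}, {}
        for x in levels:
            for d in bookings:
                best = None
                for X in levels:
                    if not (L(t, x) <= X <= U(t, x)):
                        continue
                    if t == N:
                        # Equation (1): sell min(D, X), dispose of any surplus.
                        value = sum(
                            (r * min(D, X) + s * max(X - D, 0)) * prob
                            for D, prob in p(N, d).items()
                        ) - c(N, X, x)
                    else:
                        # Equation (2): adjustment cost plus expected future return.
                        value = -c(t, X, x) + sum(
                            f[t + 1][(X, D)] * prob for D, prob in p(t, d).items()
                        )
                    if best is None or value > best[0]:
                        best = (value, X)
                f[t][(x, d)], rule[t][(x, d)] = best
    return f, rule


if __name__ == "__main__":
    grid = [0, 1, 2, 3]

    def p(t, d):
        # Hypothetical law: bookings stay put or rise by one, with equal chances.
        up = min(d + 1, 3)
        return {d: 1.0} if up == d else {d: 0.5, up: 0.5}

    f, rule = solve_bookings(
        N=2, levels=grid, bookings=grid, r=5, s=-3,
        c=lambda t, X, x: t * abs(X - x),       # later changes cost more
        p=p, L=lambda t, x: 0, U=lambda t, x: 3,
    )
    print(rule[1][(0, 1)], f[1][(0, 1)])
```

The dictionary `rule[t]` is the decision rule: it is consulted in Week t with the
previous production level and the bookings observed so far.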

17.7 STOCHASTIC REGENERATION MODEL— 

EQUIPMENT REPLACEMENT 

In this final illustration, we discuss both a finite and an unbounded horizon 
optimization problem. You should carefully review Sec. 11.4 before going further, 
as we build directly on the previous analysis and do not start afresh. 

In Sec. 11.4, you studied the following deterministic problem: each time a
decision process regenerates itself, the decision-maker must choose among N
alternatives, which are indexed k = 1, 2, ..., N. If Alternative k is selected at a
Period t, the next regeneration occurs at Period t + k. Let

(1) $R_k$ = cost of Alternative k valued at the start of a regeneration period.

Then for the finite horizon problem, we defined

(2) $f_n$ = present value of an optimal regeneration policy in which an
alternative must be chosen when n periods remain until the
end of the planning horizon,

and these values must satisfy the recursion

(3) $f_n = \min_{k = 1, 2, \ldots, N} \, [\alpha^{k} f_{n-k} + R_k]$, $f_0 = 0$, for $0 \le \alpha \le 1$,

where $n \ge N$. (For n < N, the minimum is taken over k = 1, 2, ..., n.)

In the infinite horizon problem the appropriate extremal equation is

(4) $f = \min_{k = 1, 2, \ldots, N} \, [\alpha^{k} f + R_k]$ for $0 \le \alpha < 1$,

yielding

(5) $f = \min_{k = 1, 2, \ldots, N} \Big[ \dfrac{R_k}{1 - \alpha^{k}} \Big]$ (present value).

Re-expressing (5) in terms of equivalent average return as 
(fi) S = minimum | , . _ 

gives 

(7) g = minimum 1 for O' = 1 (average return per period), 

*-1,2,....JV LK ^ 
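As a concrete check on the deterministic relations (3), (5), and (7), the following Python sketch evaluates them for a small example. It reads (3) as $f_n = \min_k [\alpha^k f_{n-k} + R_k]$ with $f_0 = 0$; the cost values in `R` and all function names are illustrative assumptions, not data or notation from the text.

```python
def finite_horizon(R, alpha, n_max):
    """Recursion (3): f_n = min over k of alpha**k * f_{n-k} + R_k, with f_0 = 0.

    R[k-1] holds the cost R_k of Alternative k, for k = 1, ..., N.
    Returns the values f[0..n_max] and an optimal first choice best_k[n].
    """
    N = len(R)
    f = [0.0] * (n_max + 1)
    best_k = [0] * (n_max + 1)
    for n in range(1, n_max + 1):
        # When fewer than N periods remain, only k = 1, ..., n are allowed.
        options = [(alpha**k * f[n - k] + R[k - 1], k)
                   for k in range(1, min(n, N) + 1)]
        f[n], best_k[n] = min(options)
    return f, best_k


def present_value(R, alpha):
    """Relation (5): minimize R_k / (1 - alpha**k), valid for 0 < alpha < 1."""
    return min((R[k - 1] / (1.0 - alpha**k), k) for k in range(1, len(R) + 1))


def average_cost(R):
    """Relation (7): minimize R_k / k, the case alpha = 1."""
    return min((R[k - 1] / k, k) for k in range(1, len(R) + 1))


# Hypothetical costs for N = 3 alternatives (not taken from the text).
R = [10.0, 14.0, 24.0]

f, best_k = finite_horizon(R, alpha=1.0, n_max=4)
pv, k_pv = present_value(R, alpha=0.9)
g, k_avg = average_cost(R)
print(f, best_k)
print(pv, k_pv)
print(g, k_avg)
```

With these assumed costs, both the present-value and the average-cost criteria select k = 2, and the finite horizon values approach a pattern of repeating that choice as n grows.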

Equipment replacement. An important example of a regeneration
problem is equipment replacement. In the deterministic version, the value of
k represents the length of time that a new piece of equipment is actually used.

For a stochastic version of this problem, suppose that the equipment may break
down before the planned replacement in Period t + k. That is, if at Period t
the planned replacement decision is k but the machine breaks down during
Period t + j, for j < k, then assume the equipment must be replaced at the start
of Period t + j + 1. Let

k = planned replacement interval

$p_j$ = probability that the equipment breaks down for the first time during
the jth period of usage

$r_j$ = cost of operating the equipment during the jth period of usage if
the equipment does not break down

$r_j + s_j$ = cost of operating the equipment if it does break down during the
jth period of usage when j < k (where $s_j > 0$),

and where $\sum_j p_j = 1$, $r_1$ includes the initial purchase cost of the equipment, and
for expositional simplicity, the equipment is assumed to have no salvage value.
You can interpret $s_j$ as a penalty cost for early breakdown.

Assume that an optimal policy is one that minimizes expected discounted cost.
If you are at a regeneration period and your planned-replacement decision is k,
then expected discounted cost comprises the following. First, you must add
in the expected discounted cost incurred at the next regeneration point and beyond
in the event that the equipment breaks down before the planned replacement
period. Second, you must include the expected discounted cost at the next
regeneration point and beyond in the event that the equipment does not break down
before the planned replacement period. And finally, you must add the expected
operating cost between this and the next regeneration point.

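Accordingly, for a finite horizon, the appropriate generalization of (3) for
$n \ge N$ and $0 < \alpha \le 1$ is

(8) $f_n = \min_{k=1,2,\ldots,N} \left[ \sum_{j=1}^{k-1} \alpha^j p_j f_{n-j} + \alpha^k \left( 1 - \sum_{j=1}^{k-1} p_j \right) f_{n-k} + R_k \right], \qquad f_0 = 0,$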

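and where now

(9) $R_k = r_1 + \alpha r_2 (1 - p_1) + \alpha^2 r_3 (1 - p_1 - p_2) + \cdots + \alpha^{k-1} r_k \left( 1 - \sum_{j=1}^{k-1} p_j \right) + \sum_{j=1}^{k-1} \alpha^{j-1} s_j p_j.$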


Here and in the formulas below, ignore the summations whenever k = 1. [As
before, for n < N, the minimum in (8) is taken over k = 1, 2, ..., n.] Verify
that (8) reduces to (3) when $p_j = 0$, for j = 1, 2, ..., N. Also note that $R_k$ in
(9) depends on $\alpha$, and $\alpha$ has been omitted from the notation only for simplicity.
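The finite horizon recursion (8) is short enough to code directly. The sketch below is one way to do so in Python, under stated assumptions: the one-cycle cost $R_k$ is computed by charging each period's cost at the start of that period (discount $\alpha^{j-1}$), paying $r_j$ if the equipment is still in service and adding $s_j$ with probability $p_j$ when j < k. The data and the function names are hypothetical and are not those of Fig. 17.6.

```python
def cycle_cost(k, r, s, p, alpha):
    """Expected discounted operating cost R_k for planned interval k.

    Assumed form: the period-j cost is charged at the start of period j, so it
    is discounted by alpha**(j-1); r_j is paid if the equipment is still in
    service at the start of period j, and the penalty s_j is added with
    probability p_j when j < k.
    """
    total = 0.0
    in_service = 1.0  # probability of no breakdown during periods 1..j-1
    for j in range(1, k + 1):
        total += alpha**(j - 1) * r[j - 1] * in_service
        if j < k:
            total += alpha**(j - 1) * s[j - 1] * p[j - 1]
            in_service -= p[j - 1]
    return total


def stochastic_finite_horizon(r, s, p, alpha, n_max):
    """Recursion (8) with f_0 = 0; returns f[0..n_max] and best_k[0..n_max]."""
    N = len(r)
    f = [0.0] * (n_max + 1)
    best_k = [0] * (n_max + 1)
    for n in range(1, n_max + 1):
        options = []
        for k in range(1, min(n, N) + 1):
            # Breakdown in period j < k: regenerate with n - j periods left.
            early = sum(alpha**j * p[j - 1] * f[n - j] for j in range(1, k))
            # No breakdown before the planned replacement after k periods.
            planned = alpha**k * (1.0 - sum(p[:k - 1])) * f[n - k]
            options.append((early + planned + cycle_cost(k, r, s, p, alpha), k))
        f[n], best_k[n] = min(options)
    return f, best_k


# Hypothetical data for N = 3 (not the data of Fig. 17.6).
r = [10.0, 4.0, 10.0]
s = [5.0, 5.0, 5.0]
p = [0.5, 0.3, 0.2]

f, best_k = stochastic_finite_horizon(r, s, p, alpha=1.0, n_max=4)
f_det, _ = stochastic_finite_horizon(r, s, [0.0, 0.0, 0.0], alpha=1.0, n_max=4)
print(f, best_k)
print(f_det)
```

Setting every breakdown probability to zero, as in `f_det`, reproduces the deterministic recursion (3), which is the reduction the text asks you to verify.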


Unbounded horizon. For an infinite horizon, drop all the subscripts
where f appears in (8). This gives the stochastic analogue of (4):

(10) $f = \min_{k=1,2,\ldots,N} \left( E[\alpha^j \mid k]\, f + R_k \right) \qquad \text{for } 0 < \alpha < 1,$

where

(11) $E[\alpha^j \mid k] = \sum_{j=1}^{k-1} \alpha^j p_j + \alpha^k \left( 1 - \sum_{j=1}^{k-1} p_j \right)$ (expected discount factor),

and $R_k$ is given by (9). Analogous to (5), you can rewrite (10) as

(12) $f = \min_{k=1,2,\ldots,N} \left[ \frac{R_k}{1 - E[\alpha^j \mid k]} \right] \qquad \text{for } 0 < \alpha < 1$ (expected present value).

Here f represents the expected value of discounted cost over an unbounded horizon 
when an optimal policy is followed. 

The formulations (10) and (12) take for granted that it is optimal to use a 
stationary strategy (always select k as the planned replacement interval each time 
a new piece of equipment is purchased). It can be proved rigorously that such 
an assumption is warranted (actually the proposition can be shown to follow from 
the Stationary Strategy Policy Theorem in Sec. 18.3). Thus, once the expected 
discount factors in (11) have been calculated, the optimization in (12) is no more 
complex than that in (5). 
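To illustrate how little extra work the stochastic case requires, here is a minimal Python sketch of the optimization in (12). It takes the expected discount factor to be $E[\alpha^j \mid k] = \sum_{j=1}^{k-1} \alpha^j p_j + \alpha^k (1 - \sum_{j=1}^{k-1} p_j)$ and assumes the one-cycle costs $R_k$ have already been calculated; the numbers and function names are hypothetical.

```python
def expected_discount(k, p, alpha):
    """Relation (11): E[alpha**j | k] for planned replacement interval k."""
    early = sum(alpha**j * p[j - 1] for j in range(1, k))
    return early + alpha**k * (1.0 - sum(p[:k - 1]))


def expected_present_value(R, p, alpha):
    """Relation (12): minimize R_k / (1 - E[alpha**j | k]) for 0 < alpha < 1.

    R[k-1] is the expected one-cycle cost R_k, assumed already computed
    from (9) for this value of alpha.
    """
    return min((R[k - 1] / (1.0 - expected_discount(k, p, alpha)), k)
               for k in range(1, len(R) + 1))


# Hypothetical inputs (not from the text).
p = [0.5, 0.3, 0.2]
R = [10.0, 14.0, 17.0]
alpha = 0.9

f, k_star = expected_present_value(R, p, alpha)
print(f, k_star)
```

The returned value satisfies the extremal equation (10) for the selected k, which gives a quick consistency check on any implementation.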

For the case $\alpha = 1$, it is necessary to re-express (12) in terms of expected
equivalent average return [$g = (1 - \alpha) f$]:

(13) $g = \min_{k=1,2,\ldots,N} \left[ \frac{(1 - \alpha) R_k}{1 - E[\alpha^j \mid k]} \right] \qquad \text{for } 0 < \alpha < 1,$

which can be shown to yield

(14) $g = \min_{k=1,2,\ldots,N} \left[ \frac{R_k}{E[j \mid k]} \right] \qquad \text{for } \alpha = 1$ (expected cost per period),

where $R_k$ is calculated in (9) with $\alpha = 1$, and

(15) $E[j \mid k] = \sum_{j=1}^{k-1} j\, p_j + k \left( 1 - \sum_{j=1}^{k-1} p_j \right)$ (expected replacement interval).

Observe that (15) represents the expected number of periods that each piece
of equipment is used, given that the planned replacement interval is k. Thus the
value g in (14) can be interpreted as the minimum ratio of expected cost during the
interval between successive replacements to expected interval length. Or, loosely speaking,
you can say that g is the minimum expected cost per period over an unbounded
horizon. Here, too, using a stationary strategy can be justified as optimal.

Figure 17.6. Stochastic Replacement Model Example.

An illustration of the case $\alpha = 1$ is shown in Fig. 17.6. Notice that k = 3 is
optimal. In this illustration, costly mistakes occur if you do not correctly account
for uncertainty. For example, verify that if the erroneous criterion $(R_k / k)$ were
used, then the decision would be k = 5; and if $(\sum_{j=1}^{k} r_j / k)$ were used, the decision
would be k = 4. In these instances, expected cost per period would increase
nearly 20% above the optimal value.
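The effect of ignoring uncertainty can be seen with a few lines of Python. The sketch below compares the ratio of expected cycle cost to expected replacement interval with the naive ratio $R_k / k$. The probabilities and costs are hypothetical stand-ins (they are not the entries of Fig. 17.6), and `best_by` is a helper introduced only for this illustration.

```python
def expected_interval(k, p):
    """Relation (15): expected number of periods a unit is used, E[j | k]."""
    early = sum(j * p[j - 1] for j in range(1, k))
    return early + k * (1.0 - sum(p[:k - 1]))


def best_by(criterion, N):
    """Return (value, k) minimizing criterion(k) over k = 1, ..., N."""
    return min((criterion(k), k) for k in range(1, N + 1))


# Hypothetical data (not the data of Fig. 17.6); R[k-1] is R_k at alpha = 1.
p = [0.5, 0.3, 0.2]
R = [10.0, 14.5, 18.0]
N = len(R)

# Criterion (14): expected cycle cost divided by expected interval length.
g, k_opt = best_by(lambda k: R[k - 1] / expected_interval(k, p), N)
# Naive criterion: divide by the planned interval, ignoring early failures.
_, k_naive = best_by(lambda k: R[k - 1] / k, N)
# Expected cost per period actually incurred by the naive choice.
g_naive = R[k_naive - 1] / expected_interval(k_naive, p)
print(k_opt, g, k_naive, g_naive)
```

In this assumed example the naive criterion picks a longer planned interval than (14) does, and the expected cost per period it actually incurs is higher than the optimum.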

► It is straightforward to justify the mathematical manipulations that yield (14) and (15)
as the limit of (13) when $\alpha$ approaches 1. For example, you can make a substitution
$\epsilon = 1 - \alpha$ in (13), use the binomial expansion in the denominator, and let $\epsilon$ approach
0. Or more directly, you can simply apply L'Hospital's rule. [Use these approaches to
verify (14) and (15).] However, it is not trivial to give a rigorous justification of the
interpretation that g is an expected cost per period over an unbounded horizon for the 
policy selected by (14). The reason is that there are several ways to define the meaning 
of “expected cost per period over an unbounded horizon.” 

For example, one way is to define such a quantity as the limit of the ratio of expected 
costs during n periods to n, as n gets arbitrarily large. Another way is to define the quantity 
as the limit of the ratio of expected costs during T replacement intervals to the total number of 
periods in the combined intervals, as T gets arbitrarily large. The questions to be answered 
are, “Do these limits exist?” “If so, are they equal?” It can be demonstrated that the
answers are both “yes” (assuming that the corresponding quantity is finite). ▲
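The limiting argument that leads from the discounted criterion to the average-cost criterion can also be checked numerically. Since $R_k$ varies continuously with $\alpha$, the essential step is that $(1 - \alpha) / (1 - E[\alpha^j \mid k])$ approaches $1 / E[j \mid k]$ as $\alpha$ approaches 1. The Python sketch below evaluates that factor for a hypothetical set of breakdown probabilities; it is a numerical illustration under those assumptions, not a proof.

```python
def expected_discount(k, p, alpha):
    """E[alpha**j | k], as in (11)."""
    return (sum(alpha**j * p[j - 1] for j in range(1, k))
            + alpha**k * (1.0 - sum(p[:k - 1])))


def expected_interval(k, p):
    """E[j | k], as in (15)."""
    return sum(j * p[j - 1] for j in range(1, k)) + k * (1.0 - sum(p[:k - 1]))


p = [0.5, 0.3, 0.2]   # hypothetical breakdown probabilities
k = 3
limit = 1.0 / expected_interval(k, p)

gaps = []
for alpha in (0.9, 0.99, 0.999, 0.9999):
    factor = (1.0 - alpha) / (1.0 - expected_discount(k, p, alpha))
    gaps.append(abs(factor - limit))
print(gaps)
```

The gaps shrink roughly in proportion to $1 - \alpha$, which is what a first-order expansion of the denominator would suggest.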

17.8 APPLICABILITY AND COMPUTATIONAL FEASIBILITY 
OF STOCHASTIC DYNAMIC PROGRAMMING MODELS 

You will find it helpful to review the summary remarks about dynamic
programming in Secs. 10.9 and 10.10, since they are equally pertinent to models
with stochastic elements. In brief, the most important applications of these models
are in special contexts, such as finding optimal inventory replenishment rules,
minimal cost maintenance and replacement policies, efficient waiting line
disciplines, etc. Beyond Chap. 18, this book focuses on obtaining deeper
understanding of these particular types of phenomena, and, when possible, on finding special
solution techniques that exploit the structure of the phenomenon considered.

In sharp contrast to the situation pertaining to the application of linear
programming, no general purpose dynamic programming computer codes are widely
available. This does not represent any real drawback to the use of the technique, 
however, since the computational formulas for any specific problem are simple 
enough to be coded for computer solution on an ad hoc basis with only a modest 
effort. 

Using advanced analytic methods, it is possible to solve stochastic dynamic
programming models with less restrictive assumptions than those made in this
chapter. For example, you sometimes can analyze situations in which time is
treated as a continuous variable, and where the states and decisions are not
necessarily discrete or finite. And in many of these cases, the approaches in this chapter
are generalized in a straightforward fashion. But such extensions will not be 
pursued in this text. 

Chapter 18 treats models in which the time horizon is unbounded. Rather 
than deal with a host of special examples, the results are given in terms of a network 
formulation; thus the approach in many ways parallels the exposition in Chap. 
12. The most significant results cited are the theorems establishing the optimality 
of stationary policies. Whenever a specific model has a structure meeting the 
assumptions in the next chapter, then you can restrict your search to stationary 
strategies without any loss of optimality. The chapter also presents numerical 
techniques for solving such general network models. But as we pointed out above, 
you can improve on these general techniques in most applications by exploiting 
the special structure of the particular model. You will learn how to do this in 
Chap. 19 and beyond. 


REVIEW EXERCISES 


1 Consider the Distribution of Effort Model in Sec. 17.2. Assume the demand distribu¬ 
tion at Store j is pj{d) = for ^/ = 0, 1,..., 4. In each part below, calculate the 
expected profit $R_j(y_j)$, for $y_j = 0, 1, \ldots, 5$, given by (10), employing the specified
profit function $r_j(d \mid y)$, as defined by (7).

(a) $r_j(d \mid y) = \begin{cases} 10d & \text{for } d \le y \\ 10y & \text{for } d > y. \end{cases}$

(b) $r_j(d \mid y) = \begin{cases} 10d & \text{for } d \le y \\ 10y + 8(d - y) & \text{for } d > y. \end{cases}$

(c) Rework part (b), using the coefficient 6, instead of 8.

(d) $r_j(d \mid y) = \begin{cases} 10d & \text{for } d \le y \\ 10y - 2 & \text{for } d > y. \end{cases}$

(e) Rework part (d), using the constant 4, instead of 2.

*(f) Suggest situations that might give rise to the profit functions in parts (b) and (d).
(Hint: consider the possibilities of the customer purchasing another brand or the
store manager expediting a resupply.)

2 Answer the questions in exercise 1, parts (a) through (e), except let the demand 
distributions be 

(a) ^j(</)=^^forrf=0,l,...,4. 






(c) ^,(0) = ^^(4) = =Pi{3) = hpj(2) = 

(d) Pi(0)=P}W = =y>j(3) = A,^j(2) = iV. 

*3 Shopping Basket Markets (Sec. 17.2). Suppose the demands at the s stores are correlated.
In particular, assume there are Q possible sets of values for the demands
($D_1, D_2, \ldots, D_s$) at Stores 1, 2, ..., s, and $p_q$ is the probability of the qth set of
values. Show how to determine the cost of uncertainty. (That is, indicate how to
calculate the maximum value to the owner of having a perfect prediction of demand
before he allocates the eggs.)

4 Cool It Company (Sec. 17.3). Suppose the data pertinent to the decision tree in Fig. 
17.1 are 


hx = 40 = 50 ri = 60 == -~ 

= 60 ^2 = 70 r 2 = 100 />2 =*= "^ 

^3=70 ^3 = 80 - />3=1- 

(a) Calculate $f_3$, $f_2$, $f_1$ and exhibit the optimal strategy.

(b) Give the probability distribution for the decision process terminating in Period t,
for t = 1, 2, 3.

(c) Explain why the expected profit is different at the beginning of Period 1 from 
what it is at the beginning of Period 2, given that the decision process continues 
beyond the first period. 

(d) What is the largest value for ry such that the Period 1 decision remains optimal? 
Similarly, what is the smallest value for /ij? Similarly, what is the smallest value 
for r 2 ? 

(e) Assuming that the decision process lasts beyond the first period, what is the 
smallest value for r 2 such that the Period 2 decision remains optimal ? Similarly, 
what is the largest value for ^ 2 ? Similarly, what is the largest value for ^ 3 ? 

(f) What is the largest value for $p_1$ such that the Period 1 decision remains optimal?
Similarly, what is the largest value for $p_2$?

(g) Assuming that the decision process lasts beyond the first period, what is the 
smallest value for p^ such that the Period 2 decision remains optimal? 

*(h) Suppose that the $p_t$ are accurate probabilities for the competitors' decisions, but
that the President can purchase information indicating whether the competitors
will introduce before he has to make his own decision. If the decision process
lasts beyond the first period, the President can again find out whether the
competitors will introduce in Period 2, etc. How much is such perfect information worth?

*5 Cool It Company (Sec. 17.3). Show how the value of $f_1$ in (2) can be obtained from a
linear programming formulation. (Show all the constraints for t = 1, 2, ..., T and
each decision; display the objective function.) Exhibit the model in detail for T = 3.
Write the dual problem and interpret the variables and constraints. (Hint: the
variables in the dual problem can be viewed as probabilities.)

Exercises 6 through 12 refer to the stochastic inventory model described in Sec.
17.4.

6 (a) Justify in detail the constraints (7) and (8). 

(b) Explain why the optimal production level x is 4 — i when n = 1. 

(c) Justify in detail each term after the first equality in (10). 

(d) Suppose the holding cost function is A - (i + x — /))* for i x > and 0 other¬ 
wise. How is (10) modified? 

(e) Verify the entries in Fig. 17.2 for n = 2. For n = 3. 

(f) Suppose there are three periods remaining in the horizon and entering inventory 
is 1. What is the probability distribution for the optimal production quantity in 
the next period? In the final period? 

7 Suppose the demand distribution is P[P = 1] = ^ and P\P = 4] = §. 

(a) Write the appropriate recursion analogous to (10). 

(b) Calculate $f_n(i)$ and $x_n(i)$, analogous to the entries in Fig. 17.2, for n = 1 and 2.
For n = 3.


8 Suppose the demand distribution is P[D = ^/] = ^ for ^/ = 2, 3, 4. 

(a) Write the appropriate recursion analogous to (10). 

(b) Calculate $f_n(i)$ and $x_n(i)$, analogous to the entries in Fig. 17.2, for n = 1 and 2.
For n = 3.

9 Give a plausible argument explaining why the optimal policy “settles down” so much
faster (at n = 3) than it did in the deterministic version (at n = 18).

10 Suppose you drop the assumption that a stockout never occurs, so that the restriction
in (7) is eliminated; whenever demand exceeds entering inventory plus production,
the excess is lost. Suppose, however, the company earns revenue r per item of demand
filled. Let the symbol p(q) denote the probability that demand is q, where
q = 0, 1, 2, .... Show how to alter the recursion (10) accordingly. (Note: the objective
is to maximize expected profit.)

11 (a) Suppose the minimization criterion is expected discounted cost, where the one-period
discount factor is $\alpha$, $0 \le \alpha \le 1$. Show how to alter the recursion (10) accordingly.

(b) Suppose there is a probability q, $0 \le q \le 1$, representing at each period the chance
that the items will not be needed anymore in future periods. In other words,
q is the probability that the horizon does not last beyond the current period; if,
however, the items are demanded in the next period, then again probability q is
the chance that the horizon does not extend beyond the second period, etc. Show
how to alter the recursion (10) accordingly.

12 Suppose the company can obtain a perfect forecast of each period's demand. That is,
prior to choosing production x, suppose the company can learn the actual level of
demand. [This amount still varies from period to period according to the probability
distribution in (6).]

(a) Formulate a dynamic programming recursion to find an optimal policy. (Hint:
you can let the state variable be entering inventory minus current demand.)

(b) Calculate optimal policies for n = 1, 2, 3, 4.

(c) What is the maximum worth to the company of having such a perfect forecast
if the horizon n = 1? If n = 2? If n = 3? Assume initial inventory at the start
of the horizon equals 0.

13 Voltex Company (Sec. 17.5)

(a) Give a verbal interpretation of each term in the recursion (1). 

*(b) Give the detailed algebraic justification for transforming (1) into (2). 

(c) Consider the data in Fig. 17.3. Suppose you require three items, K = 50, and 
^ Explain the optimal strategy in detail. What is the maximum number of 
items that may be ordered ? (Be careful!) Is it possible that you would ever place 
an order for three items more than once? 

*(d) What is the expected number of items ordered in part (c)?

(e) How do your answers to the questions in part (c) change if/? = ^ ? 

14 Voltex Company (Sec. 17.5). Consider the policies in Fig. 17.3. Verify the values of x(n)
and f(n) for

(a) yC = 50, « = 1, 2, 3, and p = i- 

(b) iC=50,n= l,/> = ^ 

(c) K= 1000, n= lyp = i, 

15 Voltex Company (Sec. 17.5). Suppose the salvage value v > 0. How are x(n) and f(n) in
Fig. 17.3 affected? Corroborate your economic intuition by letting v — 5, K = 50, 
p = iy and finding A:(n) and f{n) for n = 1,2, 3. 

16 Voltex Company (Sec. 17.5) 

(a) Consider the tabulation of the safety components in Fig. 17.5. Give a plausible 
explanation of why the ratio of the safety component to = «/(l — p) decreases 
with n, 

(b) Give a plausible explanation of why the ratio [f(n) − cn]/n decreases with n, where
the bracketed quantity is tabulated in Fig. 17.3.

17 Consider the Optimal Batch Size Model in Sec. 17.5. Suppose N working items are
required. Show how the value of f(N) can be obtained from a linear programming
formulation. (Show all the constraints for n = 1, 2, ..., N, and each x; assume that
you have an upper bound $X_n$ on x when n items are required.) Write the entire problem
in detail for N = 3, $X_1 = 3$, $X_2 = 4$, and $X_3 = 5$. Also write the dual problem and
interpret the variables and constraints. (Hint: the variables in the dual problem can
be viewed as probabilities.)

*18 Buss-Stout Company (Sec. 17.6)

(a) Explain each term in the dynamic programming recursions (1) and (2). (Specifically,
state how the formulation includes the expenses of labor and materials, as
well as the costs of adjusting the level of total production. Indicate how disposal
loss is accounted for, and how the formulation implies that there is a revenue loss
if the production level is too low.) Write a definition for each of the two functions
of the state variable, including $f_t(x, d)$.

(b) Explain what factors influence the values for the lower and upper limits on
accumulated production in Week t, namely, $L_t(x)$ and $U_t(x)$. (Explain why these
limits ordinarily depend both on the scheduled level x in the previous week and
on the week itself.) Do the same for the cost function $C_t(X \mid x)$.

(c) Explain what decision rules emerge from the solution of the recursions. Does the 
analysis yield a production quantity for each week? If not, suggest how to deter¬ 
mine the weekly production level from the analysis. 

*(d) How does the formulation (1) and (2) differ if depends on both Dt and Dt-i? 
(Be sure to define any new symbols you introduce.) 

(e) Suppose that the disposal value j > 0. How, if at all, does this affect the recursion 
(1) and an optimal policy? 

(f) Many companies like this one use statistical demand forecasting techniques. Is the 
information conveyed by the probability distributions pi{D\d) and pji(D\d) the 
same as a demand forecast? Explain. Why would you expect these probabilities 
to depend on both the accumulated bookings in Week t as well as Week t itself? 

Exercises 19 through 25 refer to the Stochastic Replacement Model described 
in Sec. 17.7. 

19 Consider the finite horizon problem. 

(a) Explain each term in the recursion (8). 

(b) Derive the formula (9) for $R_k$. (Start with the conceptualization that the
equipment may break down in either the first, second, third, ..., kth period of usage,
or not at all during the planned replacement interval.)

*20 Consider the finite horizon problem. Let the horizon length be 6 and N = 3. Show
how $f_6$ also can be found by a linear programming formulation. Write the dual problem
and interpret the variables and restrictions.

21 Consider the infinite horizon problem. 

(a) Perform the algebra justifying the statement that dropping the subscripts on f
in (8) yields (10).

(b) Why is the expression in (11) termed the expected discount factor? Give a verbal
interpretation of the formula. Answer the corresponding questions for the
expression in (15).

*(c) Show that (14) and (15) result from letting $\alpha$ approach 1 as a limiting value in
(13).

22 (a) Verify all the entries in the examples shown in Fig. 17.6. 

(b) Show why the policy k = 3 remains optimal even if the entries marked 0* are 
made positive. 

23 Consider the example in Fig. 17.6, where $\alpha = 1$. Suppose the horizon is finite and
let N = 5.

(a) Apply the recursion (8) to find an optimal policy for n = 1, 2, ..., 6.

(b) Do the same, but let ct == ^. 
24 (a) Suppose there is a probability q, where $0 \le q \le 1$, that the equipment will not
have to be replaced again (because it will not be needed). Show how to alter the
formulations for both the finite and infinite horizon models.

(b) Given that the equipment is needed in any Period t, suppose there is a probability
q, where $0 < q \le 1$, that the equipment will not be required in any future periods.
Show how to alter the formulations for both the finite and infinite horizon models.

*25 Suppose that when a new piece of equipment is purchased, the probability that the
equipment breaks down during the jth period of usage is still $p_j$, but that you can know
with certainty in which period a failure actually occurs. Hence you may decide to
keep the equipment anywhere from 1 to j periods.

(a) Formulate a dynamic programming recursion for the finite horizon problem.

(b) Formulate a dynamic programming recursion for the infinite horizon problem
and discuss how to calculate an optimal policy. (Assume the discount factor
satisfies $0 \le \alpha < 1$.)

(c) Given your answer to part (a), apply the recursion to the data in Fig. 17.6,
where $\alpha = 1$; let N = 5. Find an optimal policy for n = 1, 2, ..., 6.

(d) Rework part (c), letting o' = i. 

26 Explain your understanding of the following terms: 

decision rules 

decision tree 

backward induction 

solution characteristics 

safety component (in the Batch Size Model). 


FORMULATION AND COMPUTATIONAL EXERCISES 

In the exercises below, you are asked to formulate models in terms of dynamic 
programming recursions. Be sure to define all the symbols you use, and give the 
appropriate optimization function when there is a single period (stage) remaining. 
Explain how to initiate and when to terminate the calculations. 

27 The Ball Sapphire Company is planning its annual budget for the exploration of sites
at which to mine rare gems. The company has N potential locations. The President
and Chief Geologist, Mr. Ball, estimates that if $d_j$ dollars are spent in digging at
Location j, then there is a probability $p_j(d_j)$ of finding gem stones, and if there is a
successful finding, the return or value of the site will be $V_j$, where $V_j$ is a random variable
with probability distribution $q_j(v_j)$. Formulate a dynamic programming model to
determine an exploration budget not to exceed D dollars that maximizes expected
total return.

28 Ghengis Motor Company (Sec. 10.3). The previous objective function for this example
was to maximize net return from a radio advertising budget. Suppose, instead, that
the company wants most to reach a certain type of customer with its ads. It estimates
that each daytime spot announcement on Station j has a probability $p_j$ of reaching the
desired type of customer, where $p_j$ is assumed independent of the total number of
spots on that station; recall $K_j(y_j)$ is the number of daytime spots aired on Station j
from an allocation of $y_j$ dollars. Similarly, let $q_j$ denote the probability of reaching
this type of customer in prime time, and let $H_j(y_j)$ denote the number of such
prime-time spots obtained on Station j with an allocation of $y_j$ dollars. Define an optimal
allocation to be one that minimizes the probability that the desired type of customer
misses all of the commercials.

(a) Formulate the problem as a mathematical model, and show how a dynamic 
programming approach, analogous to that in Sec. 10.3, can be used to find an 
optimal solution. 

(b) Show how to alter the formulation in part (a) if the objective is to maximize the 
expected number of times such a customer hears the company’s ads. 

29 The Saki Tumi Company is a Japanese manufacturer of portable television sets. It is
designing a high-priced model and wants to ensure maximum reliability. The set
contains N circuits in series, so that a failure in any one of the circuits causes the set
to malfunction. Therefore, the design will include redundant parallel circuitry.
Specifically, let $x_n$ denote the number of parallel elements placed in the nth circuit,
let $p_n(x)$ denote the probability that the nth circuit operates properly during the first
year of usage, given a redundancy level x, and let $c_n(x)$ be the corresponding
manufacturing cost.

(a) Formulate an optimization model that maximizes the first-year reliability of the 
set, subject to the constraint that total manufacturing cost cannot exceed C. 

(b) Show how the model can be solved by dynamic programming. 

(c) Use your answer to part (b) to find an optimal solution for -V= 3, where 
Cn(x) = nx\ C= 15, and />„(x) = 1 - />*, where pi = .08, p^ = .05, p^ = ,1. 

(d) Formulate an optimization model that minimizes cost subject to the constraint
that first-year reliability of the set be at least R. Show the model can be solved by
dynamic programming.

30 Rock Kitt and his crew are planning a space mission to a distant planet. They must
carry N different types of electronic gear, which, even in this day and age, are prone
to failure. Each unit of Equipment i weighs $w_i$ pounds, and there is a total weight
restriction W. The value of W is large enough to permit taking along a limited number
of spares to be put into operation when a piece of equipment fails. Let $p_i(t)$ be the
probability distribution of the number of periods t that a unit of Equipment i operates
before it breaks down (and must be discarded), and let $x_i$ be the number of spares of
Equipment i that Kitt takes along. When the supply of any of the types of equipment
is exhausted (that is, when, for any Equipment i, all $x_i$ have broken down) the mission
has to return to Earth.

(a) Devise a dynamic programming model for selecting the quantities $x_i$ to maximize
the expected time the mission stays on the planet. (Note: all N types of equipment
are operated continuously on the mission.)

(b) Devise a dynamic programming model for selecting the quantities $x_i$ to maximize
the probability that the mission stays on the planet at least T periods.

(c) Explain how to revise the answers to parts (a) and (b) if each piece of Equipment i
costs $c_i$ and there is a total cost restriction C.

31 Newton Dewing has to solve a difficult homework problem for his operations research
course. He knows from past experience that if he works too long on a problem, he goes
“stale.” Hence, he realizes that he may have to make several tries. Let $p_q(t)$ denote
the probability that he solves the problem if he allocates t hours on the qth trial. He
wants to minimize the probability of his failing to solve the problem.

(a) Formulate a dynamic programming model to give him an optimal strategy
assuming that he will spend no more than T hours in total trying to solve the
problem. (Note: he may take as many as T trials, each one lasting one hour.
Also, at the qth trial he cannot have more than T − q + 1 hours left, since each
trial must take at least one hour.)

(b) Show how the formulation simplifies if $p_q(t)$ does not depend on q, that is, if the
probability of solution at each trial depends only on the number of hours t that
is spent and not on how many previous trials there have been.

*(c) Show how the formulation becomes more complicated if the probability of success
at the qth trial also depends on the total number of hours previously allocated to
solving the problem.


32 Professor Frank N. Stein is notorious for his monstrous examinations. This term he is
offering his students the opportunity to take a treacherously planned oral exam. He
will ask a series of questions, each question being progressively more difficult. If the
student correctly answers Question k, where k = 1, 2, ..., K, he can exercise the
option of going on to the next (more difficult) question or terminating the exam and
receiving the grade $G_k$, where $G_k < G_{k+1}$. If he misses Question k, Professor Stein
flunks him. Suppose you are a student in this class, and you estimate $p_k$ to be the
probability of your correctly answering Question k.

(a) Formulate a dynamic programming model that gives you a rule for when to
voluntarily terminate the examination if you have not already flunked; your
objective is to maximize the expected grade you receive.

(b) Show how to alter your formulation in part (a) if Professor Stein looks kindly 
upon you and allows you to make one error before flunking. 


33 The Chuckles Ice Cream Company operates a fleet of small trucks that cruise through
residential districts and sell products directly to consumers. Assume that there are I
products, where i = 1, 2, ..., I, and each unit of Product i requires $c_i$ cubic inches of
space in a truck. The total cubic space available on a truck is C. Consider the route
of a particular truck and assume that the demand $q_i$ for Product i is described by a
continuous uniform distribution, having the density function

$f_i(q_i) = \begin{cases} 1/b_i & \text{for } 0 \le q_i \le b_i \\ 0 & \text{otherwise.} \end{cases}$


Let Xi be the number of units of Product i loaded into the truck; for simplicity, assume 
Xi is a continuous nonnegative variable. Since all the products sell for the same price, 
the company sets as its objective to minimize the expected number of unsatisfied 
demands per trip. 

(a) Formulate the optimization problem for loading the truck. Explain how it can be
solved by dynamic programming.

*(b) Discuss alternative approaches of solution that exploit the fact that the problem
consists of a special type of nonlinear objective function subject to a single linear
constraint in nonnegative variables.

*(c) Apply your proposed method in part (b) to the data I = 4 and

_ 1 _ 5 _ 9 _ _10 

Cl — i ^ 2—3 — 3 . ^ 4—3 

$b_1 = 20 \qquad b_2 = 24 \qquad b_3 = 25 \qquad b_4 = 27$

and C = 5, 15, and 75. 

34 Ivan Tew is a salesman for the Suburban Life Insurance Company. This week he will
make as many as T telephone calls during the evening hours in trying to reach n
potential clients. He estimates that Individual j may buy a policy worth $v_j$. To keep
matters simple, suppose that he decides to allocate up to $x_j$ calls in order to reach
Individual j, where the sum of the $x_j$ cannot exceed T. If he reaches Individual j
before making $x_j$ calls, assume he does not reallocate the unused calls to other potential
clients. Let the probability that he can reach Individual j be $p_j$ for each call he makes
to j, and assume this probability is independent of the previous (unsuccessful) calls
he has made. He wants to select the $x_j$ to minimize the expectation of the estimated
policy values for all those individuals he fails to reach during the week.

(a) Formulate the problem as a mathematical optimization model. Show how a
solution can be found using dynamic programming.

*(b) Show how to formulate the problem if Tew does not select the $x_j$ at the
beginning of the week, but makes his calling decisions one by one (depending on which
potential clients he has not yet reached). Will this formulation yield a lower
expected value than that in part (a)? Explain.

35 The Dolittle Washing Machine Company has been approached by Slips, a large 
chain of retail department stores, to manufacture a specially branded line of washing 
machines for sole distribution by Slips. Dolittle’s annual profit is currently $1 million, 
and Slips’ proposition will bring about an annual contribution to profit of $250 
thousand. But the promotion of these machines by Slips will cut into Dolittle’s own 
sales in such a way that the profit on its current business will drop by 25% with 
probability .1, 30% with probability .6, and 35% with probability .3. 

If Dolittle turns Slips down, then there is a .4 chance that a competitor will accept 
Slips’ offer. In that case, Dolittle stands to lose profit, as already described, unless it 
undertakes a costly advertising campaign or lowers its price. The advertising budget 
will reduce profit by $100 thousand; as a result, the profit on current business will 
only decline by 10% with probability .2, 15% with probability .7, and 20% with 
probability .1. The price cut has the combined effect of keeping the profit reduction 
to 10% with probability .1, 15% with probability .8, and 20% with probability .1, 
provided that competition does not also reduce price. But there is a probability .5 that 
competition meets the price cut, in which case profit will decline by 10% with probability 
.1, 15% with probability .6, and 20% with probability .3. Construct a decision 
tree showing the structure of the problem, and determine an optimal strategy. 
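
The tree in exercise 35 can be evaluated by folding back expected values from the chance nodes. The sketch below is one possible reading, not a definitive solution. It assumes an expected-monetary-value criterion, that "doing nothing" is an allowed response when a competitor accepts, that the price cut carries no cost beyond the stated percentage declines, and that profit stays at $1 million if no competitor accepts. The helper name `expected_profit` is hypothetical.

```python
def expected_profit(base, outcomes, extra=0.0):
    """Expected profit in $ thousands.

    outcomes is a list of (probability, fractional drop in current profit);
    extra is any fixed gain (positive) or cost (negative).
    """
    return sum(prob * base * (1.0 - drop) for prob, drop in outcomes) + extra


BASE = 1000.0  # current annual profit, $ thousands

# Branch 1: accept the Slips proposal.
accept = expected_profit(BASE, [(.1, .25), (.6, .30), (.3, .35)], extra=250.0)

# Branch 2: reject, and a competitor accepts (probability .4).
do_nothing = expected_profit(BASE, [(.1, .25), (.6, .30), (.3, .35)])
advertise = expected_profit(BASE, [(.2, .10), (.7, .15), (.1, .20)], extra=-100.0)
price_cut = (
    0.5 * expected_profit(BASE, [(.1, .10), (.8, .15), (.1, .20)])    # cut not met
    + 0.5 * expected_profit(BASE, [(.1, .10), (.6, .15), (.3, .20)])  # cut is met
)
best_response = max(do_nothing, advertise, price_cut)

# Fold back through the chance node "does a competitor accept?".
reject = 0.6 * BASE + 0.4 * best_response
```

Under these assumptions the two top-level branches come out close together (roughly 940 versus 938, in thousands of dollars), so the recommendation is sensitive to how the price-cut and do-nothing branches are interpreted.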

36 The Trans-Send Television Network has an elaborate approach for developing and 
selecting programs to be viewed over its affiliate TV stations. Each possible program 
goes through one or more stages of idea review, script development, pilot testing, and 
audience preview. (Actually some of these stages are subdivided into component 
stages.) Suppose there are at most T stages, and let ct denote the cost of making the 
tth review. A program is judged a hit if the audience reaction is high, as determined by 
the Neil Downe TV Rating Service; the network stands to gain r dollars in revenue 
from a hit. But if a program turns out to be a bomb, the network earns only s dollars, 
where s < r. Since Trans-Send can always show old movies which earn v dollars, 
where s < v < r, it does not want to air a bomb. 

If the program is going to be a hit, then pt is the probability that the tth stage of 
review judges it so; if the program is going to bomb, then qt is the probability that the 
tth review judges it so. Assume that after the tth review, pt or qt summarizes all the 
information about the show’s likely success. Before the initial review, the probability 
of a hit is p. After each review, Trans-Send can choose to drop the program from 
further development (and show a movie in the time slot), or to go on to the next 
review stage, or to air the show without any further reviewing stages. 

(a) Let T = 4, and draw a decision tree for the review process. 

(b) Devise a dynamic programming formulation for determining a review strategy 
that maximizes expected net return (that is, earnings less total reviewing costs). 

(c) Compute an optimal strategy for s = 50, v = 100, r = 200, p = and 

/"i = y = Y T T 

9 3 15 

?i - io - T “ T = T 

c1 = 1 c2 = 10 c3 = 30 c4 = 5. 

37 The Eurohne Bag Company has made considerable profits on the sale of its major 
product due to a lack of any direct competition. The President of the company 
believes that in all likelihood the Micks Bag Company will announce a highly competitive 
product. To ward off a significant drop in his share of market, the President 
is going to allocate D dollars for advertising during the next T months. He estimates 
that the probability is pt that Micks Bag will enter the market in Month t, if it has 
not already done so by then (assume p^ = \ and that each pt is unaffected by 
Eurohne’s advertising strategy). If the President allocates d dollars in Period t to 
advertising, he can purchase mt(d) “messages.” He feels that if his competitor enters 
the market at the end of Period t and if he has accumulated M advertising messages 
over the first t periods, then his company has a probability qt(M) of maintaining most 
of its present share. Formulate an advertising expenditure strategy that maximizes 
the probability of the Eurohne Bag Company retaining its market position. 


38 Consider the deterministic inventory model, summarized by (1) through (3) in Sec. 17.4. 
Suppose that the production cost function is random: 

C(x) = 0 for x = 0 
C(x) = 13 + cx for x = 1, 2, ..., 5, 

where P[c = 1] = P[c = 3] = 

(a) Assume that the production quantity x must be decided each period prior to knowing 
the value of c. Formulate the appropriate dynamic programming recursion, 
and indicate how it differs from (4) and (5). 

(b) Assume that the production quantity x is decided after learning the value of c. 
Formulate the appropriate dynamic programming recursion. 

(c) Find optimal policies in part (b) for n = 1, 2, ..., 5. 

39 Consider the stochastic inventory model in Sec. 17.4, characterized by the recursion 
(10). Suppose that when one or more units of inventory exist at the end of a period, 
there is a probability ^ that one of these units is damaged beyond use. 

(a) Show how to alter the recursion (10) accordingly. 

*(b) Find an optimal policy for n = 1, 2, 3. 

40 Consider the stochastic inventory model described in Sec. 17.4. Suppose that x̄ 
represents a “target” production level each period, and that if actual production x 
deviates from x̄, a “smoothing cost” w|x − x̄| is incurred, where x̄ is a prespecified 
constant. 

(a) Show how to reformulate (10) to take account of this smoothing cost. 

*(b) Let w = 1 and x̄ = 3. Find an optimal solution for n = 1, 2, ..., 5. 

41 Consider the example of the Elementary Inventory Model with stochastic demands 
in Sec. 17.4. Suppose that the value of demand in Period t influences the value in 
Period t + 1. Specifically, assume that 

P[Dt+1 = 2 | Dt = 2] = y    P[Dt+1 = 4 | Dt = 2] = y 

P[Dt+1 = 2 | Dt = 4] = y    P[Dt+1 = 4 | Dt = 4] = y. 

(a) Write a dynamic programming recursion analogous to (10). 

(b) Find an optimal policy for n = 1, 2, ..., 5, and compare the results to those in 
Fig. 17.2. Assume D0 = 2. 

(c) Rework part (b), assuming that 

P[Dt+1 = 2 | Dt = 2] = y    P[Dt+1 = 4 | Dt = 2] = y 

P[Dt+1 = 2 | Dt = 4] =    P[Dt+1 = 4 | Dt = 4] = y. 

Also compare the results with those in part (b). 

(d) What is the average demand per period over an unbounded horizon in part (b) ? 
In part (c) ? 

42 Equipment Replacement with Uncertain Costs. Consider the example of the Rhode-Bloch 
Trucking Company in Sec. 6.5. Suppose the cost of a new piece of equipment leased 
at the start of Year i and replaced at the start of Year j is a random variable. Specifically, 
let r be a random variable having probability distribution pi(r) in Year i, and 
assume that the cost of initiating rental in Year i and replacing in Year j is given by a 
function cij(r). Also assume that when a replacement occurs in Year i, the value of r 
in Year i is known before the decision is made to next replace at Year j. See parts (a) 
and (b) on p. 730. 

(a) Formulate an appropriate dynamic programming model. 

(b) Suppose that the probability distribution of r in Year i depends on the value of r 
in the previous year; let pi(r | ri−1) designate this probability distribution, where 
at i = 1, the value of r0 is known exactly. Show how to alter the formulation in 
part (a) accordingly. 

43 Shortest-Route Model (Sec. 6.5). Consider the example in Fig. 6.8. Suppose that the arc 
costs are random and that traversing each arc requires one period. Assume that 
during Period t, the arc cost cij, where cij = 0, 1, 2, ..., occurs with probability 
pij,t(cij). Assume that each arc cost each period is completely independent of all the 
other random events. Suppose you can learn the actual value of cij only when the 
system arrives at Node i. The objective is to select a route from Node 8 (source) to 
Node 1 (sink) that minimizes expected total cost. (Note: the entire route is not selected 
at the beginning of the horizon; each period, you select an arc, depending on the 
current node and the arc costs.) 

(a) Formulate a dynamic programming model to find an optimal strategy. 

*(b) Suppose the number of periods to traverse arc (i, j) is random. Specifically, assume 
that during Period t, the arc time dij, where dij = 1, 2, ..., occurs with probability 
qij,t(dij). Suppose you can learn the actual value of dij only after you 
have selected arc (i, j). Show how to alter your formulation in part (a) accordingly. 

44 Stagecoach Problem (Sec. 8.2). Consider the data in Fig. 8.1 and let cij/20 represent the 
probability that Mark Off does not survive the journey from State i to State j; for 
example, the probability of mortal danger in traveling from State 2 to State 6 is 
c26/20 = |. Find a route that maximizes Mark Off’s probability of survival. If you 
were Mark Off, would you take the trip? Explain. 

*45 Assortment Problem (exercise 52 of Chap. 9). Suppose Itsa Steel does not know the exact 
amounts for the demand requirements Dj when it has to select the assortment of 
strengths. The company does, however, learn the demand quantities prior to the 
actual manufacturing of the beams. Let P[Dj = D] = pj(D) be the demand distribution 
for Beam j. Show how to formulate the problem as a dynamic programming 
model. 

46 Consider the study-time allocation problem of Howie Kramms, described in exercises 
23 and 24 of Chap. 10. Suppose the grade point assessments given in those exercises 
occur only with probability |, and that the actual outcomes are one grade point lower 
with probability y. For example, if Howie studies three periods, he receives three grade 
points with probability | and two grade points with probability Assume each 
random event is completely independent of the others. 

(a) Formulate a dynamic programming model and find a solution; assume that 
Howie’s objective is to maximize the expected number of grade points he receives. 

(b) Assume instead, that Howie’s objective is to maximize the probability that he 
receives at least nine grade points. Revise your formulation in part (a) accordingly. 

47 Consider the cattle ranching problem in exercise 32 of Chap. 10. Suppose that the 
revenue received in Year n is Rn(yn | p), where p is a random variable representing 
a general price level, and qn(p) designates the probability distribution of p in Year n. 
Also assume that the herd not sent to market increases in size by f-fold by the beginning 
of the following year, where q(f) denotes the associated probability distribution 
for f. Formulate a dynamic programming recursion to determine how many cattle to 
send to market in each of the periods; assume that 

(a) The value of p is not known until after yn is chosen. 

(b) The value of p is known prior to choosing yn. 

48 Capacity Expansion Problem (exercise 33 of Chap. 10). Suppose each capacity requirement 
Rt is random, with probability distribution pt(R). Assume that ht(C, R) is also 
defined for R > C, and in that case includes a penalty cost for having too little capacity. 
Formulate the expansion model in terms of a dynamic programming recursion; 
assume that 

(a) The value of Rt is not known until after xt is chosen. 

(b) The value of Rt is known before xt is chosen. 

49 ARKA Mutual Fund Problem (exercise 34 of Chap. 10). Suppose ARKA’s gift of perfect 
prophecy is nullified, and instead, ARKA knows only the probability distribution qt(p) 
for the security price in a future Period t. Assume that when Period t occurs, the actual 
value of pt becomes known before xt is chosen. 

(a) Formulate the problem in terms of a dynamic programming recursion, and 
assume that ARKA wants to maximize the expected total amount of cash it can 
accumulate by the end of the horizon. 

*(b) Show how to alter the formulation in part (a) if the price in Period t depends 
on the price in the previous period, and the associated probability distribution is 
denoted by qt(p | pt−1). 

50 Consider the case of the Feedem-Speedem Airline Company, described in exercise 32 
of Chap. 2. Let the flight service requirement in Month t be Dt, where Dt has a probability 
distribution pt(Dt). Assume that Dt becomes known at the beginning of each 
month, and that the company must pay a penalty cost C for each stewardess-flight-hour 
that is required in excess of the time actually available for flight service during 
that month. Show how to find an optimal hiring and training schedule by means of 
dynamic programming. 

51 Ina Rush is a busy lawyer with a passion for peanut brittle. Each day she passes a 
crowded candy store and faces the “moment of truth” as to whether to enter. Addicted 
to rationality as well as sweets, she estimates that each minute of her time is worth c; 
so if she waits w minutes in line, the associated waiting cost is cw. The candy, however, 
is worth r (measured in the same units as cw). She has carefully observed that if there 
are n customers in the store ahead of her, then there is a probability pn(s) that s of them 
will be served each minute (for the sake of simplicity, assume the random events take 
place in intervals of a minute and are independent in successive minutes). She limits 
the time she is willing to wait until being served to T minutes, and is psychologically 
prepared to leave the store if she finds the service is too slow. 

(a) Formulate a dynamic programming model to find an optimal policy for entering 
or leaving the store that maximizes Miss Rush’s expected value. 

(b) Explain whether the formulation changes if Miss Rush only leaves the store after 
she is served or when the T minutes have elapsed. 

52 Each year Willie B. Poore allocates funds between personal consumption and investment 
in either common stocks or bonds. Let Ut(c) represent the value, or utility, to 
Poore in Year t of spending c dollars on consumption. Let rs denote the return per 
dollar invested in stocks (that is, a dollar put into stocks at the beginning of the 
year yields rs dollars at the beginning of the next year). Similarly, let rb denote the 
return per dollar invested in bonds. Let pt(rs) and qt(rb) be the associated probability 
distributions of the returns in Period t. (Assume these distributions are discrete, and 
rs ≥ 0 and rb ≥ 0.) Suppose Poore’s horizon is T years and he starts with M dollars. 
He wants to maximize the expected sum of his consumption utilities over the entire 
planning horizon; assume cash after Period T is of no value to him. 

(a) Exhibit a dynamic programming model that finds an optimal consumption and 
investment policy. 

(b) Discuss how the formulation is changed if he revises his probability estimates each 
year on the basis of the previous year’s return performance in stocks and bonds. 

(c) Show how to alter the formulation if Poore does not have a fixed horizon T, 
but instead estimates that if he is still living in Year t, he will survive another year 
with probability /(, where = 0 for t > T. [Make the beneficent assumption 
that if Poore dies during Period t, he will have already enjoyed Ut(c).] 

53 James Stock, Agent 0007, is assigned to make photocopies of documents in the secret 
files of The Unfriendly Country. There are two such files, situated in separate locations. 
File X has M documents to be copied and File Y has N documents. Agent 0007 
figures that he can make at most T tries at photographing the documents. For each 
picture-taking session, he believes the probability is px that he will be caught when 
rifling through File X, and similarly py for File Y. Assume that if Stock gets caught, 
it means the demise of his life and camera. If he selects File X on his tth trial and it 
contains m documents remaining to be copied, he estimates that he can photograph 
g(m); similarly, if he selects File Y and it has n documents remaining, he can photograph 
h(n). 

(a) Construct a dynamic programming model to devise a strategy that maximizes the 
expected number of documents photographed. 

*(b) Suppose the probability is qx for File X and qy for File Y that he hears a guard 
approaching while he is rifling a file. When this happens, he is able to slip away 
but must forego taking photographs during this attempt. Show how to revise your 
formulation in part (a) accordingly. (Assume that hearing a guard and being 
caught are mutually exclusive events and px + qx < 1, and py + qy < 1.) 

54 Hedda Gambler is a bettin’ woman and unlucky only in love. On each of the next N 
days, she has the opportunity to wager money on probabilistic outcomes. Assume she 
begins with I dollars and each day can bet any sum up to the amount she has at the 
start of the day. If she bets ^„ dollars on Day n, the probability is pn of increasing her 
capital by the same amount and the probability is 1 − pn of losing that amount. She 
wants to maximize the probability that she has S dollars at the end of the N days. 

(a) Formulate a dynamic programming recursion to find an optimal betting level 
each day, given the amount of cash at the start of the day. 

(b) Suppose the situation is altered so that if she bets ^„ dollars on Day n, she has 
probability } of increasing her capital by (Spn — 1 )^„, where pn is the same 
quantity as above, and probability J of losing Show how to revise the formulation 
in part (a) accordingly. 

(c) Assume N = 3, I = 1, = f, ^2 = l> Pi = 4- Find optimal policies in parts 
(a) and (b) for S = 3. For S = 6. Draw the corresponding decision trees. 

(d) Given the probabilistic assumptions in parts (a) and (b), show formulations 
appropriate to the objective of maximizing the expected amount of cash at the 
end of Day N. 

(e) Apply the formulations in part (d) to the data in part (c) and find optimal policies. 
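
A recursion of the kind asked for in part (a) of exercise 54 can be sketched as follows. This is only an illustration under stated assumptions: bets are whole dollars, "has S dollars" is read as "has at least S dollars", and the name `best_win_probability` and the probabilities in the usage note are hypothetical, not the exercise's data.

```python
from functools import lru_cache


def best_win_probability(win_probs, start_cash, target):
    """Maximum probability of ending with at least `target` dollars.

    win_probs[k] is the chance of winning the bet placed on day k + 1.
    """
    n_days = len(win_probs)

    @lru_cache(maxsize=None)
    def f(day, cash):
        # Terminal condition: all betting days are over.
        if day == n_days:
            return 1.0 if cash >= target else 0.0
        p = win_probs[day]
        # Try every whole-dollar bet from 0 up to the cash on hand.
        return max(
            p * f(day + 1, cash + bet) + (1.0 - p) * f(day + 1, cash - bet)
            for bet in range(cash + 1)
        )

    return f(0, start_cash)
```

With hypothetical win probabilities of 0.6 on each of three days, one dollar to start, and a target of three dollars, `best_win_probability((0.6, 0.6, 0.6), 1, 3)` evaluates the recursion from the last day backward. Replacing the terminal condition by the cash itself turns the same recursion into the expected-cash objective of part (d).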

55 Cash Management Problem. New York bankers always look forward to meeting their 
friends at the Bankers Tryst, a long established annual convention in the Poconos. 
This year members of the Chaste National Bank have proposed the following service- 
charge structure for corporate bank accounts. Each time a corporation makes a direct 
deposit or withdrawal of cash from its account, the bank will charge D or W, respectively. 
These cash transactions must occur at the start of the day. During the day, the 
balance of the corporation’s account fluctuates due to deposited checks from creditors 
and due to other companies cashing the checks the corporation has written. If the 
balance falls below 0, the bank will automatically lend the customer the additional 
amount at the interest charge of r per dollar per day. 

Suppose you are a Chaste customer and assess that your “opportunity cost” is s per 
dollar per day (that is, you can earn s on each dollar invested elsewhere). Assume that s, 
like r, is assessed on the bank balance at the end of the day. Let q be the amount 
by which the level of your account fluctuates during a day, and let pt(q) be the probability 
distribution of q on the tth day. Note that q may be either positive or negative. 

(a) Formulate a finite horizon dynamic programming model to determine the optimal 
amount of cash to deposit or withdraw at the start of each day. 

(b) Show how your formulation changes if the transaction service and interest charges 
are deducted from your account. 

56 An end-of-the-year sale on late model automobiles is being held by the Wheeler 
Dealer Agency. The firm has N autos it wants to sell in the next T days. The agency 
owner estimates that if he has n autos on Day t and sets the price at r, he can sell s 
autos that day with probability pt(s | r), where s = 0, 1, ..., n. If he has any cars 
left at the end of Day T, he realizes a value v on each. Formulate a dynamic programming 
model that sets price each day so as to maximize Wheeler’s expected return on 
the N autos. 
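
A backward recursion for exercise 56 might look like the sketch below, where the state is (day, autos on hand) and the decision is the day's price. It is an illustration only: it assumes a finite list of candidate prices, and it passes the stock on hand to the sales distribution explicitly. The names `best_pricing` and `toy_sale_prob`, and the toy demand model, are hypothetical.

```python
from functools import lru_cache


def best_pricing(n_autos, n_days, prices, sale_prob, salvage):
    """Return (expected return, best first-day price).

    sale_prob(day, s, stock, price) is the probability of selling s autos
    on `day` when `stock` autos are on hand and the price is `price`.
    """

    @lru_cache(maxsize=None)
    def f(day, stock):
        if day > n_days:
            # Leftover autos are worth `salvage` each after Day T.
            return salvage * stock, None
        best_value, best_price = None, None
        for r in prices:
            value = sum(
                sale_prob(day, s, stock, r) * (r * s + f(day + 1, stock - s)[0])
                for s in range(stock + 1)
            )
            if best_value is None or value > best_value:
                best_value, best_price = value, r
        return best_value, best_price

    return f(1, n_autos)


def toy_sale_prob(day, s, stock, price):
    # Hypothetical demand: at most one auto sells per day; a price of 8
    # always sells one, a price of 10 sells one half of the time.
    if stock == 0:
        return 1.0 if s == 0 else 0.0
    p_sale = 1.0 if price <= 8 else 0.5
    if s == 1:
        return p_sale
    if s == 0:
        return 1.0 - p_sale
    return 0.0
```

Calling `best_pricing(1, 2, (8, 10), toy_sale_prob, 5.0)` shows the typical pattern: with two days left it pays to try the high price first, because the low price remains available as a fallback on the last day.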

57 The Plutocratic Party, a national political organization, is trying to gain support for 
its Presidential candidate. It intends to spend as much as S dollars in newspaper 
advertisements over the next T weeks to raise campaign funds. If it has raised r dollars 
by the beginning of Week t, then pt(u | r, st) is the probability that it will raise u dollars 
during the tth week, given that it spends st on advertising. The Party wants to advertise 
in such a way as to maximize the expected number of dollars raised (inclusive of 
unspent advertising funds). 

(a) Show how an optimal strategy for the amount of advertising to be done each week 
can be found using dynamic programming. 

(b) Show how to alter the formulation in part (a) if S dollars are available at the 
beginning of Week 1 and the Party is willing to spend any additional funds it 
raises during the ensuing weeks to finance further advertising. 

58 The Buy-and-Bye Food Store gives its customers excellent service by keeping open 
sufficient checkout stands to take care of the afternoon rush hour surge of customers. 
The manager has gathered data on the number of customers that enter the store in 
each period (15 minutes) as well as on the number of customers m served during a 
period, which depends on both the number of customers in the store and the number 
of checkers s. Let pt(n) denote the probability that n customers enter the store at 
Period t (for simplicity, assume they enter at the start of the period). Let qt(m | n, s) 
be the probability that m customers are served during Period t, given that n are in the 
store at the beginning of the period and s checkout stands are open. Let w be the cost 
per period per checkout stand open. The manager makes a rough estimate of a 
commensurate cost due to customers waiting for checkout. Assume that this cost is 
per customer remaining in the store at the end of Period t. 

(a) Formulate a dynamic programming model to determine an optimal policy during 
the rush hours, which consist of T periods. 

(b) Show how to alter your formulation in part (a) if there is an added cost K each 
time a checkout stand is opened up (hence, the cost is pK if p stands are opened 
up in a period). 


MIND-EXPANDING EXERCISES 

59 A beverage manufacturer is planning to introduce a new instant drink called Nicetea. 
He estimates that if the product is successful, then the company will make a profit of R 
(this figure actually represents a present value over a horizon of several years). He 
currently assesses the likelihood of this event to be p. If the product fails, the company 
stands to lose L, due to expenditures on equipment, packaging, and initial advertising. 
The manufacturer can do some preliminary market testing before spending L, and 
thereby get a better idea as to whether to introduce the product. Assume, for simplicity, 
that there are at most two market areas in which to test Nicetea, and that he does 
not have to decide to make a test in the second market area until he sees the outcome 
in the first. If Nicetea is going to be successful, the probability is p1 that the first market 
deems it such, and q1 that the first market deems it a failure. Similarly, if the product 
is going to be successful, the probability is ps2 that both the first and second markets 
deem it such, and pf2 that only the second market deems it such. If the product is 
going to fail, the probability is qf2 that the first and second markets deem it such, and 
qs2 that only the second market deems it such. Let the first market test cost C1 and the 
second market test cost C2 (assume these amounts do not depend on the test outcomes). 

(a) Draw a decision tree for this problem. (Note: the manufacturer can decide to 
introduce or drop Nicetea at any point, and he only incurs L when he introduces 
the product.) 

(b) Explain how to find an optimal strategy. Use the following data to illustrate your 
procedure: R = 100, L = 50, C1 = 5, C2 = 10, p1 = .5, q1 = .75, ps2 = .8, pf2 = .5, 
qf2 = .9, qs2 = .6. 

(c) How does the procedure in part (b) simplify (if at all) when ps2 = pf2 and qf2 = qs2? 
Give a verbal interpretation of this assumption. Illustrate your answers 
using the data in part (b), except let pf2 = .8 and qs2 = .9. 

60 Secretary Problem (Sec. 1.6) 

(a) Formulate this example in terms of a dynamic programming recursion. Assume 
that the executive wishes to see no more than n girls (in Sec. 1.6, we let n = 3). 
Show how to alter the formulation if there is a cost c per interview. 

(b) Assume the executive does not limit the number of girls he is willing to interview. 
What strategy would you suggest that he follow if c = 0? 

(c) Assume the executive does not limit the number of girls he is willing to interview 
and c > 0. Also assume he applies a one-period discount factor α, where 0 ≤ α 
≤ 1. Write a set of extremal equations to determine an optimal strategy. (Hint: 
let the states represent his having interviewed a terrific, a fair, or a good girl, and 
let the decisions be to hire or to make another interview.) 

(d) Write primal and dual linear programming characterizations of the formulation 
in part (c). 

(e) Use your answer in part (d) to find an optimal solution with the data in the 
example of Sec. 1.6, c = .15, and α = \. 

(f) Suppose that the executive has the option of “postponing” his decision about a 
girl so that he can make additional interviews and subsequently decide to hire 
a girl whom he saw earlier in the interviewing process. Under what circumstances 
might he want to exercise this option? Show how to alter the formulation in part 
(c) to allow for the added option. Answer parts (d) and (e) given this revised 
formulation. 

61 Parking-Place Problem. You are driving with your date to a movie and want to decide 
on an optimal strategy for parking your car (prior to seeing the movie). You can 
always put your car in a parking lot that charges B, which represents a lot of “bread” 
to you. Or you can try to park on the street. But if you park too far away from the 
movie, your date will think you are a piker, and probably inept in solving other 
parking problems. Suppose there are N parking slots on the street on each side of 
the movie house; for purposes of analysis, assume these are numbered −N, −N + 1, 
−N + 2, ..., −1, 0, 1, ..., N − 2, N − 1, N. (Place 0 is in front of the movie and 
not available for parking.) You estimate that the “piker factor” loss is |n|, where n 
is the number of the slot where you decide to park. Let pn, for n = −N, ..., −1, 1, 
..., N, be the probability that slot n is vacant. 

(a) Formulate a dynamic programming model that yields an optimal strategy. 
For simplicity, assume you only see one place at a time. (Hint: let the states of 
the system refer to the parking slot number and whether or not it is empty.) 

(b) Write in detail primal and dual linear programming characterizations of your 
answer in part (a). Assume N = 3. 

*62 Consider the inventory model with perfect forecasts of fluctuating demand that is 
described in exercise 12. Suppose that the forecasting procedure yields perfect knowledge 
of the amounts for both the present and following periods’ demands. What is the 
most the company would be willing to pay for such a perfect forecast if the horizon 
n = 2? If n = 3? Assume initial inventory at the start of the horizon equals 0. 

63 Warehouse Problem (exercises 36 of Chap. 5 and 49 of Chap. 8). Suppose the selling 
price and purchase cost are random variables. Specifically, consider the random 
variable r that has probability distribution qt(r) in Period t, where r = 1, 2, ..., 10. 
Then assume that the selling price and purchase cost in Period t are related to r by 
the functions pt(r) and ct(r), and that these values are known at the start of Period t, 
prior to deciding on the number of gallons to be sold or purchased. Formulate a 
dynamic programming model that yields a strategy maximizing expected profit. 

64 Astronaut Problem. Not wanting to make mercurial decisions, an astronaut, C. U. 
Rowan, has turned to operations research for guidance. His mission plan calls for N 
orbits around the Earth. At the beginning of each orbit, he must judge his spacecraft 
to be in States 1, 2, or 3, corresponding to AOK (all systems “go”), minor malfunctioning, 
and major malfunctioning. In State 1, Rowan always elects to make another 
orbit if he has not completed his mission, and in State 3, Rowan must terminate the 
mission. In State 2, he can elect to go another orbit or to terminate. Let pn(j | i) denote 
the probability that if the astronaut makes the nth orbit and the state of his craft is i, 
where i = 1, 2, then the state of his craft at the next orbit is j, where j > i. Let j = 4 
refer to the disastrous event that the mission has to be terminated immediately due 
to an emergency condition, and let pn(4 | i), for i = 1, 2, be the associated probability. 
If the astronaut terminates the mission after completing n orbits successfully, the 
value of the mission is vn, where But if he must terminate during the nth 
orbit because of an emergency condition, the value is only wn, where wn < vn. 

(a) Formulate a dynamic programming model that yields an optimal strategy. 

(b) Write in detail primal and dual linear programming characterizations of your 
answer in part (a). Assume N = 4. 

65 Each month the Bank of Hong Kong determines the amounts of funds it needs to 
borrow on the world money markets in order to finance the loan demands of its own 
customers. For simplicity, assume that the total requirements for Month t are Dt. 
Let the actual value for Dt be given according to a probability distribution pt(Dt), but 
suppose that the value of Dt is known at the start of Period t, prior to making any 
decisions in that period. Each month the Bank may decide to borrow funds for one, 
two, or three months, and denote these quantities as x1, x2, x3. Since Dt is to be 
interpreted as a “capacity” requirement, the amount x1 + x2 + x3 can be used, in 
part, to meet the requirement for Dt, and similarly, x2 + x3 for Dt+1 and x3 for Dt+2. 
Suppose that the cost of borrowing xj in Period t depends on a parameter r and is 
given by the function ctj(xj | r), where r has a probability distribution qt(r), and the 
exact value of r is known at the start of Period t, prior to making any decisions in that 
period. Assume that all the random elements are completely independent of each other. 

(a) Formulate a dynamic programming model that yields a minimum total expected 
discounted cost subject to the stipulation that all loan requirements are met. 
(Write a dynamic programming recursion for a general Period t, letting α denote 
the one-month discount factor.) 

(b) Show how the formulation in part (a) simplifies (if at all) when the requirement 
Dt is known after making the Period t decisions; assume that if Dt exceeds the 
supply of available funds, there is a loss of c per unit. 

66 The Big Cheese Corporation wants to hire a vice-president to head its newly organized 
Limburger Division, and has contracted the service of the Dupry Company, an 
executive placement firm. This company conducts a series of tests and interviews of 
each applicant who seems promising. Dupry will charge Big Cheese a cost cj for administering 
Test (or Interview) j; past experience has shown that there is a probability pj 
that an applicant passes the test. (Assume, for simplicity, that Dupry has cleverly 
designed its testing procedures so that an individual’s performances on the various 
tests are completely independent.) Big Cheese will accept only an applicant that has 
passed all J tests; therefore, if the candidate fails a test, he is immediately disqualified. 
Formulate an approach for finding a sequence of tests to minimize expected total cost 
per applicant considered. 

67 Cary Silver, the President of a multibranch bank, sent a confidential letter by mistake 
to one of the bank’s vice-presidents, of which there are $N$. To Silver’s embarrassment,
he does not know which VP has received the letter, and wants to find out by telephone. 
He assigns probability pj to having sent the letter to Vice-President j; the probability 
is that Vice-President j will be away from his desk when Silver calls, and so Silver 
will not know whether that VP received the letter. Devise a dynamic programming 
formulation to indicate the telephone calling strategy that minimizes the expected 
number of calls Silver must make to find the letter. (Hint: let the state at the nth call
represent his probabilities that each VP has the letter, given the knowledge obtained 
from the previous telephone calls.) 

68 Optimal Batch Size Problem. The example of the Voltex Company in Sec. 17.5 can be
described as a single-stage production process. Batch size problems frequently arise in 
multistage processes. Consider, for example, a manufacturing sequence composed of 
four stages. The first stage is like that in Sec. 17.5, namely, a decision must be made 
to select the initial batch size x in an attempt to produce n good items by the end of the 
final stage. Some items that are produced at this stage do not meet standards and are 
scrapped. Assume every good item is then transmitted to the second stage. Here the 
production quantity cannot exceed the number of good items entering, but it can be 
less and the resultant excess can be sold. (If the number of good items entering is less 
than n, all of them are processed at this stage.) Again, some of the items processed 
at the second stage may not meet standards and are scrapped; the rest are transmitted 
to the third stage. A similar decision structure exists at the third and fourth stages. 
If fewer than n good items have been manufactured at the end of the fourth stage, the 
process starts all over again in an effort to fill the remaining requirement (which typically
is smaller than n).

For Stage $t$, where $t = 1, 2, 3, 4$, let $c_t(x)$ denote the cost of processing $x$ units, let
$p_{tx}(j)$ denote the probability that $j$ out of a batch of $x$ components fail to meet
standards, let $s_t$ be the scrap value of an item below standard, and $v_t$ the disposal
value of an item sold.

(a) Write a dynamic programming formulation of the model. (Hint: let the state
variable at the first stage be the number of good items required, and at subsequent 
stages be the number of good items required along with the number of items that 
met the standards from the previous stage.) 

(b) Let the requirement $n = 2$ and assume that a batch size at Stage 1 cannot exceed
3. Write in detail the formulation in part (a). Then exhibit equivalent primal
and dual linear programming models for finding an optimal policy. Also state
the numbers of equations and variables in these models when the limit at Stage 1
is 4, and when the requirement $n = 3$ and the limit at Stage 1 is 4.

CHAPTER 18


Dynamic Programming 
in Markov Chains 


18.1 INTRODUCTION 

All the examples in the preceding chapter can be viewed as special cases of 
a more general model, sometimes referred to as a Markovian decision 
process. Taking such a general point of view is insightful, for it helps you 
establish several important dynamic properties that all these models share. 
For example, you will discover sufficient conditions ensuring the existence of a 
stationary strategy that is optimal over an unbounded horizon. You will 
also learn when you can advantageously employ methods of successive approximation
and linear programming to find numerical solutions to those problems for
which the optimal policy has no special form that can be exploited. 

This chapter builds on many of the ideas you learned in Chaps. 11, 12, and 17. 
In particular, we use the concepts of 

• present value explained in Sec. 11.2 

• equivalent average return explained in Sec. 11.2 

• extremal equations developed in Secs. 12.1 and 12.3 

• value iteration algorithm contained in Sec. 12.2 

• policy iteration algorithm contained in Secs. 12.2 and 12.3

• decision rule or strategy illustrated in Chap. 17. 

Although the explanations to follow are self-contained, we do not repeat all the
motivational discussion for the above ideas that we provided when they were
first introduced in earlier chapters. Consequently, we recommend your spending
a few minutes reviewing these notions before going on. The emphasis in this chapter
is on showing how the previously explained dynamic programming formulations
and solution techniques are modified to account for probabilistic elements.
Therefore most of the motivational discussion here focuses on the impact of
uncertainty.

Throughout the chapter we maintain the convention that the objective function
is to minimize an expected value. Depending on the context of the illustration, we
will refer to the objective function as representing cost, present value, or economic
return. But to repeat, in every dynamic programming formulation below, the
sense of the optimization is minimization. Only simple modifications in the formulas
are needed to treat maximization problems. (Of course, by changing the
sign of the objective function in a maximizing problem, you can find an optimal
solution using a minimization algorithm.)

18.2 STOCHASTIC SHORTEST-ROUTE MODEL 

While studying deterministic dynamic programming models, you saw in Chaps. 
10 and 12 that it is convenient to view such problems in terms of a network. 
Recall that the network for a general finite horizon problem consists of nodes 
representing the system’s being in State s and Stage n, and arcs indicating the 
results of decisions. The network is acyclic, with the terminal node representing 
the initial state of the system. The network for a general unbounded horizon is 
similarly constructed, except that the node designation no longer includes a stage 
number and only refers to a state. 

For the finite horizon case, the recursive relations expressing the appropriate 
optimization problem are 

(1) $y_i = \min_{(i,j)\ \text{in network}} \{\alpha y_j + c_{ij}\}$ for all $i \ne r$ and $y_r = 0$ (finite horizon),

where $r$ is the terminal node and $\alpha$ is the one-period discount factor. For a finite
horizon, you can let $\alpha$ have any value in the interval $0 \le \alpha \le 1$.

You can similarly view finite horizon (or finite stage) and finite state stochastic
dynamic programming models, like those in Chap. 17, in terms of a probabilistic
shortest-route model. To do so, three straightforward generalizations of the
previous deterministic shortest-route models are required. The first accounts for
the fact that now a decision $d$ at Node $i$ can result in the system moving to one of
several possible nodes. Let

(2) $p(j \mid i, d)$ = conditional probability that the state of the system becomes
Node $j$ given that, at Node $i$, you make decision $d$,

where, for a given $i$ and $d$, the sum of the $p(j \mid i, d)$ over all possible $j$ equals 1.

Actually, (2) embodies a so-called Markov property assumption. Specifically,
we postulate that each transition probability in (2) is influenced only by
the values of $i$ and $d$ and not by the specific history of the system (previous states
and decisions) prior to arriving at Node $i$.

The second generalization recognizes that the return or cost associated with 
making the current decision d at Node i can also be a random variable. Let 

(3) $c_{id}$ = expected return in the current period from making decision $d$ at Node $i$.

If you let $c(j \mid i, d)$ denote the actual return in the current period from making
decision $d$ at Node $i$, when, as a consequence, the state of the system at the next
period is Node $j$, then

(4) $c_{id} = \sum_j c(j \mid i, d)\, p(j \mid i, d)$,

where the sum is over all possible $j$.

The third modification postulates a generalized acyclic property. Assume
that the nodes can be ordered such that the terminal node is $r = 0$, and for all
$i$, $j$, and $d$,

(5) $p(j \mid i, d) = 0$ if $j \ge i$ (acyclic assumption).

In other words, it is impossible to go from Node i to a higher-numbered Node j. 
A consequence of (5) is that no node will be visited more than once, if at all, 
in any realized history of the system. As you will see in the inventory model example
below, property (5) often follows immediately from the underlying structure
of the specific problem being analyzed. 

Then the stochastic generalization of the recursion (1) is 

(6) $y_i = \min_{d \in D(i)} \left[ \sum_{j=0}^{i-1} p(j \mid i, d)\, \alpha y_j + c_{id} \right]$ for $i > 0$ and $y_0 = 0$

(stochastic shortest route),

where $D(i)$ is the set of possible decisions at Node $i$. Thus $y_i$ now represents the
minimum expected present value of reaching the terminal Node 0 when starting
at Node $i$. Compare (6) with (1). What is the most important difference?

Observe that you can calculate the values in (6) in the order $y_1, y_2, y_3, \ldots$.

It is because of the acyclic assumption in (5) that it is always possible to solve
(6), starting the computations with the lowest-numbered node and progressing
to successively higher-numbered nodes. The procedure is almost identical with
that for finding a shortest route in a deterministic acyclic network (discussed
in Sec. 7.7), the only difference here being the necessity to compute the summation
indicated on the right of (6).

Inventory model. To see how the above model (6) is applied, consider
once again the Dependable Manufacturing Company’s problem. The data for
the stochastic version of this problem (introduced in Chap. 17) are

demand: $P[D = 2] = \tfrac{1}{2}$, $P[D = 4] = \tfrac{1}{2}$

constraints: production $x \le 5$ and ending inventory $j \le 4$
production and inventory costs: $C(x) + h \cdot j$,
where production $x$ and ending inventory $j$ are nonnegative integers and
C(0) = 0   C(1) = 15   C(2) = 17
C(3) = 19   C(4) = 21   C(5) = 23   and $h = 1$

for all periods.

These data led to the recursion (in Sec. 17.4) for $n = 2, 3, \ldots$,

(7) $f_n(i) = \min_x \{ C(x) + 1 \cdot [i + x - 3] + \tfrac{1}{2} [ f_{n-1}(i + x - 2) + f_{n-1}(i + x - 4) ] \}$,

where $i = 0, 1, \ldots, 4$, and the minimization is over only nonnegative integer
values in the range $4 - i \le x \le \min(5, 6 - i)$.

You can establish the correspondence between (7) and a network of the type
assumed for (6) as follows. Let Node $5n + i$ designate the state of the system when
there are $n$ periods remaining, for $n = 1, 2, \ldots$, and entering inventory is at level
$i$, for $i = 0, 1, \ldots, 4$; let Node 0 be the terminal state. (Observe that each node
number corresponds to a unique pair of values for $n$ and $i$; for example, Node 36
corresponds to $n = 7$ and $i = 1$.)

Suppose entering inventory is at level $i$ when there are $n$ periods remaining
and the production decision is $x$. Then when there are $n - 1$ periods remaining,
entering inventory will be either $(i + x - 2)$ for demand $D = 2$, or $(i + x - 4)$
for demand $D = 4$, each possibility occurring with probability $\tfrac{1}{2}$. Consequently,
verify that the only positive transition probabilities are

(8) $p[5(n-1) + i + x - 2 \mid 5n + i, x] = p[5(n-1) + i + x - 4 \mid 5n + i, x] = \tfrac{1}{2}$

for $n \ge 2$. When $n = 1$, the optimal decision is

(9) $x = 4 - i$ for $n = 1$,

so that

(10) $p[0 \mid 5 + i, 4 - i] = 1$ for all $i \le 4$ and for $n = 1$.

Verify that (8) and (10) satisfy the acyclic assumption (5).

For the inventory problem, assume that $\alpha = 1$. As you saw in (7), the expected
value for the decision to produce $x$ when entering inventory is $i$ and there are $n$
periods remaining is

(11) $c_{5n+i,\,x} = C(x) + 1 \cdot [i + x - 3]$.

As a consequence, the correspondence between (7) through (11) and the solution
to the recursion (6) is simply

(12) $y_{5n+i} = f_n(i)$ for $n \ge 1$ and $0 \le i \le 4$

(and $y_0 = 0$). Thus the $y_j$, for $j = 1, 2, \ldots$, can be determined in the very same
fashion as the calculations yielding $f_n(i)$, shown in Fig. 17.2.
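
The node numbering and the recursion (6) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the text's own procedure: the function names `build_network` and `solve` are hypothetical, the discount factor is taken as 1, and with one period remaining the only decision offered is x = 4 - i, leading to Node 0 with probability 1.

```python
# Production cost C(x) for x = 0..5, as given in the inventory data.
C = [0, 15, 17, 19, 21, 23]

def build_network(periods):
    """Return {node: {decision: (expected cost, [(next node, prob), ...])}}.

    Node 5n + i means n periods remain and entering inventory is i.
    """
    net = {}
    for n in range(1, periods + 1):
        for i in range(5):
            node = 5 * n + i
            if n == 1:
                x = 4 - i  # last period: produce up to level 4, then stop
                net[node] = {x: (C[x] + (i + x - 3), [(0, 1.0)])}
                continue
            arcs = {}
            for x in range(4 - i, min(5, 6 - i) + 1):
                nxt = [(5 * (n - 1) + i + x - 2, 0.5),   # demand D = 2
                       (5 * (n - 1) + i + x - 4, 0.5)]   # demand D = 4
                arcs[x] = (C[x] + (i + x - 3), nxt)
            net[node] = arcs
    return net

def solve(net, alpha=1.0):
    """Evaluate recursion (6) from the lowest-numbered node upward."""
    y = {0: 0.0}      # terminal node
    best = {}
    for node in sorted(net):
        options = {d: c + alpha * sum(p * y[j] for j, p in nxt)
                   for d, (c, nxt) in net[node].items()}
        best[node] = min(options, key=options.get)
        y[node] = options[best[node]]
    return y, best

y, best = solve(build_network(2))
print(y[10], best[10])   # two periods remaining, entering inventory 0
```

Because every arc leads to a lower-numbered node, a single pass in increasing node order suffices; no iteration is needed.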

In Fig. 18.1 you see a diagram associated with the policy in (9) for n = 1 and 
the stationary production policy: 

(13) produce r<.r.k2. 

Notice the production decision at any node for $n \ge 2$ has two arcs associated with
it, one for each possible probabilistic occurrence.

Figure 18.1. Stochastic Network: Inventory Example (Stationary Policy).




►Linear programming version. From a computational point of view, there
is little to be gained by converting the recursion (6) to an equivalent linear programming
model. The only possible advantages in this regard are the widespread availability of
computer algorithms for linear programming, and the accompanying facility for performing
sensitivity testing. However, such a conversion will further prepare you for the
analysis of the unbounded horizon model in the next three sections, and what is more,
it will show you how to impose chance-constraints on this type of model.

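Suppose the nodes are numbered $0, 1, 2, \ldots, T$. The value for $y_T$ can be found by
solving

(i) maximize $y_T$

subject to

(ii) $y_i - \sum_{j=1}^{i-1} p(j \mid i, d)\, \alpha y_j \le c_{id}$ for all possible $i$ and $d$,

where all the $y_i$ are unconstrained in sign, and $y_0$ has been suppressed since $y_0 = 0$.
The corresponding dual problem is

(iii) minimize $\sum_{i=1}^{T} \sum_{d \in D(i)} c_{id}\, x_{id}$

subject to

(iv) $\sum_{d \in D(j)} x_{jd} - \sum_{i=j+1}^{T} \sum_{d \in D(i)} \alpha\, p(j \mid i, d)\, x_{id} = 0$ for $j = 1, 2, \ldots, T - 1$

(v) $\sum_{d \in D(T)} x_{Td} = 1$

(vi) all $x_{id} \ge 0$.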









An illustration for $T = 3$ and $d = 1, 2$ is shown in Fig. 18.2.

Figure 18.2. Linear Programming Versions of Stochastic Shortest Route
Model (Acyclic).


It can be proved that there always exists an optimal solution to (iii) through (vi)
such that at most one $x_{id}$ is positive for each $i$; such a solution corresponds to an
optimal policy determined by (6). The summation in (iii) is the associated expected
discounted cost over the finite horizon when you start at Node $T$ and use an optimal
policy.

When $\alpha = 1$, then $x_{id}$ can be interpreted as the unconditional probability that in an
actual realization of the decision process, the system will arrive at Node $i$ and decision $d$
will be taken. In this case the dual model therefore views the decision optimization
problem in terms of determining system probabilities. The value of $\sum_{d \in D(i)} x_{id}$
represents the probability that in an actual realization of the process the system
reaches Node $i$.
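
The probability interpretation for a discount factor of 1 can be checked numerically. The sketch below uses a hypothetical three-node acyclic network with a fixed policy (all costs and probabilities are invented for illustration). It pushes probability mass forward from the starting Node T to obtain the node-visit probabilities, then confirms that the cost-weighted sum of those probabilities matches the expected cost computed by the backward recursion.

```python
# Hypothetical example, discount factor 1, one fixed decision per node.
# policy_arcs[i] = (expected cost at Node i, {j: transition probability to j})
policy_arcs = {
    3: (4.0, {2: 0.5, 1: 0.5}),
    2: (3.0, {1: 0.4, 0: 0.6}),
    1: (2.0, {0: 1.0}),
}
T = 3  # starting node; Node 0 is terminal

def visit_probabilities(policy_arcs, T):
    """Probability that a realization of the process reaches each node."""
    x = {i: 0.0 for i in range(T + 1)}
    x[T] = 1.0                      # the process starts at Node T
    for i in range(T, 0, -1):       # higher-numbered nodes first (acyclic)
        _, trans = policy_arcs[i]
        for j, p in trans.items():
            x[j] += p * x[i]
    return x

def expected_cost(policy_arcs, T):
    """Backward recursion for the fixed policy, starting from y_0 = 0."""
    y = {0: 0.0}
    for i in range(1, T + 1):
        c, trans = policy_arcs[i]
        y[i] = c + sum(p * y[j] for j, p in trans.items())
    return y[T]

x = visit_probabilities(policy_arcs, T)
dual_objective = sum(policy_arcs[i][0] * x[i] for i in range(1, T + 1))
print(x, dual_objective, expected_cost(policy_arcs, T))
```

The forward pass mirrors the flow-balance constraints of the dual, and the backward pass mirrors the primal recursion; the two objective values agree for any fixed policy.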

When $\alpha = 1$, it is simple to add chance-constraints to (iii) through (vi). For example,
if Node $i^*$ is in some way undesirable and this cannot be adequately indicated by the
values for $c_{id}$, then you can impose the chance-constraint

(vii) $\sum_{d \in D(i^*)} x_{i^* d} \le \beta$,

where $\beta$ is a suitably small probability. If chance-constraints are added, it is not always
true that there is an optimal solution in which only one $x_{id}$ is positive for each $i$. In fact
the solution will usually indicate a randomized strategy, where
$x_{id'} / \sum_{d \in D(i)} x_{id}$ represents
the conditional probability that decision $d'$ is taken, given that the system arrives
in Node $i$ (assuming $\sum_{d \in D(i)} x_{id} > 0$).

Variable transition times. In Sec. 12.1 we mentioned that without much
further complication the deterministic model can be generalized to encompass the
situation where the transition times vary from arc to arc. The same approach can be
applied in the stochastic model. Let $\alpha_{ij}(d)$ be the appropriate discount factor associated
with the system’s arriving at Node $j$, given that decision $d$ is taken at Node $i$. Actually,
$\alpha_{ij}(d)$ could itself be an expected value, that is, an expected discount factor, if the length
of the transition is itself a random variable. The only required alteration in (6) is to
substitute $\alpha_{ij}(d)$ for $\alpha$. Similar modifications are appropriate in the formulas below. ◄


18.3 UNBOUNDED HORIZON WITH
DISCOUNTING ($0 \le \alpha < 1$)

The stochastic shortest-route model in the previous section deals with an acyclic
structure and provides a convenient way to view probabilistic dynamic programming
models with a finite number of stages, states, and decisions. Here the analysis
is extended to situations with an unbounded horizon and with a finite number of
states and decisions.

The mathematical description of the model to be treated is simply enough
stated: solve the extremal equations

(1) $y_i = \min_{d \in D(i)} \left[ \sum_{j\ \text{in network}} p(j \mid i, d)\, \alpha y_j + c_{id} \right]$ for each $i$,

where we assume $0 \le \alpha < 1$. Note that we do not postulate any acyclic property
here [such as (5) in the preceding section], and now there is no terminal node.
The value of $y_i$ represents the minimal expected present value of starting at Node $i$
and using an optimal strategy over an unbounded time horizon. Notice again
the Markov property assumption about the transition probabilities, namely, that
they are conditional only on $i$ and $d$, and they do not depend on the system’s history of
previous states and decisions.

The description (1) is sufficiently flexible to encompass most dynamic programming
models where the number of states and decisions is finite, the horizon is
unbounded, and the transition probabilities are stationary.


► We can establish the stochastic version of the canonical form for describing discrete
dynamic programming models, namely, expression (i) of Sec. 12.1 (p. 406). Let $S$
represent the set of all possible states; $D(s)$ the set of all feasible values for the decision $d$
when the system is in State $s$; $R(s, d)$ the immediate expected economic return of decision $d$,
given that the system is in State $s$; and $T(s, d)$ the transformed state of the system at the
next stage, where $p[T(s, d)]$ denotes the probability that the system reaches State $T(s, d)$
at the next period, given that you make decision $d$ at State $s$ in the current period.
Then the functional equation for the stochastic version can be written as

(i) $f(s) = \operatorname{optimum}_{d \in D(s)} \{ R(s, d) + \alpha \sum_{T} p[T(s, d)]\, f[T(s, d)] \}$

for every $s$ in $S$ ($0 \le \alpha < 1$),

where $f(s)$ represents the expected present value of following an optimal policy over an
unbounded horizon, given that the current state is $s$. ◄

Inventory example. Consider once again the structure of the stochastic
inventory model in the preceding section, where here we assume there is a discount
factor $\alpha$. You saw in the preceding section how to construct the network
equivalent when the horizon is finite. For the unbounded horizon case, the period
number is no longer relevant, and the states of the system can therefore be denoted
merely by the value for entering inventory $i$, for $i = 0, 1, \ldots, 4$. (You may recall
this simplification was also possible for the deterministic case in an unbounded
horizon, analyzed in Sec. 12.5.)

The appropriate dynamic programming functional equation is simply

(2) $f(i) = \min_x \{ C(x) + 1 \cdot [i + x - 3] + \alpha \cdot \tfrac{1}{2} [ f(i + x - 2) + f(i + x - 4) ] \}$ for $i = 0, 1, \ldots, 4$ ($0 \le \alpha < 1$),

where the minimization is over only nonnegative integer values in the range
$4 - i \le x \le \min(5, 6 - i)$.

The notational correspondences for the network version in (1) are

(3) $y_i = f(i)$

(4) $c_{ix} = C(x) + 1 \cdot [i + x - 3]$

(5) $p[i + x - 2 \mid i, x] = p[i + x - 4 \mid i, x] = \tfrac{1}{2}$,

where a decision $d = x$, the production quantity. These data are summarized in
Fig. 18.3.

Suppose the stationary production policy

(6) produce $x = \begin{cases} 5 & \text{if } i = 0 \\ 6 - i & \text{if } i = 1, 2, 3 \\ 0 & \text{if } i = 4 \end{cases}$

is used. Then the behavior of the system can be described in terms of a Markov
chain array or matrix of transition probabilities:

(7)
                              Next period’s entering inventory
                              j = 0     1     2     3     4
  Current period’s   i = 0      0     1/2    0    1/2    0
  entering           i = 1      0      0    1/2    0    1/2
  inventory          i = 2      0      0    1/2    0    1/2
                     i = 3      0      0    1/2    0    1/2
                     i = 4     1/2     0    1/2    0     0

Figure 18.3. The Data for a Stochastic Inventory Model.

                           p(j | i, d)
   i    d in D(i)    j = 0     1     2     3     4      c_id
   0        4         1/2           1/2                  22
   0        5               1/2           1/2            25
   1        3         1/2           1/2                  20
   1        4               1/2           1/2            23
   1        5                     1/2           1/2      26
   2        2         1/2           1/2                  18
   2        3               1/2           1/2            21
   2        4                     1/2           1/2      24
   3        1         1/2           1/2                  16
   3        2               1/2           1/2            19
   3        3                     1/2           1/2      22
   4        0         1/2           1/2                   1
   4        1               1/2           1/2            17
   4        2                     1/2           1/2      20

Note: i = entering inventory this period; j = entering inventory next period;
d = production quantity.

[Use Fig. 18.3 to verify the entries in (7).] Likewise, any stationary production 
strategy will give rise to a Markov chain array of conditional probabilities. 
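
As a hedged illustration of how a stationary policy induces a Markov chain, the sketch below builds the transition matrix for the inventory model from the rule "next inventory = i + x - D" with D equal to 2 or 4, each with probability 1/2. The function name `transition_matrix` is hypothetical, and the policy used (5, 5, 4, 3, 0 for i = 0, ..., 4) is the one listed for the unbounded horizon in Fig. 18.4.

```python
from fractions import Fraction

def transition_matrix(policy):
    """Row i gives the probabilities of next period's entering inventory j,
    when the stationary rule produces policy[i] units at inventory level i."""
    half = Fraction(1, 2)
    P = [[Fraction(0)] * 5 for _ in range(5)]
    for i, x in enumerate(policy):
        for demand in (2, 4):              # each demand has probability 1/2
            P[i][i + x - demand] += half
    return P

P = transition_matrix([5, 5, 4, 3, 0])
for row in P:
    print([str(p) for p in row])
```

Any other feasible stationary rule can be passed in the same way; each row always sums to 1 because exactly two demand outcomes are possible.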

Justification of extremal equations. Define a policy over an unbounded
horizon as a complete description of the decisions to be made, period
by period and state by state. Such a policy need not be stationary in that the
decision rule may differ from period to period. This definition is the concept
discussed in Chaps. 16 and 17. Define an optimal policy as one that gives the
minimum obtainable expected present value for every possible initial state of the
system. Again such a strategy need not be stationary. However, the extremal equations
in (1) implicitly assume: there is an optimal deterministic stationary
strategy (the same decision is to be made each time the system arrives at
Node $i$), and there are unique values satisfying (1) corresponding to such an
optimal strategy. The validity of these two assumptions must be proved, and
indeed we can show the following theorem.

Stationary Policy Theorem for $0 \le \alpha < 1$. There always exist unique finite
$y_i$ that satisfy the extremal equations (1), and a stationary policy corresponding
to these is optimal among all possible policies.

This is the same important theorem as in Sec. 12.1 for the deterministic case
and, in fact, most of the analysis there remains applicable here. For convenience,
the discussion in Sec. 12.1 is repeated below in a somewhat abbreviated form and
with special attention given to the minor modifications required to encompass
transition probabilities.

►Without going into a detailed algebraic proof of the Stationary Policy Theorem, we
can state the line of reasoning used to establish the result.

Denote a general policy (not necessarily stationary) by the symbol $\pi$ that indicates
what action to take in the current period, given the actual state of the system; and what
action to take in each future Period $n$, where $n = 2, 3, 4, \ldots$, for the state of the system
at that particular time. Given $\pi$, consider a composite policy consisting of a decision rule
at each state for the initial period (denote these by $f$) and followed thereafter by $\pi$
(that is, the policy $\pi$ moved ahead by one period). Let $(f\pi)$ symbolize this composite
policy. Thus, the action for the current period is given by the rule $f$. The action rule you
would have used in the current period under $\pi$, you now use at Period 2 under the
composite policy $(f\pi)$. And similarly, the rule you would have used at Period $n$ under $\pi$,
you use at Period $n + 1$ under $(f\pi)$.

Then it can be proved that

(i) Given a general policy $\pi$, if every composite policy $(f\pi)$ results in an expected
present value for each state that is at least as large as the associated expected
present value for $\pi$, then $\pi$ is optimal.

(ii) If a general policy $\pi$ has an expected present value for every state that is at least
as large as the corresponding expected present value for a composite policy
$(f\pi)$, but is strictly larger for at least one state, then a stationary policy using
only $f$ will result in an associated expected present value for every state that is
no larger than the corresponding expected present value for policy $\pi$, and has
an expected present value that is strictly less for at least one state.

Given (i) and (ii), an algorithmic type of argument demonstrates there is an optimal
policy that is stationary: consider any trial policy that is stationary. Then by (i), it is
optimal unless there are decision rules for the initial period that make a strict improvement
for at least one state. If there are such decisions, then by (ii), using these decisions
as a new trial stationary policy yields an improvement. Therefore the sequence of improved
trial policies can be restricted to include only stationary policies. Since the number of
states and decisions is finite, there is only a finite number of stationary policies. Consequently,
the process of obtaining improved trial solutions is finite; that is, in a finite
number of trials, we obtain a stationary policy that is optimal.

We can then show that the expected present values, which do exist, must satisfy the
extremal equations (1), thus establishing the existence of a solution to (1). The uniqueness
of these values follows from an examination of the algebraic structure of (1). ◄

Successive approximations. A solution to the extremal equations (1)
can be obtained by the process of successive approximations in function
space. Let $y_i^0$ be arbitrarily chosen and apply the value iteration
algorithm

(8) $y_i^{n+1} = \min_{d \in D(i)} \left[ \sum_{j\ \text{in network}} p(j \mid i, d)\, \alpha y_j^n + c_{id} \right]$ for each $i$ (value iteration).

Each $y_i^n$ always will converge to a limit that satisfies the extremal equations in
(1), but the convergence generally is not finite. If all $c_{id} \ge 0$ and all $y_i^0 = 0$, the
$y_i^n$ increase monotonically. By starting with every $y_i^0 = 0$, the process is the same
as looking at a certain finite horizon model. Specifically, $y_i^n$ represents the expected
present value of an optimal path starting from Node $i$ and traversing $n$ arcs.

For any $c_{id}$, you can obtain convergence from above as follows. Select a trial
stationary policy; for brevity of notation, denote each of these decisions by the
symbol $d'$. Then solve the set of simultaneous linear equations

(9) $y_i - \sum_{j\ \text{in network}} p(j \mid i, d')\, \alpha y_j = c_{id'}$ for each $i$ (value determination)

for the initial values $y_i^0$ to be used in the algorithm (8). If you arbitrarily
stop at iteration $m$ and employ the immediate optimal policy from (8) as a stationary
policy in the unbounded horizon model, the corresponding expected
present values will be no larger than the $y_i^m$ in (8).
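
Value iteration for the stochastic inventory model can be sketched as follows. This is an illustrative implementation under assumptions, not the text's own computation: the helper names `q_value` and `value_iteration_step` are hypothetical, the discount factor is taken as .9, all starting values are zero (convergence from below), and 500 iterations are used simply because the error shrinks by a factor of about .9 per pass.

```python
C = [0, 15, 17, 19, 21, 23]   # production cost C(x), x = 0..5
ALPHA = 0.9                   # one-period discount factor

def q_value(i, x, y, alpha):
    """Bracketed quantity in the value iteration formula for state i, decision x."""
    immediate = C[x] + (i + x - 3)           # production + expected holding cost
    future = 0.5 * (y[i + x - 2] + y[i + x - 4])
    return immediate + alpha * future

def value_iteration_step(y, alpha=ALPHA):
    """One pass of value iteration; returns new values and the minimizing decisions."""
    new_y, policy = [], []
    for i in range(5):
        options = {x: q_value(i, x, y, alpha)
                   for x in range(4 - i, min(5, 6 - i) + 1)}
        best = min(options, key=options.get)
        policy.append(best)
        new_y.append(options[best])
    return new_y, policy

y = [0.0] * 5                 # start from zero values
history = []
for n in range(1, 501):
    y, policy = value_iteration_step(y)
    history.append((list(y), list(policy)))

for n in (1, 2, 3):
    values, decisions = history[n - 1]
    print(n, decisions, [round(v, 2) for v in values])
print("limit", policy, [round(v, 2) for v in y])
```

Replacing the zero start with the solution of the value determination equations for some trial policy gives convergence from above instead.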


Figure 18.4. Stochastic Inventory Example: Value Iteration (From Below*). ($\alpha = .9$)

  Entering       n = 1            n = 2            n = 3                                        Unbounded Horizon
  Inventory
     i       x^1(i)  y_i^1    x^2(i)  y_i^2    x^3(i)  y_i^3    y_i^10    y_i^20    y_i^30     x_∞(i)     y_i
     0         4      22        4     40         5     54.29    122.89    163.82    178.09       5       185.73
     1         3      20        5     34.55      5     49.19    117.75    158.68    172.95       5       180.59
     2         2      18        4     32.55      4     47.19    115.75    156.68    170.95       4       178.59
     3         1      16        3     30.55      3     45.19    113.75    154.68    168.95       3       176.59
     4         0       1        0     19         0     33.64    102.11    143.03    157.30       0       164.94

*Initial iteration: $y_i^0 = 0$ for each $i$.

An application of value iteration is shown in Figs. 18.4 and 18.5 for the stochastic
inventory model in Fig. 18.3. Notice convergence to the optimal stationary policy
is rapid: at iteration $n = 3$ in Fig. 18.4 and iteration $n = 2$ in Fig. 18.5. (At
iteration $n = 40$, the values of $y_i^n$ starting with $y_i^0 = 0$ are within 1% of the minimal
$y_i$.) The initial $y_i^0$ in Fig. 18.5 are found by solving the value determination
equations (9), which in this case are the equations (10) below.

Figure 18.5. Stochastic Inventory Example: Value Iteration Approximations
in Function Space (From Above*). ($\alpha = .9$)

  Entering       n = 0            n = 1            n = 2                                        Unbounded Horizon
  Inventory
     i       x^0(i)  y_i^0    x^1(i)  y_i^1    x^2(i)  y_i^2    y_i^10    y_i^20    y_i^30          y_i
     0         4     202        4    202         5    200.09    191.94    187.89    186.48         185.73
     1         3     200        5    196.55      5    195       186.80    182.75    181.34         180.59
     2         2     198        4    194.55      4    193       184.80    180.75    179.34         178.59
     3         1     196        3    192.55      3    191       182.80    178.75    177.34         176.59
     4         0     181        0    181         0    179.44    171.15    167.11    165.70         164.94

*Values of $y_i^0$ are calculated from the value determination equation (9) using the
policy shown in the $x^0(i)$ column above.


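(10)
$(1 - .5\alpha) y_0 - .5\alpha y_2 = 22$
$-.5\alpha y_0 + 1 y_1 - .5\alpha y_2 = 20$
$-.5\alpha y_0 + (1 - .5\alpha) y_2 = 18$
$-.5\alpha y_0 - .5\alpha y_2 + 1 y_3 = 16$
$-.5\alpha y_0 - .5\alpha y_2 + 1 y_4 = 1$,

where $\alpha = .9$, yielding

(11) $y_0 = 202 \quad y_1 = 200 \quad y_2 = 198 \quad y_3 = 196 \quad y_4 = 181$.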
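
The value determination equations can be solved mechanically for any stationary policy. The sketch below is an illustration under assumptions: `solve_linear` is a small hand-written Gauss-Jordan routine (a hypothetical helper, used so the example needs no external library), and `evaluate_policy` assembles the coefficients from the inventory dynamics for the policy that produces 4 - i units at inventory level i, with a discount factor of .9.

```python
C = [0, 15, 17, 19, 21, 23]   # production cost C(x), x = 0..5

def solve_linear(A, b):
    """Solve A y = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                M[r] = [a - factor * c for a, c in zip(M[r], M[col])]
    return [M[r][n] / M[r][r] for r in range(n)]

def evaluate_policy(policy, alpha):
    """Expected present values of a stationary policy (value determination)."""
    A = [[0.0] * 5 for _ in range(5)]
    b = []
    for i, x in enumerate(policy):
        A[i][i] += 1.0
        for demand in (2, 4):                    # each with probability 1/2
            A[i][i + x - demand] -= 0.5 * alpha
        b.append(C[x] + (i + x - 3))             # expected one-period cost
    return solve_linear(A, b)

y0 = evaluate_policy([4, 3, 2, 1, 0], alpha=0.9)
print([round(v, 2) for v in y0])
```

The same routine evaluates any other stationary policy by changing the list passed to `evaluate_policy`.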

[Write the equations (9) for the optimal policy and verify the minimal values
$y_i$ in Fig. 18.4.] Notice that convergence to $y_i$ occurs more rapidly by starting with
$y_i^0$ for the stationary policy in (6) rather than by letting $y_i^0 = 0$. Explain why,
for $i = 1, 2, 3$, the values of $y_i^n$ for adjacent $i$ differ by the amount 2 in Figs. 18.4 and
18.5. In Fig. 18.6 observe how the optimal policy and the values for $y_i$ change with
different $\alpha$ values. Explain why the decision amounts $x_\infty(i)$ decrease as $\alpha$ gets
smaller.

Figure 18.6. Stochastic Inventory Example: Optimal Solutions for
Unbounded Horizon.

  Entering         α = .9              α = .8              α = .5
  Inventory
     i         x_∞(i)    y_i       x_∞(i)    y_i       x_∞(i)    y_i
     0           5      185.73       4       97.2        4        42
     1           5      180.59       5       92.8        3        40
     2           4      178.59       4       90.8        2        38
     3           3      176.59       3       88.8        1        36
     4           0      164.94       0       76.2        0        21

You also can solve (1) by successive approximation in policy
space, that is, by the policy iteration algorithm:

Step 1. Select an arbitrary initial policy, and let $n = 0$.

Step 2. Given the trial policy, calculate the $y_i^n$ according to the value determination
equations:

(12) $y_i^n - \sum_{j\ \text{in network}} p(j \mid i, d')\, \alpha y_j^n = c_{id'}$ for each $i$ (value determination routine),

where $d'$ represents the decision at Node $i$ indicated by the specific policy being
evaluated.

Step 3. Test for a policy improvement by calculating

(13) $\min_{d \in D(i)} \left[ \sum_{j\ \text{in network}} p(j \mid i, d)\, \alpha y_j^n + c_{id} \right] \equiv Y_i^n$ for each $i$ (test quantity).

Step 4. Terminate the iterations if $Y_i^n = y_i^n$ for all $i$. Otherwise, revise the policy
at each Node $k$ where $Y_k^n < y_k^n$, using a decision yielding $Y_k^n$ in (13). Increase $n$
to $n + 1$, and return to Step 2 with the new trial policy.

The policy iteration algorithm has the properties

(i) $y_i^{n+1} \le y_i^n$ for each Node $i$, and $y_i^{n+1} < y_i^n$ if $Y_i^n < y_i^n$.

(ii) The algorithm terminates in a finite number of iterations.

(iii) At termination, a policy yielding $Y_i^n$ is optimal.

When the policy iteration algorithm is applied to the stochastic inventory example
in Fig. 18.3, the sequence of stationary policies is the same as that shown in
Fig. 18.5. However, the values for $y_i^2$, calculated from the linear equations (12),
are the minimal $y_i$ and so the iterations terminate at $n = 2$. [Check to see that
when $y_i^n$ given by $y_i$ in Fig. 18.4 are used in the test (13), the iterations stop.]

A linear programming approach for finding an optimal policy is explained in 
Sec. 18.5. 

18.4 EQUIVALENT AVERAGE RETURN ($\alpha = 1$)

You will recall from the discussion of deterministic models in Chap. 12 that the
case where $\alpha = 1$ gives rise to certain conceptual as well as computational difficulties.
In particular, it is not legitimate merely to set $\alpha = 1$ in the extremal
equations (1) in the preceding section; doing so produces a set of functional
equations that has either no solution or only an indeterminate solution.

We approach the analysis of this situation in three steps. First, we define the 
criterion for an optimal policy when o' = 1. Then we assert there always exists 
a stationary policy that is optimal. Finally, we give appropriate extremal equations 
and a successive approximation technique for finding a stationary policy that opti¬ 
mizes the objective function. In the first two steps, we make no further assumptions 
about the structure of the model beyond those given in the preceding section. For 
the third step dealing with the algorithm, however, we will impose some mild 
restrictions to make the exposition easier. 

Consider a given policy (not necessarily stationary), denoted by the symbol
$\pi^*$. We also permit the possibility that the policy $\pi^*$ includes randomized rules,
such as, “when at Node $i$, pick decision $d_1$ with probability $p$ and decision $d_2$
with probability $1 - p$.” For any specified value of $\alpha$, $0 \le \alpha < 1$, let $y_i^*(\alpha)$
signify the expected present value if the system starts at Node $i$ and the given
policy $\pi^*$ is followed. [For a fixed $\alpha$, if $\pi^*$ is stationary and nonrandomized, then
you can compute the numerical values of $y_i^*(\alpha)$ by using the value determination
routine indicated in (12) of the preceding section. Otherwise, $y_i^*(\alpha)$ can be
computed by a straightforward but lengthy series of calculations.] As before, denote the
minimal expected present value if the system starts in Node $i$ by $y_i(\alpha)$. Observe
here we have explicitly included the symbol $\alpha$, and, for a fixed $\alpha$, the $y_i(\alpha)$ are
just the solution values to the extremal equation (1) in Sec. 18.3.

The policy $\pi^*$ is defined as optimal for $\alpha = 1$ if

(1)  $y_i^*(\alpha) = y_i(\alpha)$  for each $i$  (optimal policy)

for all $\alpha$ sufficiently close to 1. That is, policy $\pi^*$ is optimal if there exists a number
$\alpha^* < 1$ such that for all $\alpha$ in the interval $\alpha^* < \alpha < 1$, the expected present
values for $\pi^*$ equal the minimal possible expected present values. A significant
result that can be proved is the following theorem.

Stationary Policy Theorem for $\alpha = 1$. There always exists a stationary
policy that is optimal. Furthermore, there always exists such a policy that does 
not randomize. 

As plausible as these conclusions seem, their proof is not trivial. Indeed, the result 
is not true if the number of states is unbounded, because then there may be no 
optimal policy that is both stationary and deterministic. (See exercise 46.) 

Markov chain structure. Prior to establishing the appropriate extremal 
equations and solution algorithm, we elaborate on the probabilistic structure of 
the model when a stationary policy is employed. We will impose an assumption 
in (3) below that serves to eliminate some special cases that are of importance 
primarily to technical specialists. This postulate eases the exposition considerably 
without much loss of practical applicability. Afterwards we present a thumbnail
sketch of Markov chain behavior. If you are already familiar with the details
of Markov theory, then skim over, or skip, these examples.

For simplicity, assume from here on that the states are numbered 0, 1,..., T. 
Consider any stationary policy, and, for brevity of notation, denote each of the 
indicated decisions by the symbol $d'$. Then the law of motion for the system
is completely specified by the corresponding square matrix of transition
probabilities for moving from State $i$ in one period to State $j$ in the next period:


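(2)
$$
\begin{pmatrix}
p(0 \mid 0, d') & p(1 \mid 0, d') & \cdots & p(j \mid 0, d') & \cdots & p(T \mid 0, d') \\
p(0 \mid 1, d') & p(1 \mid 1, d') & \cdots & p(j \mid 1, d') & \cdots & p(T \mid 1, d') \\
\vdots & \vdots & & \vdots & & \vdots \\
p(0 \mid i, d') & p(1 \mid i, d') & \cdots & p(j \mid i, d') & \cdots & p(T \mid i, d') \\
\vdots & \vdots & & \vdots & & \vdots \\
p(0 \mid T, d') & p(1 \mid T, d') & \cdots & p(j \mid T, d') & \cdots & p(T \mid T, d')
\end{pmatrix}
$$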


Such a system that is completely characterized by its one-period transition
probabilities is known as a Markov chain. [You already saw an example of such an
array for the inventory model in (7) of Sec. 18.3.] In keeping with tradition, we
will treat Markov chains in terms of states rather than nodes. We will resume the
network language when we return to discussing algorithmic techniques. 

Before stating the extremal equations for $\alpha = 1$, we now assume there is an
optimal stationary policy with decisions $d^*$ having stationary probabilities
$q_j^*$, for $j = 0, 1, \ldots, T$, that are uniquely characterized by the $T + 1$ linear
equations

(3)  $q_j^* = \sum_{i=0}^{T} q_i^*\, p(j \mid i, d^*)$  for $j = 0, 1, \ldots, T$  (unique stationary probability postulate)

and the restriction ($\sum_{j=0}^{T} q_j^* = 1$). This assumption does not hold for all Markov
chain models, but it is valid for many cases of practical interest. The theory and 
techniques below can be suitably modified to treat cases where the assumption in 
(3) is not justified. 

One interpretation of (3) is the following: suppose each $q_j^*$ is the probability
that in a given period the system is at State $j$. Then the right-hand side of (3)
is just the probability that the system is at State $j$ in the next period. According
to (3), the values of $q_j^*$ are such that these two probabilities are the same. Thus,
repeating the same line of argument, if the $q_j^*$ satisfy (3) and in fact are the
probabilities of the system being at State $j$ in the initial period, then the $q_j^*$ are also the
probabilities that the system is at State $j$ in any future period.

A second interpretation of $q_j^*$ that holds for most practical cases is that the values
represent the limiting probabilities of the system being at State $j$ far in
the future, when the system approaches statistical equilibrium. A third
interpretation, imprecise but intuitive, is that each $q_j^*$ represents the long-run
relative frequency of the system being at State $j$.

The stationary probabilities are important, because they provide most of the 
information needed to calculate the system’s operating characteristics. 
For example, you will see below [in (5)] how to use the $q_j^*$ to compute the system’s
expected return per period. In subsequent chapters you will learn how to exploit
the special structure of certain Markov models in order to write explicit
expressions for $q_j^*$.
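The equations in (3), together with the restriction that the probabilities sum to 1, form an ordinary linear system. As a minimal sketch (the function names `solve` and `stationary` and both example arrays are hypothetical, introduced only for illustration), one of the $T + 1$ equations in (3) can be replaced by the normalization, since the equations in (3) are linearly dependent:

```python
from fractions import Fraction as F

def solve(A, b):
    """Gauss-Jordan elimination in exact arithmetic; A must be nonsingular."""
    n = len(A)
    M = [[F(v) for v in row] + [F(r)] for row, r in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [row[n] for row in M]

def stationary(P):
    """Solve q_j = sum_i q_i P[i][j] for j < T, plus sum_j q_j = 1."""
    n = len(P)
    # row j holds the coefficients of q_0, ..., q_T in equation j of (3)
    A = [[(i == j) - P[i][j] for i in range(n)] for j in range(n - 1)]
    A.append([1] * n)                     # normalization replaces the last equation
    return solve(A, [0] * (n - 1) + [1])

# A two-state array chosen for illustration.
two_state = [[F(1, 2), F(1, 2)],
             [F(1, 4), F(3, 4)]]

# A three-state array whose columns also add to 1.
doubly = [[F(1, 2), F(1, 4), F(1, 4)],
          [F(1, 4), F(1, 2), F(1, 4)],
          [F(1, 4), F(1, 4), F(1, 2)]]

q_two = stationary(two_state)
q_doubly = stationary(doubly)
```

The sketch presumes the uniqueness postulate holds; when it fails, the linear system is singular and the elimination finds no pivot.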

*Examples of stationary probabilities. The following examples
illustrate the notion of stationary probabilities.


Case 1. Let the array of transition probabilities be



Since $q_0^* + q_1^* = 1$, the equation for $j = 0$ from (3) is

(ii)  $q_0^* = q_0^*(.5) + (1 - q_0^*)\, p(0 \mid 1, d^*)$,


so that 



Let $q_j(n)$ denote the probability that the system is at State $j$ in the beginning of
Period $n$, where $q_j(1) = q_j$ is any given probability for the initial period. Then,
analogous to (3), it is easy to verify that

(iv)  $q_j(n + 1) = \sum_{i=0}^{T} q_i(n)\, p(j \mid i, d^*)$.


When $T = 1$, it can be proved that (iv) may be written in the form

(v)  $q_j(n + 1) = q_j^* + [q_j - q_j^*]\,[1 - p(1 \mid 0, d^*) - p(0 \mid 1, d^*)]^n$  for $j = 0, 1$.

The term following the plus sign on the right-hand side of (v) is called the
transient correction, since it represents the difference $[q_j(n + 1) - q_j^*]$
attributable to the system’s operating only over a finite horizon. What is the range
of possible values for $[1 - p(1 \mid 0, d^*) - p(0 \mid 1, d^*)]$?

In the case of (i), since $[1 - p(1 \mid 0, d^*) - p(0 \mid 1, d^*)]^n$ approaches 0 as $n$ gets
arbitrarily large, the values of $q_j(n + 1)$ approach $q_j^*$ in the limit. Thus the values in (iii)
can be interpreted as limiting state probabilities as well as long-run relative
frequencies.
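The closed form (v) can be checked numerically against the recursion (iv). The sketch below uses a hypothetical two-state array (it is not the array of Case 1) and starts the system in State 0; the variable names are illustrative.

```python
from fractions import Fraction as F

p01 = F(1, 2)          # p(1 | 0, d*), assumed for illustration
p10 = F(1, 4)          # p(0 | 1, d*), assumed for illustration
P = [[1 - p01, p01],
     [p10, 1 - p10]]

# For T = 1 the solution of (3) with q_0 + q_1 = 1 is explicit.
q_star = [p10 / (p01 + p10), p01 / (p01 + p10)]

q_initial = [F(1), F(0)]            # q_j(1): the system starts in State 0
factor = 1 - p01 - p10              # bracketed quantity in (v)

q = q_initial
history = []
for n in range(1, 7):
    # (iv): q_j(n + 1) = sum_i q_i(n) p(j | i, d*)
    q_next = [sum(q[i] * P[i][j] for i in range(2)) for j in range(2)]
    # (v): stationary value plus the transient correction
    closed = [q_star[j] + (q_initial[j] - q_star[j]) * factor ** n
              for j in range(2)]
    history.append((q_next, closed))
    q = q_next
```

With these numbers the bracketed factor is 1/4, so the transient correction shrinks by a factor of four each period.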

Case 2. Given the chain matrix 


(vi) 
then 

(vii)

Whenever all the rows of a Markov array are identical, the stationary probabilities
equal these row elements. Calculate $q_j(n + 1)$ from (v).


Case 3. Given the transition law 



check that the stationary probabilities are

(ix)  $q_0^* = \tfrac{1}{2} \qquad q_1^* = \tfrac{1}{2}$.

Whenever the entries in every column of a Markov array add to 1, all the $q_j^*$ are
equal [that is, $q_j^* = 1/(T + 1)$]. Such an array is called doubly stochastic.


Case 4. Show that for 



the stationary probabilities are 



Case 5. Suppose the chain matrix is

(xii)  $\begin{pmatrix} \tfrac{1}{2} & \tfrac{1}{2} \\ 0 & 1 \end{pmatrix}$.

Notice that once this system reaches State 1, it remains there forever. State 0 is
referred to as a transient state. The associated stationary probabilities are

(xiii)  $q_0^* = 0 \qquad q_1^* = 1$.

If the system is at State 0 in the initial Period 1, use (v) to verify that it is still
at State 0 in the beginning of Period $n + 1$ with probability $(\tfrac{1}{2})^n$. Thus as $n$ gets
arbitrarily large, the probability of being at State 0 becomes negligible, and equals
0 in the limit. Thus (xiii) is in accord with the limiting probability and long-run
frequency interpretations.


Case 6. Suppose the transition probabilities are

(xiv)  $\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$,

so that the system alternates between States 0 and 1. Then check that

(xv)  $q_0^* = \tfrac{1}{2} \qquad q_1^* = \tfrac{1}{2}$.

Although the long-run relative frequency interpretation for the values in (xv)
is intuitive, observe from (v) that

(xvi)  $q_j(n + 1) = \begin{cases} q_j & \text{for } n \text{ even} \\ 1 - q_j & \text{for } n \text{ odd.} \end{cases}$

Thus (if $q_j \ne \tfrac{1}{2}$) the value of $q_j(n + 1)$ depends on whether $n$ is odd or even and
does not approach a limit as $n$ gets arbitrarily large. For this illustration, it is
therefore inappropriate to interpret the values in (xv) as limiting probabilities
of the system being at State $j$. This case represents an example of a so-called
cyclic chain.

Case 7. Finally, consider the system described by 



Notice if the system begins at any State $j$, it remains there forever. In this case
(3) does not have a unique solution; any probabilities satisfy (3). The system (xvii)
is an illustration of what is sometimes called a multiple chain: the system
can be split into separable subsystems, each of which can be analyzed
independently.

Thus by assuming the equations in (3) are uniquely solvable for an optimal 
stationary policy, we are ruling out systems with optimal multiple chain policies. 
It is not really difficult to obtain optimal policies for multiple chains, but this 
situation is better left to more advanced texts. 

Extremal equations. Suppose a deterministic stationary policy $\pi^*$
is optimal. Then by definition, for $\alpha$ sufficiently close to 1,

(4)  $y_i = \min_{d \text{ in } D(i)} \Big[ \sum_{j=0}^{T} p(j \mid i, d)\,\alpha\, y_j + c_{id} \Big] = \sum_{j=0}^{T} p(j \mid i, d^*)\,\alpha\, y_j + c_{id^*}$  for each $i$,

where $d^*$ denotes the decision at Node $i$ indicated by $\pi^*$. Further let

(5)  $c^* = \sum_{i=0}^{T} q_i^*\, c_{id^*}$,

where $q_i^*$ are the stationary probabilities calculated from the equations in (3)
for policy $\pi^*$. An intuitive interpretation for $c^*$ is the system’s long-run expected
return (or cost) per period. Although this description is somewhat imprecise,
we shall use it below in referring to $c^*$.

We employed a heuristic argument in Sec. 12.3 to derive the appropriate extremal
equations for the case $\alpha = 1$, and will apply the same approach here. But
we repeat the warning that the reasoning is meant mainly to motivate the resulting
extremal equations, which may look strange to you; the derivation is not a
mathematically rigorous approach to the development of the functional equations. A
different heuristic approach is explained in the special material below.

Consider the expected equivalent return $(1 - \alpha) y_i$ and define $w_i$ by means
of the identity

(6)  $(1 - \alpha) y_i = (1 - \alpha) w_i + c^*$  for each $i$.

Then you can rewrite the functional equations (4) as

(7)  $w_i + \dfrac{c^*}{1 - \alpha} = \min_{d \text{ in } D(i)} \Big[ \sum_{j=0}^{T} p(j \mid i, d)\,\alpha \Big( w_j + \dfrac{c^*}{1 - \alpha} \Big) + c_{id} \Big]$  for each $i$.

Since $\sum_{j=0}^{T} p(j \mid i, d) = 1$, and $c^*$ is a constant, the term involving $c^*$ on the right
of (7) can be brought to the left. Check that after doing so and letting $\alpha = 1$,
you have the extremal equations

(8)  $w_i + c^* = \min_{d \text{ in } D(i)} \Big[ \sum_{j=0}^{T} p(j \mid i, d)\, w_j + c_{id} \Big]$  for each $i$  (extremal equations for $\alpha = 1$).

You can interpret the quantity $w_i - w_j$ as the incremental expected return of
starting at Node $i$ instead of at Node $j$. (In formal terms, $w_i - w_j$ is the limiting
value of $y_i - y_j$ as $\alpha$ approaches 1.)

[As we pointed out in Chap. 12, in order to make the above derivation precise,
you should use the notation $y_i(\alpha)$ and $w_i(\alpha)$ in (6) and (7) and define $w_i$ in (8)
as the limiting value of $w_i(\alpha)$ as $\alpha$ approaches 1 from below.]

Verify that if a set of $w_i$ satisfies (8), then so do $(w_i + K)$ for an arbitrary constant
$K$. Therefore, it is necessary to add a normalizing restriction. A convenient one
is

(9)  $w_0 = 0$  (normalization).

Applying the same reasoning to the second equality in (4) yields the value
determination equations for policy $\pi^*$:

(10)  $w_i + c^* = \sum_{j=0}^{T} p(j \mid i, d^*)\, w_j + c_{id^*}$  for each $i$  (value determination routine).

The system (9) and (10) consists of $T + 2$ linear equations in $T + 2$ unknowns
$w_0, w_1, \ldots, w_T$ and $c^*$. An implication of the postulate in (3) that unique stationary
probabilities exist is that the equations in (9) and (10) have a unique solution.
Further, the resultant value for $c^*$ is indeed the same value as given in (5). To
see this, first multiply each relation in (10) by $q_i^*$ and sum over $i$, thereby giving

(11)  $\sum_{i=0}^{T} q_i^* (w_i + c^*) = \sum_{i=0}^{T} \sum_{j=0}^{T} q_i^*\, p(j \mid i, d^*)\, w_j + \sum_{i=0}^{T} q_i^*\, c_{id^*}$.

Then use (3) to verify that the equation in (11) simplifies to (5): by (3) the double
sum equals $\sum_j q_j^* w_j$, which cancels the $w_i$ terms on the left, and the $q_i^*$ sum to 1.
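The claim that the value of $c^*$ obtained from (9) and (10) agrees with (5) is easy to check on a small case. The sketch below uses a hypothetical two-state chain with hypothetical one-period costs (none of these numbers come from the text) and solves both linear systems exactly.

```python
from fractions import Fraction as F

def solve(A, b):
    """Gauss-Jordan elimination in exact arithmetic; A must be nonsingular."""
    n = len(A)
    M = [[F(v) for v in row] + [F(r)] for row, r in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [row[n] for row in M]

# Hypothetical transition array and costs c_{id*} under a fixed policy.
P = [[F(1, 2), F(1, 2)],
     [F(1, 4), F(3, 4)]]
costs = [6, 3]
n = len(P)

# (10): w_i + c - sum_j p(j|i) w_j = c_i, with unknowns (w_0, ..., w_T, c),
# followed by the normalization (9): w_0 = 0.
A = [[(i == j) - P[i][j] for j in range(n)] + [1] for i in range(n)]
A.append([1] + [0] * n)
*w, c = solve(A, costs + [0])

# (3): stationary probabilities, with the normalization replacing one equation.
B = [[(i == j) - P[i][j] for i in range(n)] for j in range(n - 1)]
B.append([1] * n)
q = solve(B, [0] * (n - 1) + [1])

c_from_5 = sum(qi * ci for qi, ci in zip(q, costs))   # right-hand side of (5)
```

Here the value determination equations give $w_1 = -4$ and $c = 4$, and the weighted average of the costs in (5) gives the same 4.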

We now summarize the principal implication of the extremal equations. 

Necessary Condition for Optimality. (a) Given a deterministic stationary
policy $\pi^*$, if $\pi^*$ is optimal, then the solution to (9) and (10) satisfies the extremal
equations (8) and (9). (b) Equivalently, if the solution to (9) and (10) does not satisfy the
extremal equations (8) and (9), then $\pi^*$ is not optimal.

Since there always exists a stationary policy that is optimal, it follows that there 
always exists a solution to (8) and (9). 

The Necessary Condition asserts that an optimal stationary policy yields a 
solution to the extremal equations. Now we ascertain whether a stationary policy 
that yields a solution to the extremal equations is optimal.

Consider a stationary strategy $\pi'$ having decision elements $d'$. Let $w_i'$ and $c'$
be the associated solution to

(12)  $w_0' = 0$

(13)  $w_i' + c' = \sum_{j=0}^{T} p(j \mid i, d')\, w_j' + c_{id'}$  for each $i$.

The following proposition can be proved. 

Sufficient Condition for Minimum Expected Return per Period. (a) Given
a deterministic stationary policy $\pi'$, if the corresponding $w_i'$ and $c'$ from (12) and (13)
satisfy the extremal equations (8) and (9), then $c'$ is minimal among all policies.
(b) Equivalently, if $c'$ is not minimal, then the solution to (12) and (13) does not satisfy
the extremal equations (8) and (9).

Consequently, a stationary policy leads to a solution of the extremal equations
only if the associated $c'$ has the same value as an optimal policy. Note, however,
that such a policy $\pi'$ may not be optimal according to the particular definition
of an optimal policy in (1), that is, $\pi'$ may not be optimal for every Node $i$ when
$\alpha$ is near to 1. But in many practical situations, obtaining a policy having the
minimum obtainable expected return per period is adequate. What is more, you can
expand the formulation in a way that leads to computational techniques for
finding an optimal policy as defined in (1). These topics are more appropriate for
advanced texts and are not pursued here.

► The Sufficient Condition can be demonstrated as follows. As before, let $d^*$ refer to a
decision in an optimal stationary policy $\pi^*$. Since the strategy $\pi'$ yields a solution in
(12) and (13) to the extremal equations, it follows that

(i)  $w_i' + c' \le \sum_{j=0}^{T} p(j \mid i, d^*)\, w_j' + c_{id^*}$  for each $i$.

Using (10), you can write (i) as

(ii)  $w_i' + c' - \sum_{j=0}^{T} p(j \mid i, d^*)\, w_j' \le w_i + c^* - \sum_{j=0}^{T} p(j \mid i, d^*)\, w_j$.

Multiply (ii) by the stationary probabilities $q_i^*$ for policy $\pi^*$, sum over $i$, and thereby
obtain

(iii)  $c' \le c^*$.

Since $c^*$ is minimal by hypothesis, $c' = c^*$, as was to be shown.

When the model is in fact deterministic, the extremal equations (8) reduce to those
given in Chap. 12, namely, (6) in Sec. 12.3. Examples 3 and 6 in Sec. 12.4 demonstrate
how a stationary policy can yield a solution to the extremal equations but not be optimal
according to the definition (1) above. ▲

Policy iteration. A straightforward extension of the policy space algorithm
in Sec. 12.3 can be used to obtain a policy $\pi'$ having minimal $c'$. To simplify the
exposition of the technique, we make one further postulate that is satisfied in most
practical situations: assume that unique stationary probabilities exist for each trial policy
$\pi'$ tested in Step 2 below. (The algorithm can be easily modified to handle the
situation where this assumption is not satisfied.)

The policy space algorithm is 

Step 1. Select an arbitrary initial stationary policy, and let $n = 0$.

Step 2. Given the trial policy $\pi'$ at iteration $n$, solve the value determination
equations

(14)  $w_0^n = 0$
      $w_i^n + c^n = \sum_{j=0}^{T} p(j \mid i, d')\, w_j^n + c_{id'}$  for each $i$  (value determination routine),

where $d'$ represents the decision at Node $i$ indicated by the specific policy being
evaluated.

Step 3. Calculate

(15)  $\min_{d \text{ in } D(i)} \Big[ \sum_{j=0}^{T} p(j \mid i, d)\, w_j^n + c_{id} \Big] \equiv W_i^n$  for each $i$  (test quantity).

Step 4. Terminate the iterations if $W_i^n = w_i^n + c^n$ for all $i$; then $c^n$ is minimal.
Otherwise, revise the policy at every Node $k$ where $W_k^n < w_k^n + c^n$, using a
decision yielding $W_k^n$ in (15). Increase $n$ to $n + 1$, and return to Step 2 with the
new trial policy.

The policy iteration algorithm converges in a finite number of iterations. 
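Steps 1 through 4 translate almost line for line into code. The following is a minimal sketch, not the book's computation: the inventory data (C(0) = 0 and C(x) = 13 + 2x otherwise, a holding cost of 1 per unit of expected ending inventory, demand 2 or 4 with probability 1/2 each, inventory at most 4, production at most 5) are assumptions made for the example, and every function name is hypothetical.

```python
from fractions import Fraction as F

STATES = range(5)                       # entering inventory 0, 1, ..., 4

def decisions(i):
    # assumed feasibility: cover the largest demand (4), end with at most 4 units
    return [x for x in range(6) if 4 <= i + x <= 6]

def cost(i, x):
    # assumed c_id: production cost plus expected ending inventory (i + x - 3)
    return (13 + 2 * x if x > 0 else 0) + (i + x - 3)

def prob(j, i, x):
    # assumed demand: 2 or 4, each with probability 1/2
    return F(1, 2) * ((j == i + x - 2) + (j == i + x - 4))

def solve(A, b):
    """Gauss-Jordan elimination in exact arithmetic; A must be nonsingular."""
    n = len(A)
    M = [[F(v) for v in row] + [F(r)] for row, r in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [row[n] for row in M]

def evaluate(policy):
    """Step 2, equations (14); unknowns are (w_0, ..., w_T, c)."""
    n = len(STATES)
    A = [[(i == j) - prob(j, i, policy[i]) for j in STATES] + [1] for i in STATES]
    A.append([1] + [0] * n)             # normalization w_0 = 0
    *w, c = solve(A, [cost(i, policy[i]) for i in STATES] + [0])
    return w, c

def policy_iteration():
    policy = [decisions(i)[0] for i in STATES]   # Step 1: smallest feasible production
    rounds = 0
    while True:
        w, c = evaluate(policy)                  # Step 2
        improved = False
        for i in STATES:                         # Step 3: test quantities (15)
            test = {x: sum(prob(j, i, x) * w[j] for j in STATES) + cost(i, x)
                    for x in decisions(i)}
            best = min(test, key=test.get)
            if test[best] < w[i] + c:            # Step 4: revise where W_i < w_i + c
                policy[i], improved = best, True
        if not improved:
            return policy, w, c, rounds
        rounds += 1

policy, w, c, rounds = policy_iteration()
```

Under these assumed data the sketch stops after the evaluation at n = 2 with an expected cost per period of 123/7, about 17.57. Every trial policy it visits has unique stationary probabilities, so the linear system in Step 2 is always nonsingular here.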

In Fig. 18.7 the policy space algorithm is applied to the stochastic inventory 
example in Fig. 18.3. Using in Step 1 the policy of the smallest feasible value 
for production, the algorithm converges at $n = 2$.

Figure 18.7. Stochastic Inventory Example: Policy Iteration.

► Consider again the case of the Dependable Manufacturing Company. The recursion
when $0 \le \alpha < 1$ was exhibited as (2) in Sec. 18.3. If you apply to this recursion the
heuristic derivation used to obtain the extremal equations (8) above, you obtain the
corresponding functional equations

(i)  $w_i + c^* = \min_{x} \big\{ C(x) + 1[i + x - 3] + \tfrac{1}{2}[w_{i+x-2} + w_{i+x-4}] \big\}$.

This recursion can also be developed by another heuristic argument that sheds further
insight into the meaning of the $w_i$ values. Before presenting this line of reasoning, we
record from Fig. 18.7 that the solution to the extremal equations (i) is

(ii)  $c^* = 17\tfrac{4}{7}$
      $w_0 = 0$
      $w_1 = -5\tfrac{3}{7} = -5.43$
      $w_2 = -7\tfrac{3}{7} = -7.43$
      $w_3 = -9\tfrac{3}{7} = -9.43$
      $w_4 = -20\tfrac{2}{7} = -20.29$.

Consider the finite horizon recursion (7) in Sec. 18.1. You would expect that as $n$
gets sufficiently large, the value for $f_n(i)$ would nearly equal $n$ times the expected cost
per period $c^*$ plus a correction to account for the initial condition that entering inventory
is $i$:

(iii)  $f_n(i) \approx n c^* + w_i$,

where here $w_i$ denotes the “correction” factor. Verify that substituting the right-hand
side of (iii) into the recursion (7) in Sec. 18.1, and simplifying terms, also yields the set
of extremal equations (i) above.

Using the normalization $w_0 = 0$, you have from (iii) that

(iv)  $f_n(i) - f_n(0) \approx w_i - w_0 = w_i$.

For $n = 5$, the values found in Fig. 17.2 give

(v)  $f_n(1) - f_n(0) = -5.57 \qquad f_n(3) - f_n(0) = -9.57$
     $f_n(2) - f_n(0) = -7.57 \qquad f_n(4) - f_n(0) = -20.38$,

which already agree closely with (ii). ▲
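The approximation $f_n(i) \approx n c^* + w_i$ can be watched numerically by running a finite-horizon recursion for many periods. The sketch below is illustrative: it assumes the inventory data stated in its comments (they are not copied from Fig. 17.2 or Fig. 18.3), starts from a zero terminal value, and uses hypothetical names throughout.

```python
STATES = range(5)                       # entering inventory 0, 1, ..., 4

def decisions(i):
    # assumed feasibility: cover the largest demand (4), end with at most 4 units
    return [x for x in range(6) if 4 <= i + x <= 6]

def cost(i, x):
    # assumed C(0) = 0, C(x) = 13 + 2x, plus expected ending inventory (i + x - 3)
    return (13 + 2 * x if x > 0 else 0) + (i + x - 3)

f = [0.0] * 5                           # zero cost with no periods remaining
for n in range(200):
    f_prev = f
    # assumed demand is 2 or 4 with probability 1/2 each
    f = [min(cost(i, x) + 0.5 * (f_prev[i + x - 2] + f_prev[i + x - 4])
             for x in decisions(i))
         for i in STATES]

gain = f[0] - f_prev[0]                 # growth per added period, estimates c*
relative = [f[i] - f[0] for i in STATES]   # estimates w_i with w_0 = 0
```

Under these assumptions `gain` settles near 17.57 and `relative` near 0, -5.43, -7.43, -9.43, -20.29, which is the pattern described by (iii) and (iv).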


18.5 LINEAR PROGRAMMING APPROACH 

Numerical solutions to the extremal equations in the preceding two sections can 
also be obtained by means of linear programming formulations. The approach 
is analogous to that used in Sec. 12.6 for the deterministic shortest-route model. 

Consider first the case where $0 \le \alpha < 1$. Then the extremal equations (1)
in Sec. 18.3 imply the linear inequalities

(1)  $y_i - \sum_{j \text{ in network}} p(j \mid i, d)\,\alpha\, y_j \le c_{id}$  for each $i$ and $d$,

where the $y_i$ are unconstrained in sign. You can obtain all the minimal values for
$y_i$ by using the objective function

(2)  maximize $\sum_{j \text{ in network}} r_j\, y_j$,

where each $r_j$ is an arbitrary but strictly positive number. [When $\sum_j r_j = 1$, you
can interpret $r_j$ as the probability that the system is at Node $j$ in the initial period,
and the value of (2) can be interpreted as the associated expected present value.]
The corresponding dual problem is

(3)  minimize $\sum_{i \text{ in network}} \sum_{d \text{ in } D(i)} c_{id}\, x_{id}$

subject to

(4)  $\sum_{d \text{ in } D(j)} x_{jd} - \sum_{i \text{ in network}} \sum_{d \text{ in } D(i)} p(j \mid i, d)\,\alpha\, x_{id} = r_j$  for each $j$ in network

(5)  all $x_{id} \ge 0$.

It can be shown that an optimal basic solution contains one $x_{id} > 0$ for each $i$,
corresponding to an optimal strategy.

Now consider the case $\alpha = 1$, where we impose the same assumptions as we
did for the policy iteration algorithm in Sec. 18.4. (It is possible to formulate modified
linear programming models, which are larger, for cases where these assumptions
are not justified.)

The appropriate primal linear programming model is

(6)  maximize $c$

subject to

(7)  $w_i + c - \sum_{j=0}^{T} p(j \mid i, d)\, w_j \le c_{id}$  for each $i$ and $d$,

where $w_i$ and $c$ are unconstrained in sign.

The corresponding dual problem is

(8)  minimize $\sum_{i \text{ in network}} \sum_{d \text{ in } D(i)} c_{id}\, x_{id}$  (expected cost per period)

subject to

(9)  $\sum_{d \text{ in } D(j)} x_{jd} - \sum_{i \text{ in network}} \sum_{d \text{ in } D(i)} p(j \mid i, d)\, x_{id} = 0$  for each $j$ in network  (conservation of probability)

(10)  $\sum_{i \text{ in network}} \sum_{d \text{ in } D(i)} x_{id} = 1$  (normalization restriction)

(11)  all $x_{id} \ge 0$.

The $x_{id}$ can be interpreted as the joint probability that the system is at Node $i$
and the decision $d$ is taken. Thus the dual model views the optimization as
determining system probabilities for every possible pair $(i, d)$. The restrictions (9)
can be interpreted as conservation of probability equations: the total probability
of leaving Node $j$ equals the total probability of entering Node $j$. The restriction
(10) states that the sum of the joint probabilities must be unity. It can be proved
that there always exists an optimal solution to (8) through (11) such that at most
one $x_{id} > 0$ for each $i$; such a solution corresponds to a policy that yields
a minimum value for $c$ in (6) and (7).
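The joint-probability reading of the dual variables can be made concrete by building the $x_{id}$ that belong to one deterministic stationary policy and checking them against (9), (10), and (11). The sketch below does not solve the linear program; it only verifies a candidate point. The inventory data and the policy (produce 5, 5, 4, 3, 0 at entering inventory 0 through 4) are assumptions for illustration, and all names are hypothetical.

```python
from fractions import Fraction as F

STATES = range(5)

def decisions(i):
    # assumed feasibility: cover the largest demand (4), end with at most 4 units
    return [x for x in range(6) if 4 <= i + x <= 6]

def cost(i, x):
    # assumed C(0) = 0, C(x) = 13 + 2x, plus expected ending inventory (i + x - 3)
    return (13 + 2 * x if x > 0 else 0) + (i + x - 3)

def prob(j, i, x):
    # assumed demand: 2 or 4, each with probability 1/2
    return F(1, 2) * ((j == i + x - 2) + (j == i + x - 4))

def solve(A, b):
    """Gauss-Jordan elimination in exact arithmetic; A must be nonsingular."""
    n = len(A)
    M = [[F(v) for v in row] + [F(r)] for row, r in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [row[n] for row in M]

policy = [5, 5, 4, 3, 0]                # assumed policy, one decision per node

# Stationary probabilities of the chain induced by the policy.
n = len(STATES)
A = [[(i == j) - prob(j, i, policy[i]) for i in STATES] for j in range(n - 1)]
A.append([1] * n)
q = solve(A, [0] * (n - 1) + [1])

# Dual point: x_id = q_i for the chosen decision, 0 for every other decision.
x = {(i, d): (q[i] if d == policy[i] else F(0))
     for i in STATES for d in decisions(i)}

objective = sum(cost(i, d) * x[i, d] for (i, d) in x)                  # (8)
conservation = [sum(x[j, d] for d in decisions(j))
                - sum(prob(j, i, d) * x[i, d] for (i, d) in x)
                for j in STATES]                                        # (9)
total = sum(x.values())                                                 # (10)
production = sum(d * x[i, d] for (i, d) in x)    # expected production per period
```

With these assumed data the point has 14 variables, of which five are positive, and its objective value is 123/7.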

Since the dual model is expressed in terms of probabilities, it provides a
convenient formulation for those situations where you wish to impose
chance-constraints. If such restrictions are added, however, then in general an optimal policy
will indicate a randomized strategy; that is, for a given $i$ there may be more than one $x_{id}$
that is strictly positive in any optimal policy. In such an event,
$x_{id'} \big/ \sum_{d \text{ in } D(i)} x_{id}$
represents the optimal probability for choosing decision $d'$ when the system is at
Node $i$.

Inventory example. The dual model for the stochastic inventory example
(Dependable Manufacturing Company) is shown in Fig. 18.8. The optimal
solution, with the second subscript denoting the production quantity, is

(12)  $x_{05} = \tfrac{2}{14} \quad x_{15} = \tfrac{1}{14} \quad x_{24} = \tfrac{6}{14} \quad x_{33} = \tfrac{1}{14} \quad x_{40} = \tfrac{4}{14}$.

The associated expected cost per period is

(13)  $\tfrac{2}{14}(25) + \tfrac{1}{14}(26) + \tfrac{6}{14}(24) + \tfrac{1}{14}(22) + \tfrac{4}{14}(1) = 17\tfrac{4}{7}$,

as compared to $15\tfrac{4}{5}$ when demands are deterministic ($D = 3$) for every period.

Figure 18.8. Dual Linear Program for Stochastic Inventory Example.

Also in the deterministic case, production is 0 in $\tfrac{2}{5}$ of the periods, and is 5 in $\tfrac{3}{5}$
of the periods. For the stochastic version, the probabilities in (12) indicate that
the production quantity is 0 in $\tfrac{4}{14}$ of the periods, and is 3, 4, and 5 in $\tfrac{1}{14}$, $\tfrac{6}{14}$,
and $\tfrac{3}{14}$ of the periods, respectively. Thus expected production per period is

(14)  $\tfrac{4}{14}(0) + \tfrac{1}{14}(3) + \tfrac{6}{14}(4) + \tfrac{3}{14}(5) = 3$,

in agreement with the assumption that all requirements are filled.

*18.6 COMPUTATIONAL CONSIDERATIONS

Two points must be made in assessing the computational burden associated 
with obtaining numerical solutions to Markov chain dynamic programming 
models. The first pertains to overall computational feasibility, and the second to 
the relative merit of each algorithmic approach given in Secs. 18.2 through 18.5. 

Letting $D_i$ denote the number of distinct decisions that can be made at Node $i$,
where $i = 0, 1, \ldots, T$, the structure of the network model can be viewed roughly
in terms of a rectangular array with dimensions $(\sum_i D_i)$ by $(T + 1)$. Figure 18.3
represents such an array for the inventory model considered in this chapter.
Note these are approximately the dimensions of the linear programming versions
in the preceding section. For even a simple problem, the values $(\sum_i D_i)$ and
$(T + 1)$ can be quite large. For this reason, it is important to find techniques that
exploit the specific form of an optimal policy in particular situations. The example
in the next section clearly illustrates this conclusion. Sometimes you can specify
the Markov chain model to reflect the form of an optimal policy. Often such
formulations slightly increase the number of states but drastically decrease the
number of possible decisions at most states. (See exercises 50 and 51.)

With regard to the relative merit of each algorithmic approach, the remarks
in Chap. 12 are equally applicable here. Specifically, when $\alpha < 1$, value iteration
has the advantage of numerical simplicity, but the disadvantage of nonfinite
convergence. The advantage of policy iteration is its finite convergence, but the
associated computational burden at each iteration is greater since you must solve
a complete system of simultaneous linear equations. (It is a simple matter to
combine the two approaches into a composite algorithm.) A merit of the linear
programming approach is that you can draw on the widespread availability of
sophisticated computer programs. What is more, the simplex method applied
to the dual problems given in Sec. 18.5 is closely related to the method of policy
iteration. Specifically, each iteration of the simplex method corresponds to making
a policy improvement at only a single state rather than at every state where an
improvement is possible.

*18.7 MARKOV CHAIN VERSION OF THE
EQUIPMENT REPLACEMENT MODEL

Recall the stochastic equipment replacement model of Sec. 17.7, where 

k = planned replacement interval 
$p_j$ = probability that the equipment breaks down
for the first time during the $j$th period of usage
$r_j$ = cost of operating the equipment during the $j$th period of usage
if the equipment does not break down
= penalty cost of operating the equipment if it does break down 
during the $j$th period of usage when $j < k$.

If at a regeneration Period $t$ the planned replacement is $k$ but the machine breaks
down at the end of Period $t + j$, for $j < k$, then the equipment is replaced at the
start of Period $t + j + 1$.

For the discount factor $\alpha = 1$, an optimal $k$ was characterized as the solution
to 

where is the expected return and E[j\k'\ is the expected number of periods 
each piece of equipment is used, given that the planned replacement interval is 
k. The development of this simple solution exploits the fact that an optimal policy 
is of the regeneration type. 

Now you will see how this same problem can be formulated and solved by 
means of Markov chain analysis. The discussion serves a twofold purpose. First, 
it demonstrates how the stochastic equipment model is a special case of the model 
in Sec. 18.4. Second, it makes apparent that a naive Markov formulation is
cumbersome as compared to an analysis that exploits the form of an optimal solution.
[See exercise 50, part (f), for an alternative Markov formulation.] 

Let Node $j$ of the system denote the age of the equipment at the beginning
of the period, immediately after the decision to keep or replace the old equipment
has been made. Thus $j = 0$ designates that a new piece of equipment was
purchased, and $j = 1, 2, \ldots$, indicates that the old equipment has been retained.
Let

$d_1$ = decision to replace equipment at the beginning of the next period
$d_2$ = decision not to replace the equipment at the beginning of the next period
unless it breaks down in the current period.

Then, for all i, 

i &(01 «,</,) = 1 

^(01 J, d,) ^(i + 11 i, d,) = l- Pio I i, d,) ; 

= r, and = r, + />(0 [ i, 

Notice the value $p(0 \mid i, d_2)$ represents the probability that a new piece of
equipment is purchased because the old piece fails before the planned replacement
period.

The Markov chain formulation for the example in Fig. 17.6 (p. 717) is displayed
in Fig. 18.9. The optimal policy is

(5)  $d_1$ for $i \ge 2$ and $d_2$ otherwise.

Verify that the extremal equations (8) and (9) in Sec. 18.4 are satisfied by


( 2 ) 


( 3 ) 

also, 

( 4 ) 







Figure 18.9. Markov Formulation of Replacement Model.

-— W 2 = —30 Wi = —30 Wi = 6 c* = 50


in accordance with the policy in (5), and the associated stationary probabilities
are

(7)  $q_0^* = \tfrac{4}{10} \qquad q_1^* = \tfrac{3}{10} \qquad q_2^* = \tfrac{3}{10}$.

Since $\tfrac{4}{10}$ of the time the system is at Node 0, that is, a new piece of equipment has
just been purchased, the expected interval between successive purchases is
$[1/\tfrac{4}{10} = 2\tfrac{1}{2}]$ periods, in agreement with $E[j \mid 3]$ in Fig. 17.6 (p. 717).
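The relation between the stationary probability of Node 0 and the expected interval between purchases can be checked directly. The sketch below uses a hypothetical three-node age chain (a new machine fails in its first period with probability 1/4, never in its second, and is replaced after its third); these numbers are assumptions for illustration and are not taken from Fig. 17.6. It computes the stationary probabilities from the balance equations and, separately, the expected return time to Node 0 by first-step analysis.

```python
from fractions import Fraction as F

def solve(A, b):
    """Gauss-Jordan elimination in exact arithmetic; A must be nonsingular."""
    n = len(A)
    M = [[F(v) for v in row] + [F(r)] for row, r in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [row[n] for row in M]

# Hypothetical age chain: row i gives the probabilities of the next node.
P = [[F(1, 4), F(3, 4), F(0)],      # age 0: fails with probability 1/4, else ages
     [F(0),    F(0),    F(1)],      # age 1: always survives to age 2
     [F(1),    F(0),    F(0)]]      # age 2: planned replacement next period
n = len(P)

# Stationary probabilities, with the normalization replacing one balance equation.
A = [[(i == j) - P[i][j] for i in range(n)] for j in range(n - 1)]
A.append([1] * n)
q = solve(A, [0] * (n - 1) + [1])

# First-step analysis: h_i = expected periods to reach Node 0 from node i > 0.
Q = [[P[i][j] for j in range(1, n)] for i in range(1, n)]
h = solve([[(i == j) - Q[i][j] for j in range(n - 1)] for i in range(n - 1)],
          [1] * (n - 1))
interval = 1 + sum(P[0][j + 1] * h[j] for j in range(n - 1))
```

Both routes give the same answer: the expected interval between purchases equals the reciprocal of the stationary probability of Node 0.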

A linear programming or policy space approximation approach to the solution
of this example involves computations with a system of size (9 × 5), as indicated
by Fig. 18.9. Such techniques applied to this “naive” formulation are inefficient
as compared to the use of the simple optimization in (1).


REVIEW EXERCISES 


1 Consider the stochastic shortest-route model in Sec. 18.2. 

(a) Suppose </) denotes the actual return in the current period from making 

decision d at Node i, when as a consequence, the state of the system at the next 
period is Node j. Perform the intermediate algebra justifying the recursion (6),
where $c_{id}$ is calculated as in (4).

(b) Justify the validity of the recursion (6), and compare (6) with the deterministic
version (1). Show where the acyclic assumption is exploited in writing the
recursion (6).

(c) Explain how finding a numerical solution to (6) provides an optimal decision rule.
State how to implement such a rule.

2 Consider the production and inventory example in Sec. 18.2.

(a) Suppose the production constraint is a maximum of 6 items, instead of 5. How
does this alteration affect the network characterization of the model?

(b) Suppose the ending inventory constraint is a maximum of 5 items, instead of 4.
How would you number the nodes in the equivalent network formulation?

(c) Verify that the only positive transition probabilities are those given in (8). Also
verify that (8) and (10) satisfy the acyclic assumption (5).

(d) Explain the equality in (11). Is expected value? Explain. 

(e) Draw a network characterization, analogous to Fig. 18.1, for the policy: produce
3 if i = 0, 1, 2, 3, and produce 0 otherwise, for n > 2.

*3 Consider the linear programming version of the stochastic shortest-route model, as
described in the special material at the end of Sec. 18.2.

(a) Perform the intermediate algebraic steps leading from the recursion (6) to the
constraints (ii).

(b) Verify that (iii) through (vi) is the dual problem to (i) and (ii).

(c) Write the primal and dual linear programs for T = 4 and d = 1, 2, 3, analogous
to the tables in Fig. 18.2.

(d) What effect does the acyclic assumption have on the appearance of the primal
and dual problems, such as shown in Fig. 18.2?

(e) Suppose $\alpha = 1$. Interpret all the linear constraints in the dual problem.

(f) How many variables and constraints are in the primal and dual problems for the
inventory and production example? Assume the horizon is three periods, as in
Fig. 18.1.

(g) Answer the question in part (f) assuming that the production constraint is a
maximum of six items, instead of five.

(h) Answer the question in part (f) assuming that the ending inventory constraint is
a maximum of five items, instead of four.

(i) Consider the illustration in Fig. 18.2 and let $\alpha = 1$. Show in the dual problem
how to constrain the probability that the system enters Node 2 to be at least { 
but no more than |. Show in the dual problem how to ensure that if the system 
enters Node 2, then d — \ is chosen with no greater than a 3 probability. 

4 Optimal Batch Size Model (Sec. 17.5). Assume you require three working items, and
that no more than five items can be ordered at a time from the vendor.

(a) Describe how this problem can be characterized as a stochastic shortest-route
problem. Be sure you indicate the definition and number of nodes in the network.
Explain the connections between this characterization and the recursion (2) in
Sec. 17.5.

(b) Draw the network illustrating the policy of always ordering two more items than
are required. (Hint: use a “dummy” node to represent the event that there are
enough working items in the order received to fill the requirement.)

(c) Write explicitly the formula for each expected return $c_{id}$.

(d) Write explicitly the conditional probabilities p(j | i, d). (Hint: consider and interpret 
the probabilities p_x(j) for j = 0, 1, . . ., x − 1.) 

(e) What difficulty arises in formulating a stochastic shortest-route model based on 
the recursion (1) in Sec. 18.2? 
5 Show that if, in fact, the model characterized by the extremal equations (1) in Sec. 
18.3 is really deterministic, then (1) simplifies to the corresponding recursion in 
Chap. 12. 

6 Each part below indicates a change in assumption for the stochastic inventory model 
in Sec. 18.3. For each change, explain in detail the alterations in Fig. 18.3. 

(a) The holding cost function is l'(i -f ^ where D designates the demand 

level. 

(b) The probability that demand is 2, 3, or 4 is 1/3 for each value. 

(c) Maximum production is 6, where C(6) = 25. 

(d) Maximum entering inventory is 5, instead of 4. 

(e) Assume maximum entering inventory is 5, instead of 4, and that the policy in (6) 
is used with x = 0 if i = 5. Write the array of transition probabilities analogous 
to (7). Do the same for the policy of producing 3 if i = 0, 1, 2, 3 and producing 
0 otherwise. 

7 Consider the stochastic inventory model, as described in Sec. 18.3. 

(a) Perform the value iteration algorithm and verify the entries in Fig. 18.4 for n = 1 
and n = 2. Do the same for n = 3. 

(b) Write the equations (9) for the optimal policy over an unbounded horizon with 
α = .9, and verify the minimal values given in Fig. 18.4. 

(c) Explain why, for i = 1, 2, 3, the values ofjf” and ofj'i differ by the amount 2 in 
Figs. 18.4 and 18.5. 

(d) Calculate the present values associated with using the policy shown for n = 1 
in Fig. 18.5 over an unbounded horizon. 

(e) Give a plausible explanation for why the amounts Xo(i) decrease as α gets smaller, 
as shown in Fig. 18.6. 

(f) Write the equations (9) for the optimal policy over an unbounded horizon with 
α = .5, and verify the minimal values given in Fig. 18.6. 

(g) Suppose you have applied the policy iteration algorithm and at n = 2, the values 
of^l equal the optimal in Fig. 18.4. Perform the test in (13) to verify that the 
iterations terminate. 

*8 Consider employing the value iteration algorithm, as described by (8) in Sec. 18.3. 
Suppose you let the initial values be obtained by selecting a trial stationary policy 
and solving (9). The text states that if you stop at iteration m and employ the immediate 
optimal policy from (8) as a stationary policy for the unbounded horizon model, the 
corresponding expected values will be no larger than the jJ* in (8). Show how this 
conclusion follows from the argument sketched in the advanced material justifying 
the Stationary Policy Theorem. 

9 (a) Compare the value iteration algorithm for the stochastic shortest-route problem 
in Sec. 18.3 with that for the deterministic problem in Chap. 12. In what ways 
are the algorithms alike? In what ways are they different? Compare the 
computational burdens and convergence properties. 

(b) Answer part (a) for the policy iteration algorithm. 

10 Give a verbal interpretation of the definition of an optimal policy for α = 1 expressed 
by the equality in (1) of Sec. 18.4. (Hint: remember that the quantity depending on α in (1) 
is calculated for a specified policy.) Recall from Chap. 12 that when α = 1, several policies may give 
the same average return per period over an unbounded horizon, but may differ in their 
behavior “en route.” (For example, the two streams 100, 1, 1, 1, . . . and 10, 1, 1, 1, 
. . . both have an average return of 1 per period.) How does the definition in (1) treat 
such transient effects? 
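
The parenthetical example in exercise 10 can be checked numerically. The sketch below is only an illustration: the function name `running_average` is an assumption of this note, not notation from the text. It computes the average return per period over the first n periods for the two streams. Both averages drift toward 1 as n grows, even though the two totals always differ by the same constant (90), which is the kind of transient effect the exercise asks about.

```python
def running_average(first, rest, n):
    """Average return per period over n periods for the stream
    first, rest, rest, rest, ... (hypothetical helper for illustration)."""
    total = first + rest * (n - 1)
    return total / n


if __name__ == "__main__":
    for n in (10, 1000, 100000):
        a = running_average(100, 1, n)   # stream 100, 1, 1, 1, ...
        b = running_average(10, 1, n)    # stream 10, 1, 1, 1, ...
        # The averages converge to the same limit; the totals never do.
        print(n, round(a, 5), round(b, 5), "difference in totals:", 100 - 10)
```

An average-return criterion alone therefore cannot distinguish the two streams; any preference between them has to come from how the definition handles the initial terms.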


11 In each part below, determine whether there exist stationary probabilities, and if so, 
calculate them. 

*(a) Cases 2 and 3 in Sec. 18.4. 

*(b) Case 4 in Sec. 18.4. 

♦(c) Case ^in Sec. 18.4. 


(d) through (u) [Transition probability matrices; the entries are not recoverable from the source.] 
*12 Formulas (iv) and (v) in Sec. 18.4 show how to calculate the probability that 
the system is in State j at Period n, given probabilities for the state of the system at 
the initial period. In each part below, calculate these probabilities for n = 2, 3, 4, assuming 
that the initial probability of the first state equals 1 (the system starts in the first state). Consider the 
problems in exercise 11. 


(a) Part (a). 

(b) Part (b). 

(c) Part (d). 

(d) Part (e). 

(e) Part (j). 

(f) Part (l). 

(g) Part (m). 

(h) Part (n). 

(i) Part (q). 

(j) Part (r). 

(k) Part (s). 

(l) Part (t). 

(m) Part (u). 

(n) Answer questions (a)-(m) assuming = 1. 

(o) Name the problems above where the probability of being in State j at Period n 
does not approach a limit as n becomes arbitrarily large. 
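
The calculation requested in exercise 12 can be sketched in a few lines. The sketch assumes the standard update for a Markov chain, in which the row vector of state probabilities for the next period is the current vector multiplied by the transition matrix; formulas (iv) and (v) of Sec. 18.4 are not reproduced here, so treating them as this update is an assumption. It also assumes the initial period is n = 1. The two matrices are hypothetical and are not taken from exercise 11, and the names `step` and `state_probabilities` are illustrative.

```python
def step(q, P):
    """One period forward: new_q[j] = sum over i of q[i] * P[i][j]."""
    n = len(P)
    return [sum(q[i] * P[i][j] for i in range(n)) for j in range(n)]


def state_probabilities(q1, P, last_period):
    """Return {n: probabilities at Period n} for n = 1..last_period,
    where q1 holds the probabilities at the (assumed) initial Period 1."""
    history = {1: list(q1)}
    q = list(q1)
    for n in range(2, last_period + 1):
        q = step(q, P)
        history[n] = q
    return history


# Hypothetical chains, chosen only to contrast the two behaviours.
cyclic = [[0.0, 1.0], [1.0, 0.0]]      # alternates between the states forever
mixing = [[0.5, 0.5], [0.25, 0.75]]    # probabilities settle toward a limit

if __name__ == "__main__":
    for name, P in (("cyclic", cyclic), ("mixing", mixing)):
        hist = state_probabilities([1.0, 0.0], P, 4)
        for n in (2, 3, 4):
            print(name, n, hist[n])
```

For the cyclic matrix the probabilities keep alternating and never approach a limit, which is the situation part (o) asks you to identify.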


13 (a) Consider the extremal equations (8) in Sec. 18.4. Show the intermediate algebraic 
steps that justify going from (7) to (8). 

(b) Consider the heuristic argument establishing (8). Explain where the reasoning 
uses the assumption that there exists an optimal stationary strategy having 
stationary probabilities uniquely characterized by (3). 

(c) Verify that if the w_i satisfy (8), then so do (w_i + K) for an arbitrary constant K. 

(d) Verify the algebraic steps in (11) and use (3) to verify that (11) simplifies to (5). 

14 (a) Consider the Necessary Condition for Optimality, given in Sec. 18.4. Explain 
why parts (a) and (b) are equivalent. Why is the condition termed “Necessary”? 

(b) Consider the Sufficient Condition for Minimum Expected Return, given in Sec. 
18.4. Explain why parts (a) and (b) are equivalent. Why is the condition termed 
“Sufficient”? 


15 Compare the policy iteration algorithm for α = 1 in Sec. 18.4 with that for the 
deterministic problem in Chap. 12. In what ways are the algorithms alike? In what ways 
are they different? Compare the computational burdens and convergence properties. 

16 (a) In Fig. 18.7 the policy space algorithm is applied to the stochastic inventory 
example in Fig. 18.3. Verify the entries. 

(b) Apply the algorithm starting in Step 1 with the policy of producing 3 if i = 0, 1, 2, 3 
and producing 0 otherwise. 

*17 Consider the case of the Dependable Manufacturing Company in the special material 
at the end of Sec. 18.5. 

(a) Show how to derive (i) from the recursion (2) in Sec. 18.3. 

(b) Verify that the values in (ii) satisfy the extremal equations (i). 

(c) Verify that substituting the right-hand side of (iii) into the recursion (7) in Sec. 
18.1, and simplifying terms, also yields the set of extremal equations (i). 

*18 Consider the canonical form for discrete dynamic programming models given by (i) 
in the special material at the start of Sec. 18.3. 

(a) Apply the same heuristic reasoning given in Sec. 18.5 to derive a functional 
equation for α = 1. Use the notation w(s) and c*. 

(b) Similarly, write the finite horizon version, using the notation f_n(s). Then let 
f_n(s) ≈ n c* + w(s) and derive a functional equation for α = 1 analogous to the 
approach suggested at the end of Sec. 18.4. 


19 Consider the primal and dual linear programming models in Sec. 18.5. 

(a) Write (1) through (5) in detail for a network containing Nodes 0, 1, 2, 3 and 
having two decisions d = 1 and d = 2 at each node. 

*(b) Given that each r_j is a strictly positive number, devise an argument for why a 
basic solution to the dual model (3), (4), and (5) contains one positive variable for each 
Node i. 

(c) Repeat part (a) for (6) through (11). 

(d) In the dual problem of part (c), show how to constrain the probability that the 
system enters Node 2 to be at least J but no more than |. Also show how to ensure 
that when the system enters Node 2, then d = 1 is chosen with no greater than 
j probability. 

20 (a) Verify that the solution (12) for the inventory example in Sec. 18.5 is feasible and 
optimal in the dual linear program shown in Fig. 18.8. Translate the solution in 
(12) into a decision rule. 

(b) What are the coefficients in Fig. 18.8 if the probability that demand is 2, 3, or 4 
is 1/3 for each value? 

*(c) Find an optimal policy for the demand assumptions in part (b). 


21 Consider the primal problem (1) and (2) in Sec. 18.3. Suppose there are two nodes 
(i = 0, 1) and three decisions (d = 1, 2, 3) at each node. Assume the probabilities 
p(j | i, d) are 


^(0|0,1)=-L ^(010,2)=^ 
^( 110 , 1 ) =4 ^( I | 0 , 2)=4 
^(0|l,l)=y ^(0|l,2)=i- 

^( 1 | 1 , 2)=4 


/,( 010 , 3 ) =4 
^( 1 | 0 , 3 ) =4 

^(0|1.3)=|- 
^(111, 3) = 4 . 


and the costs are 


c_01 = 24    c_02 = 60    c_03 = 48 

c_11 = 36    c_12 = 24    c_13 = 45. 

Also assume o' = 4 . Write the equations in detail. Construct a solution space diagram 
and show that the optimal solution is obtained for any positive r_0 and r_1 used in 
the objective function, which is maximize 


*22 Consider the stochastic equipment replacement model in Sec. 18.7. 

(a) Justify the transition probabilities given by (2) and (3), and the expected costs 
given by (4). 

(b) Verify that the optimal policy is that stated in (5). Use the extremal equations 
in (8) and (9) of Sec. 18.4 and the values given in (6). 


23 Explain your understanding of the following terms: 


Markovian decision process 
stationary strategy (policy) 
deterministic policy 
stochastic shortest-route model 
Markov property 
extremal equations 
value iteration algorithm (successive 
approximations in function space) 
value determination routine 
policy iteration algorithm (successive 
approximations in policy space) 
law of motion for a system 


Markov chain 
stationary probabilities 
limiting probabilities 
statistical equilibrium 
operating characteristics 
*transient correction 
*doubly stochastic 
*transient state 
*cyclic chain 
*multiple chain 

conservation of probability restrictions 
normalization restriction. 


FORMULATION AND COMPUTATIONAL EXERCISES 

In many of the following exercises, you are asked to formulate models in terms 
of dynamic programming recursions. Be sure to define all the symbols you use. 

24 For the exercises in Chap. 17 listed below, show how to formulate the problem as a 
stochastic shortest-route model, analogous to the formulation used for the inventory 
model at the end of Sec. 18.2. 

(a) Exercise 23. 

(b) Exercise 38, part (b). 

(c) Exercise 39. 

(d) Exercise 41, part (a). 

(e) Exercise 42, part (a). 

(f) Exercise 42, part (b). 

(g) Exercise 47, part (a). 

(h) Exercise 47, part (b). 

(i) Exercise 49, part (a). 

(j) Exercise 51, part (a). 

(k) Exercise 52, part (a). 

(l) Exercise 53, part (a). 

(m) Exercise 56. 

25 Consider the extremal equations (1) in Sec. 18.3. Assume you have available a lower 
estimate L_i and an upper estimate U_i on each unknown y_i, that is, L_i ≤ y_i ≤ U_i. Show 
that it is never optimal to use decision d' in D(i) if 

    \sum_{j} p(j \mid i, d')\, \alpha L_j + c_{id'} \;>\; \min_{d \in D(i)} \Big[ \sum_{j} p(j \mid i, d)\, \alpha U_j + c_{id} \Big], 

where each sum is taken over all Nodes j in the network. 

Suggest how you might use the above inequality to speed convergence in successive 
approximation algorithms. 

26 Consider the inventory model with uncertain production costs in exercise 38, part (b), 
of Chap. 17. Let α = 1 and find an optimal policy over an unbounded horizon. 
27 Consider the stochastic inventory example in Fig. 18.3. Suppose that the production 
limit is raised to 6 and that C(6) = 28.5; assume α = 1. 

(a) Use the policy space algorithm to find an optimal policy. (Hint: examine Figs. 
8.17 and 8.18 to select an initial trial policy.) 

*(b) As was shown in the special material at the end of Sec. 18.4, f_n(i) − f_n(0) ≈ w_i. 
Examine how close this approximation is for each i and n = 3, 4, 5, using the 
values of w_i from part (a) and f_n(i) from Fig. 8.17. 

28 Consider the stochastic inventory model with smoothing costs in exercise 40, part (b), 
of Chap. 17. Let α = 1 and find an optimal policy over an unbounded horizon; use 
the policy iteration algorithm. 

29 Consider the inventory example in Sec. 18.3. Suppose the demand distribution 
alternates from period to period: 

P[D = 2] = Y P[£) = 4] = y (in Period t) 

P[D = 1] = -1 />[£) = 4] = |- (in Period t + 1). 

(a) Show how to alter the recursion (2) accordingly. 

*(b) Let α = 1 and determine the appropriate extremal equations. 

*(c) Find an optimal policy using the formulation in part (b). 

30 In each part below, find an optimal inventory stockage policy over an unbounded 
horizon with α = 1; use the policy iteration algorithm. 

(a) Exercise 41, part (a), of Chap. 17. 

(b) Exercise 41, part (b), of Chap. 17. 

31 Consider the stochastic inventory model described in Sec. 18.2. Suppose that a demand 
need not occur in every period. Specifically, assume that if demand occurs in Period t, 
the next demand occurs in Period t + s, where s = 1, 2, 3, and the associated 
probability is 

(a) Alter the recursion (7) accordingly. (Note that if inventory at the end of Period t 
is j and the next demand occurs in Period t + s, then the holding cost on the j 
items is incurred s times.) 

(b) Suppose the horizon is unbounded and that the one-period discount factor is α, 
where 0 ≤ α < 1. Show how to alter the extremal equations (2) in Sec. 18.3 
accordingly. 

*(c) Suppose the horizon is unbounded. Derive the appropriate extremal equations 
for α = 1. 

*(d) Using your answer in part (c), find an optimal policy. 

32 Sally Forth, a campus politico, enjoys spending her Saturday nights attending a 
Cause meeting in The Big City. Feeling the need for a male escort, she asks, each 
week, one of her three regular dates, whom she affectionately refers to as The Beard, 
The Haircut, and The Square. To keep the scenario simple, assume she asks only one 
each week, and if he turns her down, then she goes unescorted. Suppose the following 
conditional probabilities represent the chance that each of her dates accepts the 
invitation, given the person she dated on the previous week: 

[Table: rows are Last Week’s Date (No Escort, The Beard, The Haircut, The Square); columns are Probability of Acceptance by The Beard, The Haircut, and The Square. The fractional entries are not recoverable from the source.] 


(a) Suppose there are n more weeks until summer vacation. Formulate a dynamic 
programming model to provide an asking policy that maximizes her expected 
number of escorts. 

(b) Suppose Sally’s horizon is unlimited, but she has a discount factor α = [illegible]. Write 
the appropriate extremal equations. Given the data and these assumptions, find 
an optimal policy. 

*(c) As in part (b), suppose Sally has an unending future, but wants to maximize the 
“expected number of escorts per week.” Derive the appropriate extremal 
equations and find an optimal solution. 

33 The manager of Maisie’s Department Store has to decide how much advertising copy 
to place in the local Sunday newspaper; in particular, he can choose either light (L) 
or heavy (H) coverage. He classifies weekly sales into three categories: average (A), 
above average (AA), and below average (BA), and believes that the current week’s 
sales depend probabilistically on both the previous week’s sales and the advertising 
level. Specifically, his data are 


    Last Week’s      Light Coverage (L)          Heavy Coverage (H) 
    Sales            This Week’s Sales:          This Week’s Sales: 
                     AA      A       BA          AA      A       BA 

    AA               .2      .5      .3          .6      .3      .1 
    A                0       .6      .4          .4      .5      .1 
    BA               0       .3      .7          .2      .7      .1 

Thus, for example, if last week’s sales were A and his advertising is H, then there is a 
.4 probability that this week’s sales are AA, .5 probability that they are A, and .1 
probability that they are BA. Assume that advertising coverage L costs 100 and H 
costs 300, and suppose that the weekly return (excluding advertising costs) from sales 
AA is 1200, A is 1000, and BA is 800. 

(a) Use the policy iteration algorithm to find an advertising strategy that maximizes 
weekly net return (including advertising costs) over an unbounded horizon. 

(b) Over what range of cost for coverage L does the policy in part (a) remain optimal? 
For coverage H? 

(c) Over what range of weekly return from sales AA does the policy in part (a) remain 
optimal? From sales A? From sales BA? 

*(d) Let p_1 denote the probability that this week’s sales are AA and p_2 that they are A, 
given that last week’s sales are A and that there is heavy coverage, where p_1 + p_2 
= .9. What is the range for p_1 such that the policy in part (a) remains optimal? 

(e) Formulate the primal and dual linear programming models, and find an optimal 
solution to the dual problem. Interpret your solution. 
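
One way to organize the computation in part (a) of exercise 33 is sketched below. It is a minimal illustration of policy iteration for the average-return criterion (α = 1), not the text's own worked solution. Several choices are assumptions of this sketch: the one-week return is taken to be the expected return from this week's sales minus this week's advertising cost; the relative value of state BA is fixed at zero in the value determination step; and all names (`reward`, `solve`, `evaluate`, `policy_iteration`) are hypothetical.

```python
from itertools import product

STATES = ["AA", "A", "BA"]
SALES_RETURN = {"AA": 1200.0, "A": 1000.0, "BA": 800.0}
AD_COST = {"L": 100.0, "H": 300.0}
# P[coverage][last week's sales] = probabilities of this week's sales (AA, A, BA)
P = {
    "L": {"AA": [0.2, 0.5, 0.3], "A": [0.0, 0.6, 0.4], "BA": [0.0, 0.3, 0.7]},
    "H": {"AA": [0.6, 0.3, 0.1], "A": [0.4, 0.5, 0.1], "BA": [0.2, 0.7, 0.1]},
}


def reward(s, a):
    """Expected net return this week (assumed: sales return minus ad cost)."""
    return sum(p * SALES_RETURN[t] for p, t in zip(P[a][s], STATES)) - AD_COST[a]


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [x - f * y for x, y in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def evaluate(policy):
    """Value determination: g + w_i - sum_j p_ij w_j = r_i, with w_BA = 0."""
    n = len(STATES)
    A, b = [], []
    for i, s in enumerate(STATES):
        a = policy[s]
        # Unknowns are g, w_AA, w_A; the last relative value is fixed at 0.
        A.append([1.0] + [(1.0 if i == j else 0.0) - P[a][s][j] for j in range(n - 1)])
        b.append(reward(s, a))
    x = solve(A, b)
    return x[0], x[1:] + [0.0]


def policy_iteration(policy):
    """Alternate value determination and policy improvement until stable."""
    while True:
        g, w = evaluate(policy)
        new = {}
        for s in STATES:
            def test(a):
                return reward(s, a) + sum(p * wj for p, wj in zip(P[a][s], w))
            best = max(AD_COST, key=test)
            # Keep the current action on ties so the iteration cannot cycle.
            new[s] = best if test(best) > test(policy[s]) + 1e-9 else policy[s]
        if new == policy:
            return policy, g
        policy = new


if __name__ == "__main__":
    policy, g = policy_iteration({s: "L" for s in STATES})
    print("policy:", policy, "average net return per week:", round(g, 2))
```

Because there are only eight stationary policies here, the result can be cross-checked by calling `evaluate` on each of them (for example with `itertools.product("LH", repeat=3)`) and comparing the returned averages.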

34 A large soap company, renowned for successful research and advertising promotions of 
its products, has introduced a new wonder detergent called “LYE.” The product 
manager, in conjunction with the advertising department, is investigating a special 
sales campaign built on the slogan “Come LYE with us.” Like all of the company’s 
products, the first six months sales will start at a high level. The manager thinks there 
is a .8 chance that the sales will remain at a high level at the beginning of the next six 
months if he uses this LYE campaign. He feels that if he does not use the special 
advertising, the probability is only .5 that the sales will remain high at the start of the 
next six months. If the sales drop to an average level, then the product manager has 
two options. He can initiate research to improve the product. This has a .7 chance of 
returning sales to a high level by the beginning of the subsequent six months. 
Alternatively, he can keep the product “as is.” Then there is a .6 chance that it remains at 
the average sales level at the beginning of the subsequent six months; but consumers 
being changeable, the probability is .4 that sales will pick up again to the high level. 

When the sales start out high and ordinary advertising is used, the six-month returns 
are 19 if sales stay at a high level and are 13 if they don’t; if a special advertising 
campaign is used, then the analogous returns are 4.5 and 2. When starting sales are 
average and research is initiated, then the returns are 11 if the sales rise to high level 
and are −9 if they don’t; if the product remains “as is,” the analogous returns are 13 
and 3. Assume the same decision problem concerning the marketing of LYE repeats 
itself every six months over an unbounded horizon. 

(a) Formulate the appropriate dynamic programming extremal equations for the 
cases 0 ≤ α < 1 and α = 1. (Note: the sense of optimization is to maximize.) 

(b) Write the associated primal and dual linear programming models given your 
formulation in part (a). 

(c) Solve the case α = 1 by the value iteration algorithm. 

35 Cash Management Problem. Suppose in the example in exercise 55, Chap. 17, that the 
horizon is unbounded and that the probability distribution of the daily fluctuation is 
stationary. Write the appropriate extremal equations for the cases 0 ≤ α < 1 and 
α = 1. 

36 Tiny Timber Company (Sec. 11.3). Suppose the probability is 1 − p that at the end of 
each period the entire forest burns down. 

(a) Derive the appropriate optimizing formulation for the single decision problem. 
Assume a burned-down forest has no value, and that a forest fire cannot destroy 
harvested trees. 

(b) Repeat the analysis for the unbounded horizon problem, where you assume that 
if the forest burns down prior to the planned harvest period, then the forest is 
replanted in the next period. Exhibit formulas that determine the optimal planned 
harvest interval k for the cases 0 ≤ α < 1 and α = 1. 
37 The Kitch-Kitchy Water System consists of a dam and a subsidiary reservoir. The 
dam has capacity K. Each period the inflow to the dam is S, where the actual amount S 
in Period t has an associated probability distribution p_t(S). If the water in the dam 
plus S exceeds K, the excess is spillover. Let D_t be the demand for water in Period t; 
this demand can be satisfied from either the dam or the reservoir, but not from any 
spillover. (For simplicity, assume that we refer to the supply of water in the dam 
before the demand for water must be filled.) Denote the cost of a unit of water from 
the dam as c^i and from the reservoir as in Period t, where For all practical 

purposes, you can consider the reservoir to have an “unlimited” supply of water so 
that there always exists a feasible policy for meeting demands. 

(a) Formulate a dynamic programming model to find a strategy for determining the 
amounts of water to draw from the dam and from the reservoir in order to 
minimize expected discounted cost over a T-period horizon. (Let α denote the 
one-period discount factor, where 0 ≤ α < 1.) 

(b) Show how to alter the formulation in part (a) if the demand level D_t is 
characterized by a probability distribution q_t(D_t). 

(c) Show how to alter the formulation in parts (a) and (b) if the probability 
distribution for S_t depends on S_{t−1}, denoted by p_t(S | S_{t−1}). If the probability distribution 
for S_t depends on S_1 + S_2 + · · · + S_{t−1}, denoted by p_t(S | S_1 + S_2 + · · · + S_{t−1}). 

(d) Discuss whether the decision problem is made simpler or more complex if, in 
parts (a) and (b), all the probability distributions and costs are stationary over 
time and the horizon is unbounded. 

(e) Suppose that any excess water spilling over the dam is worth v_t per unit in Period t. 
(For example, spillover may be used for irrigation, or routed to another reservoir.) 
Show how to alter the formulation in part (a) accordingly. 

38 Kleen City Police Problem (exercises 30, Chap. 2, and 27, Chap. 5). Let p_t(r_t) be the 
probability distribution of the minimal requirement in Period t; assume that the exact 
value of r_t is known before making the decision x_t of the number of men to start on 
duty. 

(a) Assume there is a one-period discount factor α, where 0 ≤ α < 1, and the horizon 
is unbounded. Formulate an appropriate dynamic programming model. 

(b) Show how the formulation in part (a) simplifies (if at all) when all the probability 
distributions are identical, p_t(r) = p(r). 

*(c) Derive the extremal equations to be satisfied in parts (a) and (b) when α = 1. 

39 The Kleen City Hospital has a well-designed procedure for the regular staffing of its 
emergency ward with a sufficient number of interns. But occasionally, there is a 
greater than normal influx of patients so that the regularly assigned staff has to be 
augmented with additional interns. The hospital wants a rational policy for determin¬ 
ing when to call in supplemental staff and for deciding how many additional interns to 
call. Assume that each additional intern is assigned a cost factor c per period (15 
minutes). Let w denote the assigned cost factor of a patient waiting to be treated 
(assessed at the beginning of each period). The state of the system is the number of 
patients waiting to be treated; given this state, the hospital decides the number of 
extra interns s it needs. Assume that p(j | i, s) is the probability that there are j patients 
waiting at the beginning of the next period, given that i patients are waiting at the 
beginning of the current period and s extra interns are placed on duty. Past experience 
has shown that there are never more than 10 patients waiting. 

(a) Formulate a set of extremal equations to yield a policy that minimizes the expected 
cost per period. 

(b) Comment on the difficulties attendant with estimating the model’s parameters. 

(c) Explain how the formulation in part (a) must be altered if there is a cost Jf) 

due to changing the number of extra interns from Period t — 1 to Period t. 

40 Captain Ron, operator of the Crocker Ship Company, runs a charter tanker between 
two ports, A and B. Assume that it takes the tanker a day to sail between these ports. 
Each morning the Captain must decide whether to load the tanker with the freight 
available and sail to the other port, or to wait in port another day hoping that the 
freight to be moved will be more profitable. Let the daily cost of sailing the ship be c_1 
and the daily cost of leaving the ship at anchor be c_2, where c_1 > c_2. Assume there are 
two types of cargo shipments that are loaded at Port A, having values a_1 and a_2. 
Suppose a_1 > a_2 and that p_A is the probability that a_1 is available (so that 1 − p_A is 
the probability that only a_2 is available). Also suppose that the cargo available each 
day is independent of the availabilities on previous days (hence if the Captain does not 
sail on one day, he still has probability p_A of obtaining a_1 on the next day). Analogously, 
let the values of the cargos at Port B be b_1 and b_2, where b_1 > b_2, and p_B be the 
probability of b_1. 

(a) Formulate a dynamic programming model for an unbounded horizon. Treat 
both cases of 0 ≤ α < 1 and α = 1. 

(b) Write the primal and dual linear programs corresponding to your formulations in 
part (a). 

*(c) Discuss how to alter your formulations in part (a) if the sailing times between the 
ports are three days. If the sailing time from Port A to Port B is three days and from 
Port B to Port A is two days. 

*(d) Discuss how to alter your formulation in part (a) if each sailing time t is a random 
variable having probability distribution p(t). 

41 At the start of each day, an expensive piece of machinery is examined to determine 
whether it is in good working order, in need of minor maintenance, or requiring a 
major repair. Denote these states as 0, 1, 2, respectively. If the machine is in good 
working order, the probability that it remains so at the beginning of the next day is 
p(0|0), that it needs maintenance is p(1|0), and that it needs repair is p(2|0). If the 
machine is not in good working order, the company can utilize either of two 
maintenance-and-repair services. The We-Fix-It Service Company charges M for minor 
maintenance and R for a major repair. The We-Try-to-Fix-It Service charges m and r, 
where m < M and r < R. As you might suppose, We-Fix-It does better quality work, 
and this is reflected in the probabilities that at the beginning of the next day, the 
equipment is in good working order. Let d = 1 refer to using We-Fix-It and d = 2 
to using We-Try-to-Fix-It; let p(j | i, d) denote the transition probability of the 
machine being in State j next period (j = 0, 1, 2) if it is in State i this period 
(i = 1, 2) and decision d is made (d = 1, 2). 

(a) Formulate dynamic programming extremal equations for 0 ≤ α < 1 and α = 1 
that determine an optimal policy over an unbounded horizon for deciding which 
service to use, given the state of the equipment. 

(b) Write the corresponding primal and dual linear programs, as described in Sec. 
18.5. 

(c) Let α = 1 and assume that 

    p(0|0)   = .6    p(1|0)   = .3    p(2|0)   = .1 
    p(0|1,1) = .9    p(1|1,1) = .1    p(2|1,1) = 0     M = 14 
    p(0|1,2) = .7    p(1|1,2) = .2    p(2|1,2) = .1    m = 12 
    p(0|2,1) = .6    p(1|2,1) = .3    p(2|2,1) = .1    R = 21 
    p(0|2,2) = .5    p(1|2,2) = .4    p(2|2,2) = .1    r = 19. 

Find an optimal policy and the minimum expected cost per period. (Use the 
policy iteration algorithm.) Show the corresponding optimal solution to the dual 
linear programming problem in part (b). 

*(d) Suppose that We-Fix-It requires a full day to perform a major repair, and 
We-Try-to-Fix-It requires two full days. Assume that the company loses c for each day 
the machine is inoperative. Show how to alter the formulations in part (a). 
(Hint: define two new states, “first day inoperative” and “second day 
inoperative.”) 

*(e) Find an optimal policy in part (d) using the data in part (c) and c = 2. 

42 Optimal Replacement Problem. The Hard Core Computer Company, manufacturers of 
solid-state memory devices, serves several installations at Cape Kennedy. In one such 
installation an expensive device is service tested every morning and then rated as 
being in Working Condition j, where j = 1, 2, . . ., J. Condition j = J means that 
the device is inoperative and must be replaced. The testing procedure is calibrated 
such that Condition j is deemed better than Condition j + 1. If the condition is i, 
where i < J, the company has the option of letting the device operate for the day; 
denote this decision d = 1 and the consequent probability that the machine is in 
Condition j tomorrow morning as p(j | i, 1). The company can instead install a new 
device; denote this decision d = 2. The associated transition probability is simply 
p(j | 2), and does not depend on the current Condition i. The value of using a machine 
in Condition j during a period is c_j. If the device is replaced, a cost r_0 is incurred. And 
if the device deteriorates to Condition J by the beginning of the next period, an 
additional cost c_J is incurred. 

(a) Formulate the appropriate unbounded horizon extremal equations for 0 ≤ α < 1 
and for α = 1 to determine the conditions at which a new device should be 
installed. 

(b) Show how to alter the formulation in part (a) if it takes a day to install a new 
device. If it takes two days. Let v represent the cost per day of having the 
equipment out of order due to installation of a new device. (Hint: consider adding the 
states “first day of installation” and “second day of installation.”) 

(c) Consider a problem having the data J = 3, r_0 = 100, c_1 = 5, c_2 = 10, c_3 = 75, 
and 

    p(1|1,1) = .7    p(2|1,1) = .2    p(3|1,1) = .1 
    p(1|2,1) = .1    p(2|2,1) = .7    p(3|2,1) = .2 
    p(1|2)   = .8    p(2|2)   = .1    p(3|2)   = .1. 

Write the formulations in part (a) in detail. Let α = 1 and find an optimal 
solution using the policy iteration algorithm. 

(d) Write the primal and dual linear programs, as described in Sec. 18.5, for the 
example in part (c) for both 0 ≤ α < 1 and α = 1. Exhibit the optimal 
solution to the dual problem for α = 1. 

(e) Determine the number of variables and equations in the primal and dual linear 
programming versions for your formulation in part (a) with α < 1. 

*(f) Suppose the device is specially made for the Cape Kennedy installation and that 
c(x) is the cost of manufacturing x units of the device, where c(x) is nonlinear. 
Hard Core wants an optimal policy that combines both the production 
decision and the replacement decisions. Explain why these two decisions have an 
interactive cost effect. Formulate a dynamic programming model that jointly 
optimizes the two types of decisions. Assume that h is the holding cost per period 
for each device held in reserve at the end of a period. (Assume for simplicity that 
0 ≤ α < 1.) Suppose that the maximum production quantity is X. Determine 
the number of variables and equations in the associated primal and dual linear 
programming versions of your formulation. 

43 The Coast Guard operates a navigation guidance and weather ship in the North
Atlantic. The ship utilizes two pieces of expensive electronic gear that are subject to
failure. The ship can continue performing its mission, at a degraded level, if one
piece has failed. But if both pieces are not in working order, the Coast Guard must
perform an emergency airdrop with replacements. The Coast Guard has the option,
however, of sending a replacement by supply ship when only one of the pieces fails.

Assume each time period is a week. Let State 0 refer to the condition that both
pieces are working, State 1 to the condition that only Piece 1 has failed, State 2 to
the condition that only Piece 2 has failed, and State 12 to the condition that both
pieces have failed. The costs of being in States 1, 2, and 12 are F_1, F_2, and F_{12},
respectively; State 0 is costless. Let the decision d = 0 denote there is no replacement action;
d = 1 that only Piece 1 is replaced by sending a supply ship, at the cost K_1; d = 2
that only Piece 2 is replaced by sending a supply ship, at the cost K_2; and d = 12
that an emergency airdrop is required at the cost K_{12}. Note that d = 0 is relevant for
States 0, 1, 2; d = 1 is relevant only for State 1; d = 2 is relevant only for State 2;
and d = 12 is relevant only for State 12. Accordingly, let p(j | i, d) designate the
probability that the system is in State j next week, given that it is in State i this week
and the decision is d.

(a) Enumerate every possible policy and give a verbal description of each. Explain
why the probabilistic and economic assumptions do not make it worthwhile to
replace a piece that is in working condition. Explain why p(12 | 1, 0) might not
equal p(12 | 2, 0), and why p(12 | 0, 0) might not equal p(12 | 1, 0) · p(12 | 2, 0).

(b) Formulate a dynamic programming model for an unbounded horizon using the 
criterion of minimum expected cost per week and show the extremal equations 
to be satisfied. 

(c) Write the corresponding primal and dual linear programs, as described in Sec.
18.5. Be sure to indicate explicitly those p(j | i, d) that equal 0.

44 Consider the stochastic shortest-route model in Sec. 18.2. Suppose that if you make
decision d in Period t when the system is at Node i, then the system arrives at Node j
in Period t + h_{ij}(d) with probability p(j | i, d), where h_{ij}(d) is a nonnegative integer.
Let α be the one-period discount factor, where 0 ≤ α < 1.

(a) Show how to alter (6) in Sec. 18.2 accordingly. 

(b) Consider the extension to the case of an unbounded horizon, where 0 ≤ α < 1.
Show how to alter (1) in Sec. 18.3 accordingly.

*(c) Consider the extension to the case of an unbounded horizon, where α = 1. Show
how to alter (8) in Sec. 18.3 accordingly. (Hint: review the deterministic case in
the special material at the end of Sec. 12.3.)

(d) Suppose that h_{ij}(d) is a random variable. Let g(h | i, j, d) denote the probability
that h_{ij}(d) equals h. Show how to alter the formulation in part (b). [Note: in an
application of this model, the value of c_{id} has to be calculated so as to reflect the
random variation in h_{ij}(d).]

*(e) Given the assumptions in part (d), show how to alter the formulation in part (c). 

45 Show that the value iteration algorithm (8) in Sec. 18.3 ensures at least geometric
convergence, that is, |y_i^n − y_i| ≤ α^n K for every i and a suitably chosen constant K.

46 Section 18.4 states that the Stationary Policy Theorem is not true when the number of 
states is unbounded. Each part below illustrates this statement. 

(a) Consider a network containing the Nodes 1, 2, 3,..., as shown in Fig. 18.10. 
Notice that there are two decisions (arcs) at each of the odd-numbered nodes, both 
having an associated cost 1, and one decision at each of the even-numbered nodes, 
having an associated cost 1/n. Assume the process starts at Node 1 and the criterion
is minimum cost per period over an unbounded horizon. Show that there is no
strategy of any sort that yields a cost per period of 0 over an unbounded horizon.
Then show that given any strategy, there exists another strategy having a strictly
smaller (but positive) cost per period. Hence, there is no optimal strategy. (Hint:
given an arbitrarily small, but positive, number ε, show how to find a strategy
yielding a cost per period that is less than or equal to ε.)

Figure 18.10



(b) Consider the network containing Nodes 1, 2, 3, ..., as shown in Fig. 18.11.
Let Action 1 denote going from Node i to Node i + 1 and Action 2 denote going
from Node i back to Node i. Assume that the process starts at Node 1 and the
criterion is minimum expected cost per period over an unbounded horizon. Show
that any stationary policy that does not randomize has cost per period that is
strictly positive. Show that there is a nonstationary policy that does not randomize
and has a cost per period of 0. (Hint: consider what happens if, when the system
first arrives at each State i, you take Action 2 for i consecutive periods and then
take Action 1.) Also show that there is a stationary policy that randomizes and
has expected cost per period of 0. (Hint: consider what happens if at each Node i
you select Action 1 with probability 1/i. Write the Markov chain for this strategy
and examine whether stationary probabilities exist.)

Figure 18.11

47 Consider the Stationary Policy Theorem for α = 1 in Sec. 18.4. Prove the proposition
that there always exists an optimal policy that does not randomize. 

48 Consider the extremal equations (10) for the case α = 1 in Sec. 18.4. Assume the
policy d* is used; to keep the notation uncluttered, we suppress the symbol d* in
M_{i0} and C_{i0}, which are defined next. Let M_{i0} denote the expected number of periods
(sometimes called the mean first passage time) for the system to next arrive at State 0
when it starts at State i; M_{00} can be called the expected cycle time for State 0. Similarly,
let C_{i0} denote the expected cost (return) during the same periods; C_{00} can be called
the expected cycle cost for State 0.

*(a) Show that M_{i0} and C_{i0} can be calculated from the linear equations

\[ M_{i0} = 1 + \sum_{j \neq 0} p(j \mid i, d^*)\, M_{j0} \quad \text{for } i = 0, 1, \ldots, T \]

\[ C_{i0} = c_{id^*} + \sum_{j \neq 0} p(j \mid i, d^*)\, C_{j0} \quad \text{for } i = 0, 1, \ldots, T. \]

(Hint: at least 1 period has to elapse in going from State i to State 0; if the first
transition is to State j, where j ≠ 0, then the additional expected number of periods
is the same as if the system started in State j. Similarly, c_{id*} is the expected cost
incurred at the first period; if the first transition is to State j, where j ≠ 0, then the
additional expected cost is the same as if the system started in State j.)

(b) Consider using the following quantities as a solution to the extremal equations
in (10):

\[ c^* = C_{00} / M_{00}, \qquad w_i = C_{i0} - c^* M_{i0}. \]

Give an interpretation of these quantities. Check that the associated value w_0 = 0.
Use the formulas in part (a) to show that these quantities actually satisfy (10).

(c) Calculate M_{i0} and C_{i0}, for i = 0, 1, ..., 4, for the inventory example in Fig.
18.3, using the optimal policy shown in Fig. 18.7 (« = 2). 

(d) Calculate each M_{i0} and C_{i0} for the optimal policy in exercise 26. Also compute
c* and w_i according to part (b).

(e) Same as part (d) for exercise 28. 

(f) Same as part (d) for exercise 34. 





(g) Same as part (d) for exercise 41, part (c). 

(h) Same as part (d) for exercise 42, part (c). 
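
The linear equations in part (a) of exercise 48 can be solved mechanically. The sketch below is an illustration only: the three-state transition matrix and the costs are made-up numbers, not data from any exercise in the text, and the helper names solve_linear and cycle_quantities are hypothetical. It assumes a single fixed policy, computes M_{i0} and C_{i0}, takes c* as the ratio of expected cycle cost to expected cycle time, and checks numerically the fixed-policy relation w_i + c* = c_i + Σ_j p(j | i) w_j.

```python
from fractions import Fraction as F


def solve_linear(a, b):
    """Solve the square system a x = b by Gauss-Jordan elimination (exact fractions)."""
    n = len(b)
    # Augmented matrix [a | b]; rows are copied so the inputs are not modified.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [v / m[col][col] for v in m[col]]
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] for i in range(n)]


def cycle_quantities(p, c):
    """Return (M, C): expected periods and expected cost until the next arrival at State 0."""
    n = len(c)
    # Coefficients of (I - Q), where Q is p with the State 0 column dropped (sum over j != 0).
    a = [[F(int(i == j)) - (p[i][j] if j != 0 else 0) for j in range(n)]
         for i in range(n)]
    periods = solve_linear(a, [F(1)] * n)
    cost = solve_linear(a, [F(x) for x in c])
    return periods, cost


# Hypothetical three-state chain under a fixed policy (numbers are made up).
p = [[F(1, 2), F(1, 2), F(0)],
     [F(0), F(1, 2), F(1, 2)],
     [F(1), F(0), F(0)]]
c = [0, 1, 3]

M, C = cycle_quantities(p, c)
c_star = C[0] / M[0]  # expected cycle cost divided by expected cycle time
w = [C[i] - c_star * M[i] for i in range(3)]

# Check w_i + c* = c_i + sum_j p(j|i) w_j for every state.
for i in range(3):
    assert w[i] + c_star == c[i] + sum(p[i][j] * w[j] for j in range(3))
```

Exact fractions are used so that the final check is an equality rather than a floating-point tolerance; for larger chains a numerical linear-algebra routine would be the more practical choice.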

49 The Onagen Ophagen Company operates a delicate machine that sometimes works
properly and sometimes doesn’t. Let j = 0 denote the state in which the machine is
operating properly, and j = 1 indicate that the machine is in need of repair. If the
machine is used when it is in good order, the daily production cost is C_0. If the machine
is used when it is not in good order, the daily production cost is C_1, where C_1 > C_0.
Unfortunately, it is only possible to tell the actual state of the machine by making an
expensive test on the items produced during the day. Let I be the associated inspection
cost. A repair action ensures that the machine is in good operating condition immediately
and incurs the cost R.

Let p denote the probability that the machine is still in good operating condition at
the beginning of the next period, given that it is in good operating condition this
period, where a period is one day. Assume that once the machine is in the need-of-repair
state, it remains there.

Let P denote the probability that the machine is in good operating condition at the
beginning of a period. Explain why the value of P can be used to represent the state
of the production system. Exhibit a dynamic programming recursion for an unbounded
horizon (assume 0 < α < 1) that determines whether to operate the machine for
a day without inspection, to perform the inspection, or to take the repair action, given
the current value of P. (Hint: check that the probability that the machine is in good
operating condition at the beginning of next period when no inspection or repair occurs
is Pp.)


Many important examples of the models in this chapter have a special structure 
that can be exploited to simplify the Markovian formulation and the calculation 
of an optimal solution. The next exercises illustrate several techniques for exploiting
special structure.

50 Separable Markovian Decision Problems. Consider an infinite horizon stochastic shortest-route
model, like that discussed in Sec. 18.3. Assume, initially, that the nodes (or
states) are i = 1, 2, ..., T, and that there are K distinguished decisions available at
every state. The special nature of these K decisions is indicated by the extremal equations
that are to be satisfied:

\[ y_i = \min\Bigl\{ \alpha y_1 + a_i + b_1,\ \ldots,\ \alpha y_K + a_i + b_K,\ \min_{d \in D(i)} \Bigl[ \sum_{j=1}^{T} p(j \mid i, d)\,\alpha y_j + c_{id} \Bigr] \Bigr\} \quad \text{for } i = 1, 2, \ldots, T, \]

where 0 ≤ α < 1.

(a) Give a verbal interpretation of the economic and structural assumptions made
about the K distinguished decisions. Let N_i be the number of decisions in the set
D(i). Determine the numbers of constraints and variables in the associated primal
linear programming model, analogous to that in (1) and (2) of Sec. 18.5.

(b) Show that by introducing an additional Node 0, a solution to the above extremal
equations can be obtained by solving, instead, the extremal equations

\[ y_0 = \min\{ \alpha y_1 + b_1,\ \ldots,\ \alpha y_K + b_K \} \]

\[ y_i = \min\Bigl\{ y_0 + a_i,\ \min_{d \in D(i)} \Bigl[ \sum_{j=1}^{T} p(j \mid i, d)\,\alpha y_j + c_{id} \Bigr] \Bigr\} \quad \text{for } i = 1, 2, \ldots, T. \]

(c) Write the primal linear programming model, analogous to that in (1) and (2) 
of Sec. 18.5, for the extremal equations in part (b). Determine the numbers of 
constraints and variables. 

(d) Let z_i = a_i + y_0 − y_i, for i = 1, 2, ..., T, and show how to eliminate T of the
linear constraints in part (c) by this change of variables. (Hint: recall that the y_i
are unconstrained in sign.) Explain how to modify the formulation if the K
distinguished decisions are available only in Nodes i = 1, 2, ..., t, where t < T.

*(e) Derive the appropriate extremal equations for α = 1, analogous to those in
part (b), and exhibit a streamlined primal linear programming problem analogous
to that in part (d).

*(f) Apply the analysis above to formulate the equipment replacement model in
Secs. 17.7 and 18.7. (Hint: you can let the Node i represent the age of a piece of
equipment entering a period, prior to the decision of whether or not to replace.)
Exhibit in detail a streamlined programming model corresponding to part (d).

*(g) Continuing with the analysis in part (f), exhibit in detail the linear programming
model corresponding to part (e), using the data for the example in Figs. 17.6 
and 18.9. Determine the associated optimal solution. 

(h) Apply the analysis above to formulate the quality control model in exercise 42, 
part (a). Exhibit in detail the streamlined linear programming model corresponding
to part (d).

51 Separable Markovian Decision Problems (continued). Consider an infinite horizon stochastic
shortest-route model, like that discussed in Sec. 18.3. Assume, initially, that the nodes
(or states) are i = 1, 2, ..., T, and that there are K distinguished decisions available
in every state. The special structure of these K decisions is indicated by the extremal
equations that are to be satisfied:

\[ y_i = \min\Bigl\{ \alpha y_{t+1} + a_i + b_1,\ \ldots,\ \alpha y_{t+K} + a_i + b_K,\ \min_{d \in D(i)} \Bigl[ \sum_{j=1}^{T} p(j \mid i, d)\,\alpha y_j + c_{id} \Bigr] \Bigr\} \quad \text{for } i = 1, 2, \ldots, t \]

\[ y_i = \min_{d \in D(i)} \Bigl[ \sum_{j=1}^{T} p(j \mid i, d)\,\alpha y_j + c_{id} \Bigr] \quad \text{for } i = t + 1, \ldots, T, \]

where 0 ≤ α < 1, and K ≤ t ≤ T − K. (Assume that the values for all the a_i, b_k,
and c_{id} guarantee that the extremal equations possess a unique finite solution.)

(a) Give a verbal interpretation of the economic and structural assumptions made
about the K distinguished decisions. Let N_i be the number of decisions in the set
D(i). Determine the numbers of constraints and variables in the associated primal
linear programming model, analogous to that in (1) and (2) of Sec. 18.5.

(b) Show that by introducing an additional Node 0, a solution to the above extremal
equations can be obtained by solving, instead, the extremal equations

\[ y_0 = \min\{ \alpha y_{t+1} + b_1,\ \ldots,\ \alpha y_{t+K} + b_K \} \]

\[ y_i = \min\Bigl\{ y_0 + a_i,\ \min_{d \in D(i)} \Bigl[ \sum_{j=1}^{T} p(j \mid i, d)\,\alpha y_j + c_{id} \Bigr] \Bigr\} \quad \text{for } i = 1, 2, \ldots, t \]

\[ y_i = \min_{d \in D(i)} \Bigl[ \sum_{j=1}^{T} p(j \mid i, d)\,\alpha y_j + c_{id} \Bigr] \quad \text{for } i = t + 1, \ldots, T. \]

(c) Write the primal linear programming model, analogous to that in (1) and (2) of
Sec. 18.5, for the extremal equations in part (b). Determine the numbers of 
constraints and variables. 

(d) Show how to eliminate T of the linear constraints in part (c) by changing variables

\[ z_i = a_i + y_0 - y_i \quad \text{for } i = 1, 2, \ldots, t. \]

Show how to eliminate K more of the linear constraints by making a second 
change of variables 

Zt+jk = + bt — zt^k for A: = 1 , 2 ,..., /C. 

*(e) Derive the appropriate extremal equations for α = 1, analogous to those in part
(b), and exhibit a streamlined primal linear programming problem analogous to 
that in part (d). 

(f) Consider a stochastic inventory model in which p(q) denotes the probability that
demand is q (demand is independently and identically distributed from period to
period), the purchase cost is s + cx for x = 1, 2, ..., where s > 0, and the
inventory-holding-and-penalty-cost function is H(j − q), where j is the level of
inventory after an order is placed (assume delivery is immediate) and before
demand occurs. Assume that when demand q exceeds j, the excess demand is lost.

Suppose you want to find a policy of the following type. If entering inventory
exceeds 10, no replenishment order is placed. If entering inventory is 10 or less,
then a replenishment order can (but need not) be placed; inventory after an
order but before demand is not permitted to exceed 15. (Assume that inventory
at the start of the horizon is 0.)

Formulate the model to give a set of extremal equations analogous to those in
part (a). Determine the numbers of constraints and variables in the associated
primal linear programming model. Show how to reduce the problem to the
version given in part (b); again determine the numbers of constraints and variables
in the associated primal linear programming model. Apply the analysis in
part (d) and exhibit in full the associated linear programming model. [Hint: let
a_i = −ci and b_k = s + (k + 10)c + Σ_q H(k + 10 − q)p(q).]

(g) Consider the example in part (f). Show how the problem can also be characterized 
by the extremal equations in exercise 50, parts (a) and (b). Determine the number 
of constraints and variables in the corresponding primal linear programming 
models. Exhibit in full the associated streamlined linear programming model in 
part (d). Is this alternative formulation preferable to that in part (f) above? 
Explain.

CHAPTER 19

Probabilistic Inventory Models


19.1 NEW ORIENTATION 

Previous chapters have focused on the development of widely applicable mathematical
models and algorithms. For example, Chaps. 2, 6, 8, 10, and 11 dealt
with linear programming, network, and multistage models; Chaps. 4, 7, and 12
presented numerical techniques for solving these models. Throughout, the
mathematical models and the algorithms were illustrated by specific examples
suggestive of real managerial applications of operations research. You may even
have been surprised that these examples covered such a broad range of decision-making
situations.

In Chaps. 19 and 20, and Appendices II and III, the orientation of “models
first, applications later” is reversed. These chapters deal with particular decision 
areas in which operations research has proved beneficial, namely the management 
of inventories and the design of waiting line systems. The resulting mathematical 
models and algorithms are designed to fit the particular context of the problem. 
The solutions commonly yield optimal decision rules to be applied in an operating 
environment, and not merely guidelines to be used for planning purposes. 

Of course, in a strictly formal sense, each model and algorithm can be classified 
as a special case of the models and algorithms you have already studied. But the 
new insights you will experience come from analyzing the particular nature of 
the models, not from recognizing that the models can be viewed as linear, nonlinear,
or dynamic programming problems. Now you will fully exploit the specific
structure of these models. 

The change in emphasis requires that you make a corresponding change in
your study orientation. You should seek the fundamental concepts that are critical
to the analysis of most problems in the particular area of application. Since there
are infinite variations in the circumstances describing a particular application,
it is impossible for any book, let alone an introductory text, to provide enough
models to cover every eventuality. You can, however, learn the ideas that make
up the basic building blocks of such models, and these fundamentals are contained
in the next two chapters and the last two appendices.

In studying the material, remember to answer the following questions:

1. What decisions are to be made? What managerial problem is being solved?

2. What causes the real decision environment to be so complex as to require
the use of operations research models? What elements of this complexity are
embodied in the models? How do these elements affect the solution? What
elements are ignored?

3. What distinguishes a good decision from a poor one? 

4. If you were a manager, how would you employ the results of the analysis? 
In what ways might you want, or need, to temper the results because of 
factors not explicitly considered in the models? 

Keep in mind that the answers should reflect the day-to-day nature of the decision
problems in these areas.

19.2 SCIENTIFIC APPROACH TO INVENTORY 
MANAGEMENT 

Seeing inventories everywhere, every day, may have dulled your awareness 
that they are the direct result of managerial decisions. Your awareness is likely 
to be sharpened only when you find some item you want is out of stock in a grocery 
store, a hi-fi shop, or a bookstore. Irritated, you may say, “Somebody slipped 
up!” If the item is a Valentine’s Day greeting card and the date is February 13, 
that “somebody” may be you. A shopkeeper can hardly be expected to have a
large inventory of stock when his forecast of future demand is negligible. 

Of course, the sheer pervasiveness of inventory decisions in business enterprises 
is not enough to justify an operations research approach. At least one additional 
condition must be present to warrant a company’s applying this point of view: 
both the likelihood and economic impact of incorrect inventory decisions must 
be substantial enough to outweigh the cost of scientific analysis. To illustrate, a 
large stationery store may well find it profitable to derive an optimal inventory 
policy for stocking its supply of pencils, whereas you would find it silly to derive 
such a policy for the supply of pencils in your desk drawer. Why? 

So that you are not misled by the frequent reference to retail firms, we mention
that applications of inventory models are more widespread in manufacturing and
wholesale companies, although this historical pattern is rapidly changing. For
example, you will commonly find scientifically based inventory replenishment rules
employed in large manufacturing plants to control hundreds, and often thousands,
of items of raw materials, intermediate goods, and finished products. Typically,
in these situations, only a very few men are responsible for all the inventory
replenishment decisions. You may wonder how mathematically derived rules can
be an improvement over the experienced judgment of these men. The answer
is given below.

Systemwide economics. Often a manager does outperform a mathematical
formula in the replenishment of any given item. But to do so, he must carefully
forecast the demand for the item and keep tab on the exact length of time required 
to obtain a resupply. If the item is important to the firm—because of its high cost 
or essentiality to the production process—then continual managerial review may 
well be the best policy. But there is a limit to the number of items that an individual 
can closely scrutinize. Since most companies inventory a myriad of items, it is 
inevitable that many of them will be stocked according to some routine or rule- 
of-thumb approach. Operations research can provide a cost-saving means of 
management for this large segment of the inventory system. 

By establishing methodical procedures for replenishing most inventoried items, 
the scientific approach frees the decision-maker to concentrate his talents on 
discretionary and exceptional situations, where his experience is of greatest benefit. 
Thus an effective scientific inventory system really requires a harmonious combination
of human judgment and mathematical formulas.

Operational simplicity. Inventory models are among the few instances
in which the resultant optimal solutions of operations research models are implementable
on a day-to-day basis. The reason is that in most inventory situations,
the essence of the solution is simply

(i) A rule indicating when to replenish. 

(ii) A rule indicating the amount to replenish. 

In other words, an inventory policy signals the timing and magnitude of the 
resupply decision. No matter how complex the mathematical model or algorithm 
underlying the replenishment rules, the description of the policy in terms of (i) 
and (ii) is always easy to understand. Sometimes the rules are given by tables and 
only require the manager to state data, such as the item’s price and average 
demand. But frequently the entire system is computerized, in which case the computer
keeps an up-to-date record of the inventory level, and at the appropriate
time prints out a replenishment order showing the correct amount. 
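
The two rules can be written as a small decision function. The sketch below is only an illustration under stated assumptions: rule (i) is taken to be a reorder point and rule (ii) a fixed order quantity, which is one common form and not the only one; the names replenishment_order, reorder_point, and order_quantity are hypothetical and the numbers are made up.

```python
def replenishment_order(inventory_level, reorder_point, order_quantity):
    """Return the amount to order now; 0 means no order is placed.

    Rule (i), when to replenish: order once the level falls below the reorder point.
    Rule (ii), how much: order a fixed quantity (an assumption of this sketch).
    """
    if inventory_level < reorder_point:
        return order_quantity
    return 0


# Hypothetical item: reorder point of 20 units, order quantity of 50 units.
for level in (35, 20, 12):
    print(level, replenishment_order(level, reorder_point=20, order_quantity=50))
```

However elaborate the model that produces the two parameters, the operating rule that a clerk or a computer applies stays this short.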

Even though inventory models are widely accepted, there are still many
environments in which replenishment decisions are made according to arbitrary
rules and judgment, despite the substantial losses that mistakes can inflict. A major
contributing factor for the persistence of such rules-of-thumb is that these situations
have not yet been studied intensively by operations research specialists. Further,
the few solutions derived so far are much more complicated than (i) and (ii), so
that implementation is correspondingly more difficult. Typically, these highly
complex environments include situations in which there are significant interrelationships
between the demands for the different items (for example, when
several of the items can be substituted for each other) and when a firm stocks an
item at more than one location (for example, at a regional warehouse as well as
at several retail outlets). In time, operations research will deal successfully with
these difficult problems.

Study objective. In the remainder of this chapter and in Appendix II, 
you will see several formulations of inventory models. Each is a prototype of a 
class of models that has been studied by operations research theorists. By understanding
these basic approaches, you will become familiar with the main
principles of inventory theory. Consequently, you will have the mastery required 
of a manager to comprehend the objectives, strengths, and limitations of a scientific 
inventory system. What is more, you will be equipped to study an in-depth text 
devoted to operations research inventory models, if you wish to learn more within 
this area. 

19.3 GROUND WORK FOR INVENTORY ANALYSIS

What considerations are important for a firm’s decision of when and how much
of an item to inventory? The significant elements fall into three major categories.
These factors are discussed below in general terms, and in the subsequent sections, 
they are made precise by the particular mathematical formulations of the models. 
For clarity of exposition, consider the case of the Les Luster Company, a wholesaler 
of fabricated metal supplies, such as aluminum wire, rods, coils, sheets and plates, 
etc., which it buys from a vendor aluminum company. You should have no 
trouble in relating this wholesaler’s decision considerations to those of other firms 
carrying inventory. 

Demand and supply. Certainly the wholesaler needs to forecast, in 
one way or another, its own local customers’ future demand for the item. Such a 
forecast directly influences the order quantity, and most frequently is represented 
by a probability distribution. This chapter assumes throughout that different 
item-demands are independent of each other. Even when this assumption does 
not strictly hold, it is often an adequate approximation on which to base an 
inventory rule. Further, the demand distribution is assumed to be independent 
of the selection of a replenishment rule or any other managerial action. By using 
advanced mathematical analysis, one can find optimal policies in which these 
approximating assumptions are omitted. 

Employing a probability distribution to describe future demand is a useful way 
of combining partial knowledge with partial ignorance. By stating the form and 
parameters of the distribution—such as the mean of a Poisson distribution, or the 
mean and variance of a negative binomial distribution—you can express what 
you know about the relative likelihoods of demand values. Thus the probabilities 
summarize your uncertainty about the precise demand value that will occur. 
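
As a small illustration of how a demand distribution is used, the sketch below evaluates Poisson probabilities and the chance that demand exceeds a given stock level. The mean of 4 units and the function names poisson_pmf and prob_demand_exceeds are assumptions made for this example; nothing here is taken from a specific model in the text.

```python
import math


def poisson_pmf(q, mean):
    """Probability that demand equals q when demand is Poisson with the given mean."""
    return math.exp(-mean) * mean ** q / math.factorial(q)


def prob_demand_exceeds(stock, mean):
    """Probability that demand is strictly greater than the stock on hand."""
    return 1.0 - sum(poisson_pmf(q, mean) for q in range(stock + 1))


mean_demand = 4.0  # hypothetical average demand per period
for stock in (2, 4, 6, 8):
    print(stock, round(prob_demand_exceeds(stock, mean_demand), 4))
```

The single parameter carries everything that is known about demand; the printed probabilities express what remains uncertain for each candidate stock level.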

The wholesaler also needs to know the so-called lead time or delivery
lag between the moment the replenishment action is initiated and the moment
the items ordered are available to meet its customers’ demands. The interval
obviously encompasses all the time required by the vendor to process the wholesaler’s
order, including shipping time; but it also encompasses any delays on Les
Luster’s own premises due to placing the order as well as receiving and unpacking
it on arrival. The lead time may be specified deterministically or stochastically,
depending on the situation.

Occasionally, there may be variations between the amounts ordered and actually 
received; for example, scrap losses are significant in some manufacturing processes,
and the vendor may have some leeway in the amount he ships. Such situations 
will not be considered in this chapter; however, they can be treated by using more 
advanced analysis. 

Replenishment economics. If you examine most large-scale inventory 
systems, you will quickly find that for items frequently demanded, the replenishment
quantity is usually sizeable, and certainly greater than 1. There are several
reasons why. The most important is that very small orders would result in frequent 
reordering and thereby incur a considerable expense associated with processing 
and receiving the order; this expense is referred to as the setup or reorder 
cost. A second and less obvious reason is that a large order protects the company 
against running out too often. Other reasons that are sometimes significant (but 
not treated by the models in this chapter) include quantity discounts or minimal 
order sizes stipulated by the vendor, or the firm’s forecast of rising vendor prices. 

What limits the order quantity? The principal factor is the inventory 
holding cost. Keeping items in stock is costly because inventories tie up 
capital that might otherwise be profitably employed, and also they incur the 
expenses of storage, maintenance, insurance, etc. Other limiting reasons that 
arise in real situations (but will not be embodied in the models of subsequent 
sections) include spoilage, obsolescence, budgetary and space restrictions, etc. 

So far we have not discussed which items the wholesaler should try to keep in 
inventory. After all, even a supermarket or department store does not stock 
everything that its customers might want. Presumably there is a penalty cost 
or profit loss whenever the wholesaler is out of stock of an item a customer 
requests. Obviously, a lost sale means less revenue. But there is a penalty even if 
the customer is willing to have his order backlogged, for then the wholesaler must 
incur some extra expense from keeping backlog records and filling the order in 
a later shipment. As a consequence, the firm will inventory an item if the out-of- 
stock cost is high; this qualitative statement is made precise for each of the inventory
models in this chapter.

The optimality criterion for inventory models is stated traditionally in terms
of expected cost rather than expected profit. Since most models consider the item’s
selling price to be fixed, this cost convention is legitimate, provided sufficient
care is exercised in defining the penalty cost to include lost or delayed revenue
when a sale is lost or backlogged. Another reason for this cost convention is that
frequently such inventory models are applied to items used by a firm’s own personnel;
consequently, the items are not really being “sold” at a profit.

Assuming an item is to be stocked, an optimal inventory policy is one that 
strikes the proper balance among the reorder, holding, and penalty costs. As you 
will see, all three cost elements influence how much and when to reorder. The 
timing decision usually amounts to a specification of a reorder point: whenever 
the inventory position falls below the reorder level, a replenishment action is 
taken. The higher the reorder point, the more protection there is against stockouts.
Thus in changing the reorder level, there is a direct tradeoff between stockout and 
holding costs. Incidentally, if the order quantity is so large as to give considerable 
protection against stockouts, and if customers are willing to have their demands 
backlogged, the company may find it is optimal to allow a backlog to build up 
before placing a replenishment order. You will see how this happens in Sec. 19.5. 
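
The tradeoff between stockout and holding costs can be seen in a small calculation. The sketch below is an illustration under assumptions that are not part of the text: demand during the lead time is taken to be Poisson with a made-up mean of 6 units, and the unit holding and penalty costs are hypothetical. For each reorder level it reports the expected units left over and the expected units short when the order arrives.

```python
import math


def poisson_pmf(q, mean):
    """Probability that lead-time demand equals q (Poisson assumption)."""
    return math.exp(-mean) * mean ** q / math.factorial(q)


def expected_leftover_and_shortage(reorder_point, mean):
    """Expected units left over and expected units short at the end of the lead time."""
    leftover = sum((reorder_point - q) * poisson_pmf(q, mean)
                   for q in range(reorder_point + 1))
    # Identity: E[(D - r)+] = E[(r - D)+] - (r - E[D]).
    shortage = leftover - (reorder_point - mean)
    return leftover, shortage


mean_lead_time_demand = 6.0  # hypothetical
holding_cost = 1.0           # hypothetical cost per unit left over
penalty_cost = 5.0           # hypothetical cost per unit short

for r in range(4, 13, 2):
    left, short = expected_leftover_and_shortage(r, mean_lead_time_demand)
    cost = holding_cost * left + penalty_cost * short
    print(r, round(left, 3), round(short, 3), round(cost, 3))
```

Raising the reorder level always lowers the expected shortage and raises the expected leftover stock, so the best level depends on the relative sizes of the two unit costs.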

As you know from the analysis in previous chapters, if the firm’s criterion function
is in terms of expected present value, then the tradeoff between present and
future returns must also be accounted for in the optimization process. To illustrate,
as the interest rate increases, the optimal policy will order less in the
current period, because postponement becomes less costly when reckoned in 
terms of present value. 

It is important for you to realize that in real applications of inventory models, 
the task of estimating reordering, holding, and penalty costs can be quite difficult. 
Nevertheless, the problem is not insurmountable, as demonstrated by the widespread
adoption of these models. What you seek are approximate and reasonable
cost estimates. More will be said about such matters of implementation in Sec. 
19.7. 

Systems specifications. The third category of elements influencing the 
selection of an inventory policy deals with a disparate variety of environmental 
factors, only some of which are under the control of the decision-maker. 

One such specification relates to whether the inventory level is continuously 
or only periodically reviewed. In the continuous case, a replenishment order is 
initiated as soon as the inventory level drops below the reorder point. In the 
periodic or discrete review case, the inventory status is checked only at specified 
intervals—for example, each Monday. In this system, replenishment occurs only 
at the interval point—even when inventory drops below the reorder point at an 
early moment in the interval. Section 19.6 analyzes a continuous review model 
with probabilistic demand; Appendix II treats the periodic review case. 

Like all mathematical models, these inventory models are only approximations 
to reality. In an actual situation, then, the choice between a continuous and a 
periodic review model depends on which is the better representation of reality. 
A detailed discussion of how to make this choice is best left to specialized texts on 
inventory control. If you read Appendix II, however, you should carefully compare 
the two types of models and their underlying assumptions so that you learn 
in what ways they are essentially different. 

A related specification concerns restrictions on the order quantity. Sometimes 
it is required or convenient to order only in multiples of a standard amount, such 
as a dozen. (Note that, in this case, it is not generally permissible merely to change 
the scale of measurement—to dozens, say—since customer demand may not occur 
in the same transformed units.) Sometimes the order quantity is limited because 
the company has limited warehouse space available for inventory. These kinds of 
restrictions have been successfully treated in the operations research literature, but 
they will not be considered in this book. 

Finally, the replenishment rule may be affected by interactive factors. For 
example, Lcs Luster Company, the metals-supply wholesaler, may be constrained 
to ordering a freight-car load at a time from the fabricating mill. Therefore, its 
order quantities for individual items must reflect this overall constraint. Or an 
item may be stocked at several different warehouse locations, so that more than 
one inventory level must be examined in determining a replenishment action. 
Again, for the sake of simplicity, this text will not treat inventory optimization 
models with interactive constraints, although such models have been solved by 
operations research technicians. 

Summary and a look ahead. In brief, the models in this chapter and 
Appendix II separately consider each different type of item inventoried. They 
take as given the environmental factors of continuous or periodic review, the 
planning horizon, the lead time, future demand, purchase cost, setup cost, holding 
cost, penalty cost, and the discount factor. Then each model finds an optimal rule 
for when and how much to order. No additional system constraints are imposed. 
In studying these models, be sure to ask yourself how each of the environmental 
factors affects the optimal policy. 

Although the goal of this chapter is to analyze models in which demand is 
treated as being probabilistic and the inventory decision problem is recurring, 
that is, dynamic, you will find it helpful to consider each of these two elements in 
isolation. Specifically, Sec. 19.4 discusses finding an optimal inventory policy 
when demand is probabilistic but the ordering decision is made only once. Section 
19.5 treats the situation where the decision problem is dynamic but demands are 
perfectly forecastable (in terms of an average rate). Section 19.6 and Appendix II, 
then, will combine the two elements so as to highlight the interaction between 
them. 

In each case analyzed, the aim is to completely characterize an optimal policy 
in terms of merely two numbers: the reorder point and the order quantity. The 
resultant algorithmic problem of calculating the optimal values becomes one of 
optimizing a nonlinear objective function of only two variables. As you will see, 
knowing the form of an optimal solution and having so few decision variables make 
the algorithmic task fairly easy and also allow you to use simple, yet very effective, 
numerical approximations. 

19.4 STATIC (SINGLE DECISION) MODEL 

This section treats the so-called static, or one-period, model in which only a 
single inventory decision is made in anticipation of demand, which is viewed as 
a random variable. 

The model is important for two reasons. First, occasionally the model does fit 
a real situation, as exemplified by the example in the next paragraph. Second, the 
optimization analysis provides a useful starting point for considering the 
subsequent dynamic models. 

The following illustration, based on an actual application, suggests how a static 
model can be employed. (The discussion overlooks several details that influenced 
the exact policy implemented, but that are unimportant here.) The Daily 
Defender, a large newspaper in a major metropolitan area, has been concerned with 
controlling the number of papers to be distributed to individual newsstands. 
Unsold papers are returned to the publisher, and as a result the total 
out-of-pocket costs attributable to these returns amount to over $1 million annually. 
Better stockage rules would therefore promise sizeable savings. 

Although the daily decision as to the total number of papers to put on each 
stand is a recurring one, the decision problem can be treated by a static model. Each 
day’s newspaper is unsalable by the following day, so there is only one inventory 
decision to be made about these papers. The daily demand for newspapers can 
be viewed as a random process that is only partly predictable by such factors as 
the day of the week and the importance of the headline story. Also, the unit cost 
of the papers differs daily; for example, the Sunday edition, being larger, costs 
more to print than the other daily editions. Consequently, the Daily Defender 
applied an operations research model such as that given below by the linear 
holding and penalty cost case. (For the obvious reason, this model is sometimes 
referred to as the newsboy problem.) On any particular day, the appropriate 
costs are used in the model and the probability distribution for demand 
employed depends on the predictable factors for that day. 

Can you think of any other commercial situations in which a one-period 
inventory model is applicable? Under what circumstances would it be profitable 
to use an operations research approach for these situations? 

Form of replenishment policy. To begin the discussion, we define 
the symbols 

i = level of inventory prior to the ordering decision 
x = amount ordered (x ≥ 0) 
y = i + x = total inventory available to meet future demand 
q = actual demand quantity (a random variable, where q ≥ 0) 
p(q) = probability that demand equals q. 

For expository convenience, we assume i, x, y, and q are integer-valued. (The 
continuous variable case will be treated in the advanced material at an appropriate 
juncture in the discussion.) Typically, optimal values for x and y will 
depend on the level of starting inventory i, and so the symbols x(i) and y(i) will 
be used to denote an optimal policy. 

It is easy to construct plausible-looking examples that have surprising-looking 
optimal policies. If you are curious to see such an illustration, you can read the 
special material in a later part of this section. Most often, managers use the 
following simple type of replenishment rule: 

(1)  y(i) = i and x(i) = 0 for i ≥ s (do not order) 
     y(i) = S and x(i) = S − i for i < s (place an order). 

According to (1), no order is placed if initial inventory i ≥ s; but if i is small 
enough (i < s), then an order is placed so that the total inventory available for 
future demand is S. The rule in (1) is referred to as an (s, S) policy, where 
s denotes the reorder point and S the reorder level. 

In the special material in the second part of this section, we also give sufficient 
conditions on the underlying cost structure to ensure that an (s, S) rule is the 
optimal form of policy. But prior to considering a general one-period model, we 
treat in detail the most important special case of the model. 

Linear holding and penalty cost. Suppose that the expected cost 
for the entire period is the sum of the purchase cost and the expected holding 
and penalty cost. Specifically, let the cost of ordering the amount x be 

(2)  $c(x) = \begin{cases} 0 & \text{for } x = 0 \\ K + cx & \text{for } x > 0, \end{cases}$ 

where K ≥ 0 represents the setup cost and c ≥ 0 the unit purchasing cost. 

Assume that the expected holding and penalty cost during the period depends 
only on the total amount of inventory available to meet demand, namely, the 
level y, which equals initial inventory i plus the order amount x. Let L(y) denote 
this expected cost function, and in particular, assume that L(y) is calculated from 
the formula 

(3)  $L(y) = \sum_{q=0}^{y} h\,(y - q)\,p(q) + \sum_{q>y} \pi\,(q - y)\,p(q) \quad \text{for } y \ge 0,$ 

where h ≥ 0 is the unit holding cost per item left over at the end of the period, 
and π ≥ 0 is the unit penalty cost per item short at the end of the period. Thus 
the first summation is the expected holding cost and the second summation is the 
expected penalty cost. We also make the reasonable assumptions that c + h > 0 
and π > c, implying that h + π > 0. 

To illustrate, let 

(4)  $p(q) = \tfrac{1}{5} \quad \text{for } q = 0, 1, \ldots, 4, \quad \text{so that } E[q] = 2.$ 

Verify that 

(5)  $L(y) = \begin{cases} h\,(y - 2) & \text{for } y \ge 4 \\ \dfrac{h}{5} \sum_{q=0}^{y} (y - q) + \dfrac{\pi}{5} \sum_{q=y+1}^{4} (q - y) & \text{for } 0 \le y \le 3. \end{cases}$ 

The values of L(y) for holding cost h = 5 and penalty cost π = 0, 5, 10, 20, 25 
are shown in Fig. 19.1. [To test your understanding, calculate (5) for h = 5 
and π = 10; check your answers with the values in Fig. 19.1, and plot L(y) to 
see its convex shape.] Note that the value of y that minimizes L(y) increases as 
the penalty cost π increases. 
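
The calculation of L(y) from (3) can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `expected_cost` and the dictionary representation of p(q) are assumptions introduced here, not notation from the text. Exact fractions are used so that the results can be compared directly with hand calculations.

```python
from fractions import Fraction

def expected_cost(y, h, pi, p):
    """Expected holding and penalty cost L(y) as in (3).

    p maps each possible demand q to its probability p(q).
    """
    holding = sum(h * (y - q) * pq for q, pq in p.items() if q <= y)
    penalty = sum(pi * (q - y) * pq for q, pq in p.items() if q > y)
    return holding + penalty

# Demand distribution (4): p(q) = 1/5 for q = 0, 1, ..., 4.
p = {q: Fraction(1, 5) for q in range(5)}

# Tabulate L(y) for h = 5 and each penalty cost considered in the text.
for pi in (0, 5, 10, 20, 25):
    row = [float(expected_cost(y, 5, pi, p)) for y in range(7)]
    print(f"pi = {pi:2d}: {row}")
```

For h = 5 and π = 10 the sketch gives L(1) = 13, L(2) = 9, and L(3) = 8, the values used later in this section.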


Figure 19.1. Values of L(y), tabulated by penalty cost π. 


When there is a salvage value v (where 0 < v < c) for each item remaining at 
the end of the horizon, this value can be subtracted from the unit holding cost. 
Consequently, h in (3) really represents the net holding cost (h = h' − v), and can 
be negative. By the same token, if the item is being stocked for sale, and demand 
in excess of inventory on hand is lost, then π includes the sales price r 
(π = π' + r). 

It is easy to show that an (s, S) rule is the optimal form of policy for this special 
case. We now explain how to find optimal values for s and S. Let 

(6)  $R = \dfrac{\pi - c}{\pi + h},$ 

and denote the value of R as the critical ratio. Observe that 0 < R < 1 
because we assumed that π > c, c + h > 0, and π + h > 0. Then it can be shown that an 
optimal reorder level S ≥ 0 is the smallest integer such that 

(7)  $P(S) \equiv \sum_{q=0}^{S} p(q) \ge R$   (determination of reorder level S). 


Since P(S) is simply the cumulative distribution function at y = S, an optimal 
reorder level S is set such that there is at least a probability R of satisfying all 
demand. 

Observe from (6) that 

(i) R increases as π increases, so that the reorder level S is a nondecreasing 
function of the penalty cost π. 

(ii) R decreases either as the holding cost h or purchase cost c increases, so 
that the reorder level S is a nonincreasing function of h and c. 

(iii) If π ≥ 2c + h, then R ≥ 1/2 and S is at least as large as the median of the 
demand distribution. 

To illustrate, consider the example in (4) and (5). The graph of the cumulative 
distribution 

(8)  $P(y) = \dfrac{y + 1}{5} \quad \text{for } y = 0, 1, \ldots, 4$ 

appears in Fig. 19.2. The graph is drawn as a staircase to assist in the exposition. 


Figure 19.2. Cumulative Distribution P(y). 


Suppose the holding cost h = 5, the penalty cost π = 10, and the purchase cost 
c = 0 so that R = 2/3. Then according to (7) and (8), S is the smallest integer such 
that 

(9)  $P(S) = \dfrac{S + 1}{5} \ge \dfrac{2}{3},$ 

and therefore, 

(10)  reorder level S = 3. 

An optimal S can always be read off immediately from a graph of the cumulative 
distribution function such as Fig. 19.2. You merely locate the value of the ratio 
R on the vertical axis, extend a horizontal line over until it intersects the cumulative 
function P(y), and the reorder level S is the associated value of y. [If R coincides 
with a horizontal segment of P(y), then S is the smallest value of y associated with 
that segment.] What is the optimal S if the holding cost h = 5, the penalty cost 
π = 10, and the purchase cost c = 5? 

To summarize, an optimal reorder level S can be found by first calculating the 
critical ratio R in (6) and then using the cumulative function P(y). Note it is 
not necessary to calculate the expected holding and penalty cost function L(y) 
to obtain S. 

Suppose the setup cost K = 0. Then, as we demonstrate later in this section, it 
is optimal to let s = S. This means that if initial inventory i is any amount less 
than S, you order the quantity x = S − i. And if initial inventory is larger than 
S, you order nothing. 
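
The determination of S by (6) and (7) can be sketched as follows. The function name `reorder_level` is hypothetical, and the sketch assumes that p lists every demand value with positive probability; it simply accumulates P(y) until the critical ratio is reached.

```python
from fractions import Fraction

def reorder_level(h, pi, c, p):
    """Return (S, R): the smallest S with P(S) >= R, following (6) and (7).

    p maps each possible demand q (an integer >= 0) to its probability p(q).
    """
    R = Fraction(pi - c) / Fraction(pi + h)   # critical ratio (6)
    cumulative = Fraction(0)
    for q in sorted(p):
        cumulative += p[q]                    # cumulative is now P(q)
        if cumulative >= R:
            return q, R
    raise ValueError("the probabilities p(q) sum to less than R")

# Example (4): p(q) = 1/5 for q = 0, 1, ..., 4, with h = 5, pi = 10, c = 0.
p = {q: Fraction(1, 5) for q in range(5)}
S, R = reorder_level(h=5, pi=10, c=0, p=p)
print(f"R = {R}, S = {S}")   # R = 2/3, S = 3, in agreement with (10)
```

Because P(y) changes only at demand values with positive probability, scanning those values in increasing order is enough to find the smallest S satisfying (7).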

Consider the more challenging case where the setup cost K > 0. Now an 
optimal value for the reorder point s may be strictly less than S. The reason is 
that if the setup cost K is relatively large and initial inventory i (< S) is sufficiently 
close to S, then the additional expense of the setup and purchase costs may not be 
offset by the reduction in expected holding and penalty costs. For simplicity of 
exposition we will assume that an optimal s ≥ 0. 

The (s, S) rule states that you do not order when initial inventory equals s. Hence, 
the expected holding and penalty cost of letting y = s (that is, amount ordered 
x = 0), which is L(s), must be no greater than the ordering cost plus expected 
holding and penalty cost of letting y = S (that is, amount ordered x = S − s), 
which is K + c(S − s) + L(S). But since you do order when initial inventory is 
less than s, the preceding cost advantage must go the other way for y = s − 1. 

We can summarize the reasoning so far by stating that you choose s to be the 
smallest number such that 

(11)  $L(s) \le K + c\,(S - s) + L(S)$   (determination of reorder point s). 

Thus, when K > 0, you calculate the reorder point s by computing and comparing 
the expected holding and penalty cost function L(y) with the sum [K + c(S − y) 
+ L(S)], for successively smaller trial values of y, starting at y = S, and then 
continuing until y = s is the smallest trial value for which (11) holds. [In 
performing the calculations, you may find it easier to make the equivalent 
comparison between L(y) + cy and K + cS + L(S), since then the latter value is a 
constant.] 

In the example (8) with h = 5, π = 10, and c = .1, the inequality (11) yields 

(12)  $L(s) \le K + .1\,(3 - s) + 8 = 8.3 + K - .1s.$ 

Suppose K = 4. Then s = 2, since 

L(2) = 9 ≤ 8.3 + 4 − .1(2) = 12.1 
L(1) = 13 > 8.3 + 4 − .1(1) = 12.2, 

where the values of L(y) are found in Fig. 19.1. What is the optimal s if K = 12? 
When is an order placed in this case? 
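
The search for s described above can be sketched in Python. The names `expected_cost` and `reorder_point` are illustrative assumptions, not part of the text. The sketch uses the equivalent comparison between L(y) + cy and the constant K + cS + L(S), stepping down from y = S.

```python
from fractions import Fraction

def expected_cost(y, h, pi, p):
    """Expected holding and penalty cost L(y) for linear costs."""
    holding = sum(h * (y - q) * pq for q, pq in p.items() if q <= y)
    penalty = sum(pi * (q - y) * pq for q, pq in p.items() if q > y)
    return holding + penalty

def reorder_point(S, K, c, h, pi, p):
    """Smallest s <= S for which (11) holds, given the reorder level S."""
    target = K + c * S + expected_cost(S, h, pi, p)   # constant K + cS + L(S)
    s = S
    # Keep stepping down while the next smaller trial value satisfies (11).
    while expected_cost(s - 1, h, pi, p) + c * (s - 1) <= target:
        s -= 1
    return s

# Example from the text: h = 5, pi = 10, c = .1, S = 3, K = 4.
p = {q: Fraction(1, 5) for q in range(5)}
s = reorder_point(S=3, K=4, c=Fraction(1, 10), h=5, pi=10, p=p)
print(f"s = {s}")   # s = 2, as found from (12)
```

The loop stops because L(y) + cy grows without bound as y decreases whenever π > c. Changing K in the call lets you check your answer to the question about K = 12.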

You have just seen how to find an optimal (s, S) policy in the important special 
case of linear holding and penalty costs. The remainder of this section treats the 
static model from a more general point of view. The discussion shows how to 
formulate and solve the problem for other cost functions and discusses sufficient 
conditions on the cost functions that imply an (s, S) policy is optimal. 

*General description. We continue to use the definitions for i, x, y, q, 
and p(q) given above. Also define 

(13)  g(y|i) = expected cost when y is the inventory available to meet demand 
after ordering, given that initial inventory is i. 

In most actual applications of the static model, you start with initial inventory 
i = 0. But this is not always so. Indeed, we will even permit the value of i to be 
negative so as to indicate the presence of backlogged demand from previous 
periods. (Thus i might be termed the net inventory position prior 
to the ordering decision.) 

Note that y instead of x has been used in the description of the expected cost 
function g(y|i). Since x = y − i, employing either x or y is valid, but using y 
is mathematically more convenient. Also observe that g(y|i) represents an expected 
cost. Given specific values for y and i, the actual cost during the horizon must be 
viewed as a random variable because the demand quantity q is described as 
random. 

Reminiscent of dynamic programming analysis, the optimization problem can 
be indicated as finding 

(14)  $f(i) = \min_{y}\; g(y \mid i),$ 

so that f(i) represents the minimum expected cost when initial inventory is i. 
In the discussion below, we will always restrict the minimization to values of y 
at least as large as i, but there is no conceptual difficulty in permitting y to be 
smaller than i, thereby denoting a disposal decision. Given an associated optimal 
value y(i), you have that the optimal order quantity is 

(15)  $x(i) = y(i) - i.$ 

Typically, the function g(y|i) is easy to compute and eventually increases without 
bound as y increases. Explain why. So even if g(y|i) has no special mathematical 
properties to be exploited by the minimization in (14), the computational task of 
finding an optimal y(i) is straightforward; if worse comes to worst, you can 
enumerate y = i, i + 1, . . ., until you find the minimum g(y|i). (In the 
computational process, you must take care not to be fooled by a local minimum that is 
not also a global minimum.) 

But you would think that knowing y(i) for one or more values of i should help 
you infer the values of y(i) for other values of i. Here is a test of your intuition. 

Assume initial inventory can be any of the values i = 0, 1, . . ., 8, and that 
maximum demand is 8. Then you can suppose that y(i) ≤ 8 for every i, if the 
cost function is reasonable. In solving the minimization (14) suppose you find 

Case 1. For i = 7, y(i) = 8 so that x(i) = 8 − i = 1. Then what do you think 
are the values of y(i) and x(i) for i = 0, 3, 4? 

Case 2. For i = 3, y(i) = 4 so that x(i) = 4 − i = 1. Then what do you think 
are the values of y(i) and x(i) for i = 0, 4, 7? 

Write your answers to Cases 1 and 2 on a piece of scratch paper and jot down a 
sentence or two recording your justification. You will want to refer to your answers 
after reading the example below. 

*Lump-sum penalty cost. This illustration is intended to show how a 
model that contains seemingly straightforward assumptions can have a 
surprising-looking optimal solution. Assume that there are only two possible values, 4 and 
8, for demand, having the probabilities 

(16)  $p(4) = p(8) = \tfrac{1}{2} \quad \text{and} \quad p(q) = 0 \ \text{for } q \ne 4, 8.$ 

Let the ordering cost function be 

(17)  $c(y - i) = \begin{cases} 0 & \text{for } y = i \ (x = 0) \\ K + c \cdot (y - i) & \text{for } y > i \ (x = y - i > 0), \end{cases}$ 

so that a setup cost K ≥ 0 is incurred if an order is placed, and to this is added 
a unit purchase cost c ≥ 0 times the amount ordered. 

Assume there is no holding or inventory cost, nor is there any salvage value for 
items left over at the end of the horizon. But there is a lump-sum penalty 
cost π > 0 that occurs if demand exceeds y. Given these assumptions, the 
expected penalty cost function can be written as 

(18)  $L(y) = \sum_{q>y} \pi\, p(q) = \begin{cases} \pi & \text{for } y = 0, 1, 2, 3 \\ \tfrac{1}{2}\pi & \text{for } y = 4, 5, 6, 7 \\ 0 & \text{for } y = 8. \end{cases}$ 

Using (16), explain how (18) indicates that the actual loss is the same (π), no 
matter by how much the actual demand q exceeds the available supply y, and then 
verify the numerical expressions on the right of (18). 

The following is a frivolous but indicative illustration of when a lump-sum 
penalty, such as that in (18), is applicable. Suppose you go shopping downtown 
and must park your automobile on a street having parking meters. Let q be the 
quantity of time you need to conduct your business and y be the amount of time 
you “purchase” by putting nickels in the parking meter. The street is well policed, 
so that if you spend more time on your errand than you paid for (q > y), you are 
certain to get a parking ticket costing π. Since you cannot predict q perfectly, 
your decision problem in selecting y is a matter of balancing the cost of nickels 
in the meter against the expected cost of a parking ticket. 

(A commercial example of a lumpy penalty cost arises in the scheduling of what 
is known as the “air-shuttle” service between New York and Washington, D.C. 
A passenger does not need to have reservations for a flight on the air-shuttle; if 
he arrives at the airport on time, the airline guarantees him a seat. The airline’s 
decision problem is to select the size of aircraft to schedule for a given flight. If 
the scheduled aircraft fills up, the airline must make ready and fly a second plane, 
even if only one additional passenger is waiting to board!) 

To summarize, the expected cost function g(y|i) is the sum of the ordering cost 
(17) and the expected penalty cost (18): 

(19)  $g(y \mid i) = c(y - i) + L(y).$ 

After a few moments reflection about (16), (17), and (18), you will conclude that 
for any i, the only policies that need be considered in the optimization process are 

(20) y = i, 4, or 8 and x = 0, 4 — i, or 8 — i. 

Why is this so? 

Now look at two numerical examples. First let the unit purchase cost c = 1, 
the setup cost K = 4.5, and the lump-sum penalty cost π = 12. For initial 
inventory i = 7, the optimization in (14) indicates 

(21)  $f(7) = \min_{y}\; g(y \mid 7) = \min\,[\,g(7 \mid 7),\ g(8 \mid 7)\,] = \min\,[\,0 + 6,\ 4.5 + 1 \cdot 1 + 0\,] = 5.5 \quad \text{for } y(7) = 8.$ 

Therefore, when initial inventory i = 7, it is optimal to order x(7) = 8 − 7 = 1. 
Consider next i = 0: 

(22)  $f(0) = \min_{y}\; g(y \mid 0) = \min\,[\,g(0 \mid 0),\ g(4 \mid 0),\ g(8 \mid 0)\,] = \min\,[\,0 + 12,\ 4.5 + 1 \cdot 4 + 6,\ 4.5 + 1 \cdot 8 + 0\,] = 12 \quad \text{for } y(0) = 0.$ 

Verify that for i = 3 and 4 

(23)  y(3) = 8 and y(4) = 4, 

and in summary, 

(24)  y(i) = i and x(i) = 0 for i = 0, 4, 5, 6, 8 
      y(i) = 8 and x(i) = 8 − i for i = 1, 2, 3, 7. 

Translate (24) into a verbal description of the inventory policy. Do your answers 
in Case 1 above agree with (24)? If not, explain why. 

For the second example, let the unit purchase cost c = 1, the setup cost K = 1, 
and the lump-sum penalty cost π = 5. Check that 

(25)  y(i) = i and x(i) = 0 for i ≠ 3, 7 
      y(3) = 4 and x(3) = 1 for i = 3 
      y(7) = 8 and x(7) = 1 for i = 7. 

Translate (25) into a verbal description of the inventory policy. Again, do your 
answers in Case 2 above agree with (25)? If not, explain why. 
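
The two lump-sum examples can be checked by direct enumeration of g(y|i), as suggested in the general description. The sketch below is only illustrative: the function name `lump_sum_policy` and the tie-breaking choice (prefer the smaller y, that is, prefer not to order) are assumptions introduced here.

```python
from fractions import Fraction

def lump_sum_policy(K, c, pi, max_level=8):
    """Return {i: y(i)} by enumerating g(y|i) for y = i, i + 1, ..., max_level."""
    p = {4: Fraction(1, 2), 8: Fraction(1, 2)}        # demand probabilities (16)

    def L(y):                                         # expected penalty cost (18)
        return sum(pi * pq for q, pq in p.items() if q > y)

    def g(y, i):                                      # expected cost (19)
        ordering = 0 if y == i else K + c * (y - i)   # ordering cost (17)
        return ordering + L(y)

    policy = {}
    for i in range(max_level + 1):
        # Minimize g(y|i); ties are broken in favor of the smaller y.
        policy[i] = min(range(i, max_level + 1), key=lambda y: (g(y, i), y))
    return policy

first = lump_sum_policy(K=Fraction(9, 2), c=1, pi=12)   # first example
second = lump_sum_policy(K=1, c=1, pi=5)                # second example
print("first example: ", first)
print("second example:", second)
```

In the first example the enumeration orders up to 8 only for i = 1, 2, 3, 7, and in the second it orders only for i = 3 (up to 4) and i = 7 (up to 8), matching (24) and (25).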

Although the policies in (24) and (25) are optimal, you may have difficulty in 
explaining to an “experienced manager” that he should not order when initial 
inventory i = 0, even though it is to his benefit to place an order (and incur a 
setup cost) when initial inventory i = 3. We now specify sufficient conditions on 
the cost functions to ensure that an “easy-to-explain” (s, S) policy is optimal. 


*Optimal (s, S) policy. First, we need to review the notion of a convex 
function from Chap. 9. A function L(y) defined for integer values of y is said to 
be convex if 

(26)  $L(y + 1) - L(y) \ge L(y) - L(y - 1) \quad \text{for all } y$   (convex function). 

(You will find a number of convex functions illustrated on pp. 293-294.) 

Suppose that the expected cost function is the sum of an ordering cost and a 
term representing expected holding and penalty costs: 

(27)  $g(y \mid i) = c(y - i) + L(y).$ 

Let the ordering cost c(y − i) consist of a setup cost K ≥ 0 and a unit purchase 
cost c ≥ 0: 

(28)  $c(y - i) = \begin{cases} 0 & \text{for } y = i \ (x = 0) \\ K + c \cdot (y - i) & \text{for } y > i \ (x = y - i > 0) \end{cases}$   (ordering cost). 

Assume that cy + L(y) grows without bound as |y| → ∞, and that the expected 
holding and penalty cost function L(y) is convex. 

Suppose that holding cost is expressed as an increasing function h(y − q) of 
inventory left over (y − q) provided actual demand q ≤ y; in formal terms, 
assume 

$h(j) \begin{cases} \ge 0 & \text{for } j \ge 0 \\ = 0 & \text{for } j < 0, \end{cases}$ 

and h(j) is increasing for j ≥ 0. Similarly, suppose the penalty cost is an increasing 
function π(q − y) of unfilled demand (q − y) provided q > y; again, in formal 
terms, assume 

$\pi(j) \begin{cases} \ge 0 & \text{for } j \ge 0 \\ = 0 & \text{for } j < 0, \end{cases}$ 

and π(j) is increasing for j ≥ 0. Then the expected holding and penalty cost 
function is 

(29)  $L(y) = \begin{cases} \sum_{q=0}^{y} h(y - q)\,p(q) + \sum_{q>y} \pi(q - y)\,p(q) & \text{for } y \ge 0 \\ \sum_{q \ge 0} \pi(q - y)\,p(q) & \text{for } y < 0 \end{cases}$   (expected holding and penalty cost function). 


It can be proved that: if the sum of the actual holding and penalty cost functions 
h(j) + π(−j) is convex for every integer j, then the expected holding and penalty cost 
function L(y) in (29) is convex. [When holding cost is to be assessed on the level of inventory 
before demand occurs, the function h(y) replaces the first summation in (29).] 
The following result is easy to demonstrate. 

Optimality of (s, S) Policy for Static Model. Given the cost functions in (27) 
and (28), and that L(y) is convex, then the form of an optimal policy is (s, S), defined 
in (1). Further, the value for the reorder level S does not depend on the setup cost K 
in (28), and if K = 0, then the reorder point s = S. 

Knowing the form of an optimal policy makes the computation of the rule 
much simpler, for you only need to find the values of the reorder point s and the 
reorder level S. You already saw how simple the computations can be when you 
considered the case of linear holding and penalty cost functions. (The above 
optimal policy theorem remains true under more general hypotheses about the 
expected cost function, but we do not explore these variations here.) 


► A proof of the optimal policy theorem as well as a general computational technique 
can be established by the following line of argument. Observe first that 

(i)  $f(i) = \min_{y \ge i}\; g(y \mid i)$ 

(ii)  $f(i) = \min \begin{cases} K - ci + \min_{y>i}\,[\,cy + L(y)\,] \\ L(i). \end{cases}$ 


Since both cy and L(y) are convex, their sum is convex and by assumption increases 
without bound as |y| gets very large. Consequently, there exists an S such that 

(iii)  $\min_{y}\,[\,cy + L(y)\,] = cS + L(S).$ 

Further, a locally optimal value S is also globally optimal. 

Suppose i ≥ S; then according to (iii), 

(iv)  $\min_{y>i}\,[\,cy + L(y)\,] \ge ci + L(i) \quad (i \ge S)$ 

so that 

$K - ci + \min_{y>i}\,[\,cy + L(y)\,] \ge L(i) \quad (i \ge S).$ 

Therefore from (ii), you have for i ≥ S 

(v)  f(i) = L(i),  y(i) = i and x(i) = 0. 

Now suppose i < S; then according to (ii) and (iii), 

(vi)  $f(i) = \min \begin{cases} K - ci + cS + L(S) & \text{for } y = S \\ L(i) & \text{for } y = i. \end{cases}$ 


Consider the case K = 0; (iii) implies that 

(vii)  $L(i) \ge -ci + cS + L(S),$ 

so that the minimum in (vi) is given by 

(viii)  f(i) = c(S − i) + L(S),  y(i) = S and x(i) = S − i for i ≤ S and K = 0. 

Next consider the case K > 0; now L(i) may be the smaller-valued term in (vi) for i 
near S. Let s be the smallest number such that 

(ix)  $L(s) \le K - cs + cS + L(S),$ 

or, equivalently, 

(x)  $L(s) + cs \le K + cS + L(S).$ 

Then (v) also holds for i ≥ s. But for i < s, 

(xi)  f(i) = K + c(S − i) + L(S),  y(i) = S and x(i) = S − i. 

In summary, the values for the reorder level S and the reorder point s in (1) are found 
according to (iii) and (x), and 

(xii)  $f(i) = \begin{cases} K + c\,(S - i) + L(S) & \text{for } i < s \\ L(i) & \text{for } i \ge s. \end{cases}$ 



We gave simple formulas (6) and (7) for finding an optimal value of S in the case of 
the linear holding and penalty cost model. The result can be derived as follows. 

A consequence of (iii) is that S must satisfy 

(xiii)  $[\,c(S + 1) + L(S + 1)\,] \ge [\,cS + L(S)\,]$ 

or 

(xiv)  $L(S + 1) - L(S) \ge -c.$ 

The holding cost component on the left of (xiv) is 

(xv)  $h\Big[\sum_{q=0}^{S+1} (S + 1 - q)\,p(q) - \sum_{q=0}^{S} (S - q)\,p(q)\Big] = h\Big[\sum_{q=0}^{S} p(q)\Big] = h\,P(S).$ 

Similarly, the penalty cost component on the left of (xiv) is 

(xvi)  $\pi\Big[\sum_{q>S+1} (q - S - 1)\,p(q) - \sum_{q>S} (q - S)\,p(q)\Big] = \pi\Big[-\sum_{q>S} p(q)\Big] = -\pi\,[\,1 - P(S)\,].$ 

Adding the rightmost terms in (xv) and (xvi), the inequality in (xiv) becomes 

(xvii)  $(h + \pi)\,P(S) - \pi \ge -c,$ 

which simplifies to (7). 
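
For completeness, the simplification from (xvii) to (7) can be written out as below. This is a short worked step, relying only on the assumption stated earlier that h + π > 0, so that dividing by h + π preserves the direction of the inequality.

```latex
\begin{align*}
(h + \pi)\,P(S) - \pi &\ge -c \\
(h + \pi)\,P(S)       &\ge \pi - c \\
P(S)                  &\ge \frac{\pi - c}{\pi + h} = R .
\end{align*}
```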

Consider the linear holding and penalty cost case in which p(q) is a probability density 
function, the variable y is continuous, and 

$L(y) = \begin{cases} \int_{0}^{y} h\,(y - q)\,p(q)\,dq + \int_{y}^{\infty} \pi\,(q - y)\,p(q)\,dq & \text{for } y \ge 0 \\ \int_{0}^{\infty} \pi\,(q - y)\,p(q)\,dq & \text{for } y < 0. \end{cases}$ 

Then if dL(y)/dy ≡ L'(y) exists, the y = S that minimizes [cy + L(y)] must satisfy 

(xviii)  $c + L'(y) = 0.$ 

It can be shown by advanced calculus that, for y ≥ 0, 

(xix)  $L'(y) = h\,(y - y)\,p(y) + \int_{0}^{y} h\,p(q)\,dq - \pi\,(y - y)\,p(y) - \int_{y}^{\infty} \pi\,p(q)\,dq = (h + \pi) \int_{0}^{y} p(q)\,dq - \pi.$ 

Therefore from (xviii) and (xix), S satisfies 

(xx)  $\int_{0}^{S} p(q)\,dq = \dfrac{\pi - c}{\pi + h} = R.$ 

The value of s is found by solving (11) expressed as an equality. 
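
As a rough numerical illustration of the continuous case, the sketch below solves P(S) = R by inverting a cumulative distribution function. The normal demand distribution, its mean of 100 and standard deviation of 20, and the function name `continuous_reorder_level` are all assumptions made for this example; they do not come from the text. A normal distribution also assigns a small probability to negative demand, so it is only an approximation to a density defined for q ≥ 0.

```python
from statistics import NormalDist

def continuous_reorder_level(h, pi, c, demand):
    """Return (S, R), where S solves P(S) = R for the given demand distribution."""
    R = (pi - c) / (pi + h)           # critical ratio (6)
    return demand.inv_cdf(R), R       # inv_cdf inverts the cumulative distribution

# Hypothetical demand: normal with mean 100 and standard deviation 20.
demand = NormalDist(mu=100, sigma=20)
S, R = continuous_reorder_level(h=5, pi=10, c=0, demand=demand)
print(f"R = {R:.3f}, S = {S:.1f}")
```

Since R exceeds 1/2 in this example, the computed S lies above the mean (and median) of the assumed distribution, in line with observation (iii) following (7).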

Throughout this section the holding cost formulas have been based on the value of 
inventory at the end of the horizon. If the holding cost is linear and assessed on the 
value of y, then 

$R = \dfrac{\pi - (c + h)}{\pi}.$ 

If the holding cost is linear and assessed on the expected average value of inventory, 
namely [.5y + .5(y − q)], then 

$R = \dfrac{\pi - (c + .5h)}{\pi + .5h}.$ ◄ 

19.5 ECONOMIC ORDER QUANTITY MODELS 

The assumptions to be made here depart substantially from those employed 
in the models of the preceding section. Now consider the situation in which the 
horizon is unbounded so that an infinite number of replenishments will occur. 
Having complicated the analysis by introducing a dynamic factor, let us make an 
offsetting simplifying assumption that demand is perfectly forecastable at a 
stationary rate: 

M = number of items demanded per unit of time. 

The particulars of any actual application will dictate a convenient unit of time 
to use; but for the sake of definiteness in the discussion to follow, let the unit be 
a week. Then if M = 30, the demand assumption is that exactly 30 units are 
demanded in one week, 60 in two weeks, 15 in a half a week, etc. 

We pause momentarily to mention that most textbooks treat the model below 
in chapters on deterministic inventory models, like Chap. 9. But we have waited 
until now to discuss this model in detail because in practice it almost always is 
modified to take account of uncertainties in demand. So you should look upon 
this section as establishing the groundwork for the modified versions that are 
presented in the next section. 

In the lot-size models below, both time itself and the inventory level are treated 
as continuous variables. Note that this approach is in sharp contrast to most of 
the dynamic models you have studied in previous chapters. The impact of the 
continuity postulates is easily conveyed by a diagram. Suppose again that M = 30 
items a week and the replenishment rule is to order 45 whenever the inventory 
reaches the 0 level. Assume for this illustration that there is no lead time, so that 
the items arrive immediately after an order is placed. Then the pattern showing 
inventory on hand at any moment in time appears in Fig. 19.3. Notice three 
features of this sawtooth pattern: 

(i) The inventory repeatedly drops from the level 45 continuously to the 
level 0 as time moves forward continuously. 

(ii) The downward slope of the sawtooth is −30, representing the rate of 
withdrawal due to demand M. 

(iii) A replenishment occurs every 45/30 = 1.5 weeks. 

Figure 19.3. Sawtooth Pattern of Inventory (S = Q = 45, s = 0, M = 30). 
Note: M = demand rate; s = reorder point; S = reorder level; Q = S − s = reorder quantity. 
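
The sawtooth pattern can be reproduced with a one-line formula for the inventory on hand at time t. The function name `inventory_on_hand` is an illustrative assumption; the sketch takes the lead time to be zero and assumes that a replenishment of Q units arrives at t = 0 and again each time the level reaches 0.

```python
def inventory_on_hand(t, Q=45, M=30):
    """Inventory level at time t (in weeks) for order quantity Q and demand rate M."""
    cycle = Q / M             # weeks between replenishments (1.5 in the example)
    elapsed = t % cycle       # time since the most recent replenishment
    return Q - M * elapsed    # level falls at rate M within each cycle

for t in (0, 0.5, 1, 1.5, 2, 3):
    print(f"t = {t:>3} weeks: inventory = {inventory_on_hand(t)}")
```

At the instant of a replenishment (t = 1.5, 3, and so on) the sketch reports the level just after the order arrives, that is, 45 rather than 0.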

Clearly, you can rarely be certain that demand behaves in so precise a manner. 
Both the stationarity and perfect predictability of demand are severe postulates. 
Nevertheless, many industrial firms have been able to employ these models and 
have thereby realized substantial inventory cost savings. To do so, however, the 
models below are usually modified so that demand is treated probabilistically. 
The generalization required to handle uncertain demand is explained in the 
next section. You will find it helpful in preparing for that section to first study how 
an optimal inventory policy can be obtained when demand is perfectly predictable. 

Let the ordering cost for items consist of a setup cost K ≥ 0 plus a unit purchase
cost c ≥ 0 times the amount ordered x:

(1)  $C(x) = \begin{cases} 0 & \text{for } x = 0 \\ K + cx & \text{for } x > 0, \end{cases}$


and let the holding cost be linear, where 

(2) h = cost for each item per unit of time (a week) 


and h ≥ 0. Observe that the costs in (1) and (2) are stationary (unchanging over
time). Also note that (2) implies it is equally costly to hold five items in inventory
for one week, or one item for five weeks, or 2.5 items for two weeks, or two items
for 2.5 weeks, etc. Ordinarily, h is a function of the purchase cost c, such as
h = h′ · c, but to keep the notation simple, only the symbol h will be used. As
you study the formulas, remember that a change in c will typically imply a
corresponding change in the value of h. Without much difficulty, you can analyze
models with more complicated cost assumptions; for example, the ordering cost
function can exhibit quantity discounts. But such cases will not be treated in this 
chapter. (See exercise 52.) 

Assuming repeated replenishments are to occur, suppose next that all demand 
must be met. (The formulas to follow actually indicate when it is preferable not 
to stock an item.) You can allow a backlog to build up, however, before placing 
a replenishment order; in such a case, the entering inventory would be immediately 
allocated to the backlogged demand, and the remainder then used for future 
demand. A backlog condition is represented by a negative value for i. For many
reasonable stationary functions representing the penalty cost of a backlog, an
optimal policy will be of the (s, S) form:

(3) when inventory level i = s, order Q = S − s.

The form in (3) has been written more succinctly than the description of an (s, S)
policy in the preceding section. The reason is that if, at any instant, the inventory
level equals the reorder level S, then Q/M weeks later, it will exactly equal
the reorder point s, because of the continuity and demand rate assumptions.

After a few moments' thought, you can see that a strictly positive value of s is not
optimal, for then there would always be at least s > 0 units of inventory on hand
that incur a holding cost and serve no function in meeting demand. By the same
token, given that repeated replenishments are to occur, you can conclude that, for
a reasonable penalty cost function, a strictly negative value of S is not optimal.
Why? Consequently, the search for optimal values of s and S can usually be
restricted to s ≤ 0 and S ≥ 0.

Assuming that lead time is 0, so that a replenishment occurs as soon as the
reorder decision is made, the resultant sawtooth cyclic pattern for a general
(s, S) policy is shown in Fig. 19.4. (Actually, the essential lead-time assumption
underlying the pattern in Fig. 19.4 is that lead time is a known constant value.
If you know that lead time is L, and you want the order to arrive at the moment
when inventory equals s, then you would place the order L weeks earlier. So if
LM < Q, the reorder point is simply LM plus the value of s as determined for
L = 0.) Note that when the reorder point s < 0, demand continues at the rate M
per unit of time, so that the slope of the inventory function in Fig. 19.4 remains
−M throughout each cycle.

Finally, assume that the optimality criterion is minimum cost per unit of time. 
Provided the item is to be stocked, the purchasing and holding cost contribution 
to the average cost per unit of time is 


(4)  $\dfrac{KM}{Q} + cM + \dfrac{hS^2}{2Q}.$


Each term in (4) can be derived as follows: 


(i) Since M/Q is the average number of setups per unit of time, KM/Q is the
average setup cost per unit of time.

(ii) Since all demand must be met, cM is the average purchase cost per unit
of time.

(iii) Given the sawtooth pattern in Fig. 19.4, inventory is positive for a fraction
of time equal to S/Q. The average level of inventory, during the interval
when inventory is positive, is S/2. Consequently, (S/Q) · (S/2) represents
the average level of inventory per unit of time, and hS²/2Q is the
corresponding holding cost.

If the reorder point s < 0 so that a backlog builds up before a replenishment
occurs, then an average penalty cost should be added to (4) to give the overall
average cost per unit of time. Since in this model any backlogged demand is
filled by the next replenishment, no sales are lost and the penalty cost will not
include a factor representing lost profit. But a backlog condition frequently does
require extra paper work to maintain the records of which customers are awaiting
delivery. Further, there may be a loss in good will if a customer has to wait at all
for delivery. In these situations you would expect that the penalty cost contains
a factor proportional to the size of the backlog right before a replenishment,
namely, the amount (−s). In contrast, if the item is being used by personnel
within the company, then frequently the penalty cost includes a component that
is proportional not only to the size of the backlog but also to the length of time
the backlog condition is present. The implications of each of these two types of
penalty costs are explored below. In order not to obscure their effect on the
selection of an optimal policy and also to keep the mathematical manipulations simple,
the two types of penalty costs will be handled separately. They can, of course,
be combined into a single model, but we will not do so in this text.


Size-of-backlog penalty cost. Suppose here that each time a backlog
occurs the associated cost is π times the maximum amount short (−s), where
π > 0 and s ≤ 0. Then the cost π(−s) is incurred once during each cycle, and
so the average shortage cost per unit of time is

(5)  $\pi(-s)\,\dfrac{M}{Q}.$

Since Q = S − s, you can substitute (Q − S) for (−s) in (5); verify that doing so
and adding the result to (4) yields

(6)  average cost per unit of time $= AC = \dfrac{KM}{Q} + cM + \dfrac{hS^2}{2Q} + \pi M - \dfrac{\pi M S}{Q}.$

The discussion below is considerably simplified if you assume further that the
backlog cost factor $\pi > \sqrt{2Kh/M}$; this postulate is not very restrictive, since a
backlog cost factor is usually large.

Given that the item is to be stocked, it is shown in the special material below that
the optimal value for S is Q, and therefore s = 0. But it may not be optimal to
keep the item in inventory at all.

To begin, suppose S = Q is optimal. Then (6) simplifies to

(7)  $AC = \dfrac{KM}{Q} + cM + \dfrac{hQ}{2}.$

The optimal Q is found by differentiating AC with respect to Q, setting the
derivative equal to 0, and solving for Q, giving

(8)  optimal $Q = \sqrt{\dfrac{2KM}{h}}$,  minimal $AC = cM + \sqrt{2KhM}.$

The formula for the optimal Q is frequently called the economic order
quantity, or EOQ. (Sometimes you will find it referred to as the lot-size
formula, or the Wilson lot size.)

If the item is not stocked, then there is a cost associated with not meeting the
demand of M per unit of time. For simplicity, suppose that cost is represented by
$\hat{\pi}M$. In this instance, you would ordinarily include in $\hat{\pi}$ a factor indicating lost
profit, so that $\hat{\pi}$ need not equal π. The economic test is

(9)  if $\hat{\pi}M < cM + \sqrt{2KhM}$, do not stock,
     and otherwise order according to (8).
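
The lot-size formula, the associated average cost, and the stocking test can be tried numerically with a short sketch. The function names and the argument `not_stock_cost_per_unit` (the per-unit cost of unmet demand when the item is not stocked) are illustrative choices, and the data are the cost figures that the text uses later in its numerical example.

```python
import math


def eoq(K, M, h):
    """Economic order quantity from (8): sqrt(2KM/h)."""
    return math.sqrt(2.0 * K * M / h)


def average_cost(Q, K, M, h, c):
    """Average cost per unit of time from (7), valid when S = Q and s = 0."""
    return K * M / Q + c * M + h * Q / 2.0


def should_stock(K, M, h, c, not_stock_cost_per_unit):
    """Economic test (9): stock unless not stocking is strictly cheaper."""
    minimal_ac = c * M + math.sqrt(2.0 * K * h * M)
    return not_stock_cost_per_unit * M >= minimal_ac


K, M, h, c = 31.25, 2.0, 5.0, 1.0
Q_star = eoq(K, M, h)
print(Q_star, average_cost(Q_star, K, M, h, c))
print(should_stock(K, M, h, c, not_stock_cost_per_unit=20.0))
```

For these data the order quantity is 5 and the minimal average cost is 27 per week, so stocking is worthwhile whenever unmet demand would cost at least 13.5 per item.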

►The following abbreviated argument underlies a proof that the optimal S is either Q
or 0, given that $\pi > \sqrt{2Kh/M}$. Define f such that

(i)  $S = fQ$

and, consequently, 0 ≤ f ≤ 1, since s ≤ 0 and S ≥ 0. Further let C(f) = cM if f > 0,
and 0 if f = 0. Then substituting (i) into (6), the optimization problem can be stated
as minimizing

(ii)  $AC(Q, f) = \dfrac{KM}{Q} + C(f) + \dfrac{hf^2 Q}{2} + \pi M(1 - f).$

A necessary condition for the optimal Q is that it satisfy

(iii)  $\dfrac{\partial AC}{\partial Q} = 0$ for f > 0,

yielding the condition

(iv)  optimal $Q(f) = \dfrac{1}{f}\sqrt{\dfrac{2KM}{h}}.$

Substituting (iv) into (ii) gives AC[Q(f), f], which is a linear function of f.
Calculating dAC/df for f > 0 results in

(v)  $\dfrac{dAC}{df} = \sqrt{2KhM} - \pi M$ for f > 0,

so that dAC/df < 0 since $\pi > \sqrt{2Kh/M}$ by assumption. Then f = 1 is optimal provided
AC[Q(1), 1] is no greater than the cost of not stocking, which is the comparison in (9).

We also mention that this model is closely related to the case in which demand is lost
when there is no inventory on hand. Assuming that each unit of demand lost reduces
profit by π, the optimal policy again is to either let S = Q (no demand is lost), or S = 0
(all demand is lost). ◄
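
The substitution step in the argument above, from (iv) to (v), can be written out explicitly. This is a worked sketch that assumes the average cost has the form used in that argument, with setup term KM/Q, holding term hS²/2Q evaluated at S = fQ, and backlog term πM(1 − f).

```latex
\[
\begin{aligned}
\frac{KM}{Q(f)} + \frac{h f^{2} Q(f)}{2}
  &= f\sqrt{\frac{KhM}{2}} + f\sqrt{\frac{KhM}{2}} = f\sqrt{2KhM},
  \qquad Q(f) = \frac{1}{f}\sqrt{\frac{2KM}{h}},\\[4pt]
AC[Q(f), f] &= f\sqrt{2KhM} + cM + \pi M(1 - f) \qquad (f > 0),\\[4pt]
\frac{dAC}{df} &= \sqrt{2KhM} - \pi M .
\end{aligned}
\]
```

The cost is linear in f, so its minimum over 0 < f ≤ 1 lies at an end point, and the sign of the slope decides which one.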

Sensitivity analysis of EOQ. Notice that the EOQ increases less
than proportionately with increases in the setup cost K and the demand rate M,
and decreases in the holding cost h. For example, if the rate M quadruples, then
the EOQ only doubles. Analogously, if the value of h quadruples, then the EOQ
only decreases by a half. Observe that the time interval between orders is

(10)  $T = \dfrac{Q}{M} = \sqrt{\dfrac{2K}{Mh}}.$

Thus as the demand rate M gets larger, not only does the optimal Q get larger but
the optimal time interval between orders T gets smaller so that ordering occurs
more frequently.

In many situations the vendor requires that the amount ordered be a
“convenient” number. To illustrate, if the optimal Q = 53, you may have to place
an order for either 50 or 60. You should note that there is very little impact on
average cost when only a near-optimal Q is employed. Let

(11)  actual $Q = r \cdot \text{optimal } Q = r\sqrt{2KM/h},$

where r > 0, and let the variable cost be written as

(12)  $VC(r) = \dfrac{KM}{\text{actual } Q} + \dfrac{h \cdot \text{actual } Q}{2} = \dfrac{KM}{r\sqrt{2KM/h}} + \dfrac{h \cdot r\sqrt{2KM/h}}{2},$

so that the fixed purchase cost component cM is omitted. Verify from (12) that

(13)  $\dfrac{VC(r)}{VC(1)} = \dfrac12\Big(\dfrac1r + r\Big).$

As you can see by calculating (13) for trial values of r, the increase in variable cost
from a near-optimal Q is slight. For example,

(14)  $\dfrac{VC(r)}{VC(1)} \le 1.1$ for $.64 \le r \le 1.56.$

[Also note in (13) that the ratio has the same value for a given r and its reciprocal
r′ = 1/r.]
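
The insensitivity of cost to a near-optimal order quantity can be seen by tabulating the cost ratio for a few trial values of r. The sketch assumes the variable cost KM/Q + hQ/2 of the lot-size model, for which the ratio of the cost at Q = r · EOQ to the cost at the EOQ reduces to (1/2)(r + 1/r); the function name is illustrative.

```python
def cost_ratio(r):
    """Variable cost with Q = r * EOQ, relative to the variable cost at the EOQ."""
    return 0.5 * (r + 1.0 / r)


# The "convenient" order sizes 50 and 60 when the optimal quantity is 53.
for actual_q in (50, 60):
    r = actual_q / 53
    print(actual_q, round(cost_ratio(r), 4))

# A given r and its reciprocal give the same ratio.
print(cost_ratio(2.0), cost_ratio(0.5))
```

Either rounded order size raises the variable cost by less than 1 percent, and even doubling or halving the order quantity raises it by only 25 percent.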

►The analogous model in which both time and the items are in discrete units was
previously analyzed by means of a regeneration approach, and is contained in the
advanced material of Sec. 11.4. For ease of comparison, the model is briefly summarized
here, employing the notation used in this chapter.

For the sake of definiteness, let M (an integer) be the amount of demand per week
and

C_T = total cost associated with ordering MT units of the item,

where T = 1, 2, .... Therefore C_T represents the entire cost over T weeks, and

(i)  $C_T = K + cMT + \dfrac{hMT(T - 1)}{2}.$

The objective is

(ii)  minimize $\dfrac{C_T}{T}$ = minimize $\dfrac{K}{T} + cM + \dfrac{hM(T - 1)}{2}.$

Ignore the integer-value constraint on T momentarily, differentiate the expression
on the right of (ii), and set the result equal to zero, obtaining

(iii)  $T^* = \sqrt{\dfrac{2K}{Mh}},$

which is the same value as (10). Accordingly

(iv)  $Q^* = MT^* = \sqrt{\dfrac{2KM}{h}}.$

Since T* is usually not an integer, calculate the expression on the right of (ii) for the
next smallest and largest integers and select the better value.

To check your understanding of the derivation of the average cost formula in (6),
find C_T for the continuous model and calculate C_T/T. You should get an expression
equivalent to (6) after MT has been substituted for Q. ◄
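
The rounding step for the discrete model can be sketched as follows. The holding-cost convention here is an assumption rather than something the source confirms: h is charged on the inventory remaining at the end of each week, so an order covering T weeks incurs hM[(T − 1) + (T − 2) + ... + 0]. All names are illustrative.

```python
import math


def weekly_cost(T, K, M, h, c):
    """Average cost per week when each order covers T weeks of demand.

    Assumption: holding cost is charged on end-of-week inventory,
    M*(T-1) + M*(T-2) + ... + 0 over the T weeks.
    """
    holding = h * M * T * (T - 1) / 2.0
    return (K + c * M * T + holding) / T


def best_integer_interval(K, M, h, c):
    """Round the continuous T* = sqrt(2K/(Mh)) down and up; keep the cheaper."""
    t_star = math.sqrt(2.0 * K / (M * h))
    candidates = {max(1, math.floor(t_star)), max(1, math.ceil(t_star))}
    return min(candidates, key=lambda T: weekly_cost(T, K, M, h, c))


print(best_integer_interval(K=31.25, M=2, h=5.0, c=1.0))
```

With K = 31.25, M = 2, and h = 5 the unconstrained interval is 2.5 weeks, so only T = 2 and T = 3 need to be compared.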


Average-backlog penalty cost. Now suppose the penalty cost is
assessed against the average level of backlog per unit of time, analogous to the
inventory holding charges. Looking again at Fig. 19.4, you will see that a backlog
condition exists for a fraction of time −s/Q, where the reorder point s ≤ 0. The
average level of backlog, during the interval when inventory is negative, is −s/2.
Consequently, the average level of backlog per unit of time is s²/2Q, which must
be multiplied by the corresponding penalty cost. It is convenient for the purpose
of comparison to again let the penalty cost be denoted by the symbol π > 0.
But you should realize that the cost measurement units are different in the
previous case than they are here. Specifically, now π represents the cost per item
per unit of time the backlog is present, whereas previously π was simply the cost
per item backlogged. Using the identity (S − s = Q) to eliminate s, you obtain
after simplification

(15)  average cost per unit of time $= AC = \dfrac{KM}{Q} + cM + \dfrac{(h + \pi)S^2}{2Q} - \pi S + \dfrac{\pi Q}{2}.$

(Here it is not necessary to explicitly assume that π is large relative to the other
costs.)

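Taking the partial derivative of AC in (15) with respect to S, setting the result
equal to 0, and simplifying yields

(16)  optimal $S = \dfrac{\pi}{h + \pi}\,Q.$

Doing the same with respect to Q and using (16) to eliminate S gives

(17)
  optimal $Q = \sqrt{\dfrac{2KM}{h}}\,\sqrt{\dfrac{h + \pi}{\pi}}$
  optimal $S = \sqrt{\dfrac{2KM}{h}}\,\sqrt{\dfrac{\pi}{h + \pi}}$
  optimal $s = S - Q = -\sqrt{\dfrac{2KM}{h}}\,\dfrac{h}{\sqrt{\pi(h + \pi)}}$
  minimal $AC = cM + \sqrt{2KhM}\,\sqrt{\dfrac{\pi}{h + \pi}}.$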

What is the optimal time interval between successive orders? As in the previous 
case, the decision of whether to stock at all depends on a comparison of the minimal 
AC in (17) with the cost of not meeting demand. 
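
The closed-form policy for the average-backlog model can be checked against the cost function it is meant to minimize. The sketch below rederives nothing new: it assumes the average cost is the setup, purchase, and holding cost of the lot-size model plus π times the average backlog s²/2Q, and it uses the stationary point obtained by setting both partial derivatives to zero. Function names are illustrative.

```python
import math


def backlog_policy(K, M, h, c, pi):
    """Optimal (Q, S, s) and minimal average cost for the average-backlog model."""
    base = math.sqrt(2.0 * K * M / h)          # ordinary EOQ
    Q = base * math.sqrt((h + pi) / pi)
    S = base * math.sqrt(pi / (h + pi))
    s = S - Q                                  # reorder point, never positive
    min_ac = c * M + math.sqrt(2.0 * K * h * M * pi / (h + pi))
    return Q, S, s, min_ac


def average_cost(Q, S, K, M, h, c, pi):
    """Average cost per unit of time for arbitrary Q and S, with s = S - Q."""
    s = S - Q
    return K * M / Q + c * M + h * S**2 / (2.0 * Q) + pi * s**2 / (2.0 * Q)


Q, S, s, ac = backlog_policy(K=31.25, M=2.0, h=5.0, c=1.0, pi=5.0)
print(round(Q, 3), round(S, 3), round(s, 3), round(ac, 3))
print(round(Q / 2.0, 3))   # time between orders, Q / M, with M = 2
```

With h equal to π the reorder level is exactly half the order quantity, so a backlog exists during the second half of every cycle, and the time between successive orders is Q/M.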

As you were already cautioned, the measurement unit for the penalty cost in
(17) is not the same as in (8), since here π represents the cost per item per unit
of time a backlog is present, whereas previously π represented the cost per item
backlogged. Nevertheless, it is instructive to compare the solutions in (17) and (8)
assuming that π has the same numerical value in both sets of formulas. Specifically
notice in (17) that for a finite value of the penalty cost π, the optimal order
quantity Q is larger, the optimal reorder level S is smaller, the optimal reorder
point s is smaller (actually negative), and the minimal average cost is smaller.
Only when π is infinite do the values in (17) equal the corresponding solution in
(8). Also, only when the holding cost factor h is infinite is the optimal S = 0;
but even then, Q and s are finite.

Study (17) to see how each optimal quantity varies with changes in the values
of K, h, π, and M. In particular, note that the optimal order quantity decreases
as the penalty cost π increases, and the optimal reorder point s decreases as either
the setup cost K or the demand rate M increases. Observe from (16) that if
h = π, then there is a backlog condition during the second half of the interval
between successive orders.

*Production lot sizes. So far the discussion of economic order quantity
models has been in the context of a company replenishing its inventory by
ordering an item from a vendor. Accordingly, the inventory policies indicated when
and how much to reorder. Let us now turn to a related problem of a company that
manufactures certain of its items in lot-size quantities. Many of the economic
considerations encompassed in the preceding models are just as relevant in this
situation: a setup cost, the direct manufacturing costs, and holding and shortage
costs. Assuming the stationarity and demand assumptions remain reasonable,
can the (s, S) policies derived in (8) and (17) be used here as optimal lot sizes and
production trigger points? Sometimes, but not usually.

The significant difference between ordering an item from a vendor and
producing it in your own plant is that a production activity requires the use of other
limited resources, namely, men, equipment, and component materials. Most
manufacturing plants carefully schedule ahead, taking into account available
manpower and equipment capacity limitations, and ensuring that sufficient
materials are on hand. Consequently, if an economic lot-size approach is used for
all the manufactured items and is based on the above models (or those in the
next section), the result may not yield a feasible overall schedule for a production
plant. In short, a single-item EOQ model applied to a multitude of internally
manufactured items is typically a poor approximation to reality, for it ignores the
operating constraints on scarce resources.

What is required in a production setting is a dynamic planning model that 
combines the cost elements in EOQ models with the constrained multi-item
considerations. The component time periods in the model must be short enough so
that the plan can be implemented by production decisions. Several breakthroughs 
have occurred in operations research models capable of handling these situations, 
but the complexity of the analyses goes beyond the scope of this text. 

*Raw materials inventories. A related area of inventory decisions is
the stockage of raw materials to support a manufacturing firm’s production
activities. Here too the economic considerations encompassed in the EOQ models
are relevant to storing raw materials. But analogous to the production lot-size
situation, the manufacturing activity itself, through a production schedule, also
exerts an influence on the inventory policy. Only in the case of continuous and
nearly level production would you expect an EOQ approach to be useful. A more
widely applicable operations research approach for the stockage of raw materials
is a dynamic programming model like one of those in Chap. 9. Such a model
employs the production schedule requirements as “demand” data, and while
minimizing costs, at the same time ensures that the required raw materials are on
hand when needed.

19.6 STOCHASTIC DYNAMIC CONTINUOUS 
REVIEW MODEL 

This section extends the preceding economic lot-size model analysis to include
probabilistically described demand. Continue to assume that time is a continuous
variable; the ordering and holding costs are stationary

(1)  $C(x) = \begin{cases} 0 & \text{for } x = 0 \\ K + cx & \text{for } x > 0 \end{cases}$  (ordering cost)

(2)  h = holding cost for each item per unit of time,

where K > 0, c > 0, and h > 0. Suppose that any demand occurring when the
inventory level is 0 is backlogged and eventually filled. Assume here that the
penalty cost π > 0 is stationary and proportional to the size of a backlog just
before a replenishment order arrives. Now the objective is to minimize expected
average cost per unit of time. Let

(3)  M = expected number of items demanded per unit of time
     L = lead time, the length of the interval between placing and
         receiving an order
     M_L = expected number of items demanded during an interval
         of L units of time,

where L is fixed and known. Since “the expected value of a sum is always equal
to the sum of the expected values,”

(4)  $M_L = ML.$

If the lead time L = 0 (an order arrives immediately after it is placed), then
the analysis of the average cost formula (6) in the preceding section needs to be
modified only slightly. Provided the items should be stocked, and the optimal
reorder point s = 0, the optimal order quantity and minimal cost formulas in
(8) above apply, where the cost expression now represents an expected value.
The reason is that when L = 0, there is never an opportunity for a backlog
condition to arise.

If the lead time L > 0, however, the situation is quite different. Let

q_L = actual demand during an interval between placing and receiving an order.

Here the random variable q_L may exceed s, which is the level of inventory at the
start of the interval, so that a backlog condition occurs. How then should Q and
s be chosen optimally?

A naive approach is to calculate Q according to the economic order quantity
formula (8) above, and to pick s using the static model analysis in Sec. 19.4, in
which the demand probability distribution relates to demand during lead time
and the expected holding and penalty cost function L(y) is given by (3) in that
section. Then S = s + Q.

For example, suppose

(5)  purchase cost: c = 1    setup cost: K = 31.25    holding cost: h = 5
     penalty cost: π = 20    demand rate: M = 2       lead time: L = 1.

Verify that

(6)  $Q = \sqrt{\dfrac{2KM}{h}} = \sqrt{\dfrac{2(31.25)(2)}{5}} = 5.$

Suppose further that the probability distribution of demand during a lead
time is

(7)  $p_L(q_L) = \tfrac15$ for $q_L = 0, 1, \ldots, 4$,  $E[q_L] = M_L = 2.$

The distribution in (7) is the same as that in (4) of Sec. 19.4, and as the entries
in Fig. 19.1 show for penalty cost factor π = 20,

(8)  reorder point s = 3 so that reorder level S = s + Q = 8.

Therefore this naive analysis leads to the policy (s, S) = (3, 8).

The logical error in the above approach is that the analysis ignores the cost
interaction between the reorder point s and the order quantity Q. Specifically,
if you keep s fixed and let Q get larger, then the actual penalty cost component of
average cost will decrease. The reason is there are fewer replenishments and
corresponding exposures to stockouts per unit of time. Consequently, you can
reduce average cost by letting the order quantity Q be larger than indicated by the
economic lot-size formula. By the same token, you may obtain further cost savings
from an even lower value for the reorder point s.

Having recognized that optimal values for s and Q, are interdependent, you 
are ready to explore a way to calculate them. As usual, the first step is to develop 
the appropriate expression for the criterion function, and afterwards to optimize 
the value of the expression with respect to s and Q. Before taking the first step, we
will impose a few additional assumptions. Of course, all assumptions, in one way 
or another, represent approximations to a real situation. But the ones to follow 
are imposed primarily to keep the exposition simple and the mathematical 
manipulations tractable. Nevertheless, the model below (and minor variants of 
it) have proved in practice to be very effective in establishing workable inventory 
policies. 

Assume for the mathematical development of the criterion function that

(i) The probability distribution of demand during a lead time does
not depend on when the inventory reaches the reorder point s.

(ii) The inventory level i can be treated as a continuous variable.

(iii) After a replenishment order arrives, there exists a future moment in time
when the inventory level i = s, and a reorder action occurs as a
consequence.

(iv) In an optimal policy, the reorder point s > 0 and during any lead time,
actual demand does not exceed the order quantity (q_L ≤ Q).

The meaning of these assumptions is clear, but you should observe where each is
used in developing the criterion function. The postulate that the optimal Q ≥ q_L
deserves further comment. Note that just before a replenishment order arrives
the inventory level is s − q_L and just after it is s − q_L + Q, which is at least s
given postulate (iv). As a result, there is never more than one replenishment order
outstanding at any instant. Explain why.

► Observe that the discussion stressed that the above assumptions are made for the
purpose of writing a mathematical expression for the average cost function. Here is why
they were not introduced as assumptions about reality.

After giving a precise mathematical description of (i) and (iii), you can prove they
logically imply that for any interval of time T, the probability distribution for demand
during T must be a Poisson distribution with a single parameter MT that equals both
the mean and variance. This is much too restrictive a conclusion about reality. Rarely
in practice will you find that the mean and variance of a demand distribution are equal;
most often, the variance is several times greater than the mean. Assuming (i) and
(iii) only for mathematical convenience permits you to use an arbitrary distribution
form for p_L(q_L) in all the computations. In summary, the model is often an excellent
approximation despite its containing a few internal inconsistencies. ◄




Expected cost. The purchase cost component of the objective function
is just like the lot-size models:

(9)  $\dfrac{KM}{Q} + cM$  (average ordering cost).


We next obtain the expected holding and penalty cost. You can, if you want, 
skip this derivation and go directly to the end result in (16) below. 

Consider the time interval between two successive reorder actions; two examples
of what can happen are shown in Fig. 19.5, one for actual demand during lead
time less than the reorder point, q_L < s, and one for q_L > s. In the case q_L > s,
taking into account that the inventory level is 0 before the replenishment arrives
makes the holding cost formulas complicated. Therefore, as a mathematical
approximation, assume that when q_L > s, inventory becomes 0 just before the
replenishment arrives. Then, as you can verify,

(10)  expected average inventory level during lead time
$= \sum_{q_L \le s} \tfrac12\,[s + (s - q_L)]\,p_L(q_L) + \sum_{q_L > s} \tfrac12\,[s + 0]\,p_L(q_L)$

[Figure 19.5. Sawtooth Patterns for Probabilistic Demands, showing the case q_L < s and the case q_L > s. Note: q_L = demand during lead time L, Q = reorder quantity, s = reorder point.]

(11)  expected average inventory level after replenishment until next reorder
$= \tfrac12\,[(s - M_L + Q) + s] = \tfrac12\,(2s - M_L + Q).$

The expression in (10) must be weighted by M_L/Q, the fraction of time the system
is waiting for a replenishment, and correspondingly, the expression in (11) must
be weighted by (1 − M_L/Q). You can verify that after applying these weights,
adding the results, and rearranging terms, you get


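(12)  expected average inventory level per unit of time
$= \dfrac{M_L}{2Q}\Big[-2s + M_L - Q + s + \sum_{q_L \le s}(s - q_L)\,p_L(q_L)\Big] + \tfrac12\,(2s - M_L + Q).$

As you may want to check for yourself,

(13)  $\sum_{q_L \le s}(s - q_L)\,p_L(q_L) = s - M_L - \sum_{q_L > s}(s - q_L)\,p_L(q_L),$

so that the expression in (12) can be written as

(14)  $\dfrac{Q}{2} - M_L + s + \dfrac{M_L}{2Q}\sum_{q_L > s}(q_L - s)\,p_L(q_L).$

Finally,

(15)  expected amount short just before a replenishment $= \sum_{q_L > s}(q_L - s)\,p_L(q_L),$

which must be weighted by M/Q to obtain the expected shortage per unit of time,
just as in the economic lot-size model of the previous section.

Combining (9), (14) multiplied by the holding cost h, and (15) multiplied by
M/Q times the penalty cost π gives

(16)  expected average cost per unit of time
$= E[AC] = \dfrac{KM}{Q} + cM + h\Big(\dfrac{Q}{2} - M_L + s\Big) + \Big(\dfrac{\pi M}{Q} + \dfrac{hM_L}{2Q}\Big)\sum_{q_L > s}(q_L - s)\,p_L(q_L).$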


To see the relation between (16) and the formulas derived in the preceding
section, observe that if demand is deterministic, then the optimal reorder point
s = M_L and the summation on the right of (16) would be 0, since each of the
probabilities would be 0; the resulting expression agrees with the average cost
formula (7) in Sec. 19.5.
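
The expected average cost can be evaluated for any trial pair (Q, s) with a short sketch. It assumes the cost is built as the derivation describes: the average ordering cost, plus h times the expected average inventory (including the approximation that inventory is 0 just before a replenishment whenever lead-time demand exceeds s), plus π times the expected shortage per unit of time. The dictionary representation of the lead-time demand distribution and all names are illustrative.

```python
def expected_shortage(s, pmf):
    """Expected amount by which lead-time demand exceeds s.

    pmf maps each possible lead-time demand q to its probability.
    """
    return sum((q - s) * p for q, p in pmf.items() if q > s)


def expected_average_cost(Q, s, K, M, h, c, pi, pmf):
    """Expected average cost per unit of time for order quantity Q, reorder point s."""
    M_L = sum(q * p for q, p in pmf.items())       # mean lead-time demand
    short = expected_shortage(s, pmf)
    ordering = K * M / Q + c * M
    holding = h * (Q / 2.0 - M_L + s + M_L * short / (2.0 * Q))
    penalty = pi * M * short / Q
    return ordering + holding + penalty


# Deterministic lead-time demand of 2 with s = 2: the shortage term vanishes
# and the cost reduces to the lot-size value K*M/Q + c*M + h*Q/2.
certain = {2: 1.0}
print(expected_average_cost(5.0, 2, K=31.25, M=2.0, h=5.0, c=1.0, pi=20.0, pmf=certain))
```

The deterministic check mirrors the remark above: with no chance of a shortage the expression collapses to the lot-size cost formula.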

Partially differentiating E[AC] with respect to Q, setting the derivative equal
to 0, and solving for Q gives the formula

(17)  optimal $Q = \sqrt{\dfrac{2KM}{h} + \Big(\dfrac{2\pi M}{h} + M_L\Big)\sum_{q_L > s}(q_L - s)\,p_L(q_L)}$
(determination of order quantity).

Define the cumulative demand distribution as

(18)  $P_L(s) = \sum_{q_L \le s} p_L(q_L).$

Then it can be shown that the optimal s > 0 is the smallest integer such that

(19)  $P_L(s) \ge R$  (determination of reorder point),

where here the critical ratio is

(20)  $R = 1 - \dfrac{hQ}{\dfrac{hM_L}{2} + M\pi}.$


More precisely, the order quantity Q and reorder point s are optimal values if they
simultaneously satisfy (17) and (19). Substituting (17) into (16) yields

(21)  $E[AC] = cM + \sqrt{2h\Big\{KM + \Big(\pi M + \dfrac{hM_L}{2}\Big)\sum_{q_L > s}(q_L - s)\,p_L(q_L)\Big\}} + h(s - M_L),$

assuming that s is found according to (19). What terms in (21) indicate the impact
of uncertainty? [To decide whether the item is to be inventoried at all, the value
in (21) must be compared to the cost of not stocking.]

Notice the second term under the radical sign in (17) causes the optimal value
for Q to be larger than the economic lot size for the case of deterministic demand.
Also the smaller the value of s, the larger the value of Q. By the same token, the
larger the value of Q in (20), the smaller the value of R and the smaller the value
of s satisfying (19). The value of P_L(s) is sometimes referred to as the service
level and the amount s − E[q_L] as the buffer or safety stock.

Unless you make additional assumptions about the form of the probability
distribution p_L(q_L), you are not able to state much about the sensitivity of the optimal
policy to the economic parameters, the mean demand M, and the lead time L.
To see why, consider an increase in the penalty cost π. On the one hand, it tends
to directly increase the second term under the radical in (17); on the other, it
tends to indirectly decrease that term due to its direct effect on R and s in (19)
and (20). Several illustrative examples showing policy sensitivity will be given in
the discussion of a Normal distribution approximation below.


► The formulas developed in (17) and (19) have a wider applicability than simply the
model specified by the various assumptions so far. To illustrate, suppose lead time L
is random but the assumption that demand during lead time q_L ≤ Q still holds. Then
p_L(q_L) is defined accordingly to be the probability distribution of demand during a lead
time.

As another illustration, suppose the value of s is determined by (19) but R is a
probability prespecified by the decision-maker; that is, R represents the minimum allowable
probability of filling all demand during a lead time. (The value 1 − R is the maximum
allowable probability of having any shortages.) Postulating that only a policy of the
(s, S) form is to be considered, and that expected average purchase and holding cost
per unit of time is to be minimized, then the optimal Q is given by (17) with π = 0.
Recall from Chap. 16, however, that given the chance-constraint P_L(s) ≥ R, a randomized
strategy might yield an even lower expected purchase and holding cost.
Finally, with only minor modifications, the approach can be applied to the situation
where unfilled demand is not backlogged but is lost. (See exercise 56.) ◄

Algorithmic solution. The following simple procedure can be used to
yield values for Q and s that simultaneously solve (17) and (19):

Step 1. Let the initial trial value of Q be $\sqrt{2KM/h}$.

Step 2. Compute R in (20) using the current trial value for Q, and find a
corresponding trial value for s in (19).

Step 3. Stop if the new trial s is the same as the previous trial value. Otherwise
calculate a new trial value for Q according to (17), and return to Step 2.


Notice, as the iterations progress, the trial values of s get smaller and of Q get
larger. The method will always converge in a finite number of iterations provided 
the optimal s is positive. A sufficient condition ensuring convergence is that 


(22)  [inequality illegible in the source; only its trailing term "+ Mπ" survives]

where the order quantity appearing in (22) is the expression on the right of (17)
evaluated for s = 0 (so that the summation term is simply M_L). Once again, because
no assumptions have been made about the form of p_L(q_L) other than that it is a
discrete distribution, the terminating value for s may not be globally optimal; at
worst, a neighboring value of s and the associated optimal Q have a lower expected cost.

Consider the following example, which contains the same data as (5) and (7): 


c= 1 


X = 31.25 A = 5 7C = 2Q 


(23) 


/'i(?i)=y for? = 0,1,..., 4 £[^J = M^ = Af = 2. 


According to Step 7, 

(24) initial trial Q, = 5. 

Then for Step 2, 

(25) /?=i__^=.44 and /»(2) ^ .44. 


Using the trial value .r = 2, the computation in Step 3 is 

(26) trial d = V25 + (18) (.6) = 5.98. 

Repeating Steps 2 and 3 gives 

(27) 72 = .34 and P(l) ^ .34 trial Q,= V25 + (18)(1.2) = 6.83. 
The final iteration is 


R = ,24 and P{1) ^ .24 optimal ^ = 1 
^ ^ optimal d = 6.83 optimal 5 = j + Q, = 7.83 

and as a consequence, 

(29) minimal E[AC] = 2 + V10[62.5 + (45)(1,2)] + 5(1 - 2) = 31.13. 
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Observe how the optimal solution differs from the policy in (6) and (8) obtained 
by the naive analysis. That approach led to a policy (s, S) = (3, 8); the associated
expected cost according to (16) is 33.9, which is 9% larger than the optimal value
in (29).

To test your understanding of the algorithm, apply it to the case in (23) with
π = 30. (You should find that s = 2 is optimal.)
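The three steps of the algorithm can be sketched in Python as follows. This is a minimal illustration rather than the text's own program. The forms used for (17), (19), and (20) are assumptions inferred from the arithmetic of the worked example (for instance, the constant 45 is read as $hM_L/2 + \pi M$), and the names `solve_s_q` and `expected_cost` are hypothetical.

```python
import math


def solve_s_q(K, h, pi, M, M_L, pmf, max_iter=100):
    """Iterate Steps 1-3 for a discrete lead-time demand pmf {q: probability}.

    Assumed forms (inferred from the worked example, not quoted from the text):
      (20) R = 1 - h*Q / (h*M_L/2 + pi*M)
      (19) s = smallest value with P(s) = Prob(q_L <= s) >= R
      (17) Q = sqrt(2*[K*M + (h*M_L/2 + pi*M) * sum_{q>s} (q-s)*p(q)] / h)
    """
    coef = h * M_L / 2 + pi * M
    values = sorted(pmf)

    def shortage(s):
        # Expected amount by which lead-time demand exceeds s.
        return sum((q - s) * p for q, p in pmf.items() if q > s)

    def cumulative(s):
        return sum(p for q, p in pmf.items() if q <= s)

    Q = math.sqrt(2 * K * M / h)                      # Step 1
    s_prev = None
    for _ in range(max_iter):
        R = 1 - h * Q / coef                          # Step 2
        s = next((q for q in values if cumulative(q) >= R - 1e-12), values[-1])
        if s == s_prev:                               # Step 3: stop when s repeats
            break
        Q = math.sqrt(2 * (K * M + coef * shortage(s)) / h)
        s_prev = s
    return s, Q


def expected_cost(c, K, h, pi, M, M_L, pmf, s):
    """Minimal E[AC] in the form used in (29); an assumed reading of (21)."""
    coef = h * M_L / 2 + pi * M
    short = sum((q - s) * p for q, p in pmf.items() if q > s)
    return c * M + math.sqrt(2 * h * (K * M + coef * short)) + h * (s - M_L)


uniform = {q: 0.2 for q in range(5)}                  # the data of (23)
s, Q = solve_s_q(K=31.25, h=5, pi=20, M=2, M_L=2, pmf=uniform)
cost = expected_cost(c=1, K=31.25, h=5, pi=20, M=2, M_L=2, pmf=uniform, s=s)
print(s, round(Q, 2), round(cost, 2))                 # prints: 1 6.83 31.13
```

Under these assumptions the sketch passes through the same trial values as (24) through (28), and rerunning it with `pi=30` stops at s = 2.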


*Normal approximation. When applying the model in this section to
an inventory system containing a multitude of items, you typically would assume
as an approximation that the distribution $p_L(q_L)$ is of a particular type, such as
Poisson, negative binomial, uniform, etc. Then you would only need to specify
the parameters of the selected distribution. A very convenient approach is to use
a Normal distribution (which is continuous), letting the mean and variance be

(30)    $E[q_L] = M_L$,    $\mathrm{Var}[q_L] = V_L$.

From a practical point of view, the approximation method turns out to be very
effective, even though the theoretical properties of the Normal distribution (such
as symmetry, and the possibility of arbitrarily small and large values of the
random variable) may depart considerably from the distribution $p_L(q_L)$. Of
course, you should not apply this or any other approximation unless you investigate
how well it holds for the range of parameters in which you are interested.

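Define

(31)    $u_s = \dfrac{s - M_L}{\sqrt{V_L}}$,

so that

(32)    $s = M_L + u_s\sqrt{V_L}$.

Accordingly, as before, let

(33)    $R = 1 - \dfrac{hQ}{\tfrac{1}{2}hM_L + M\pi}$.

Denote the cumulative standardized Normal distribution as

(34)    $P_N(u) = \displaystyle\int_{-\infty}^{u} \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}\, dt$,

and the so-called standardized Normal loss integral as

(35)    $I_N(u) = \displaystyle\int_{u}^{\infty} (t - u)\, \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}\, dt$.

(In (34) and (35) the symbol π under the square root is the constant 3.14159..., not the penalty cost.)
The loss integral is used for the approximation

(36)    $\displaystyle\sum_{q_L > s} (q_L - s)\, p_L(q_L) \approx \sqrt{V_L}\; I_N(u_s)$.

The functions $P_N(u)$ and $I_N(u)$ are tabled in Fig. 19.6.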
  u      P_N(u)    I_N(u)    P_N(-u)   I_N(-u)

  0      .5000     .3989     .5000      .3989
  .1     .5399     .3509     .4601      .4509
  .2     .5793     .3068     .4207      .5068
  .3     .6180     .2667     .3820      .5667
  .4     .6555     .2304     .3445      .6304
  .5     .6915     .1977     .3085      .6977
  .6     .7258     .1686     .2742      .7686
  .7     .7581     .1428     .2419      .8428
  .8     .7882     .1202     .2118      .9202
  .9     .8160     .1004     .1840     1.0004
  1.0    .8414     .0833     .1586     1.0833
  1.1    .8644     .0686     .1356     1.1686
  1.2    .8850     .0561     .1150     1.2561
  1.3    .9032     .0455     .0968     1.3455
  1.4    .9193     .0366     .0807     1.4366
  1.5    .9332     .0293     .0668     1.5293
  1.6    .9453     .0232     .0547     1.6232
  1.7    .9555     .0182     .0445     1.7182
  1.8    .9641     .0142     .0359     1.8142
  1.9    .9713     .0110     .0287     1.9110
  2.0    .9773     .0084     .0227     2.0084
  2.1    .9822     .0064     .0178     2.1064
  2.2    .9861     .0048     .0139     2.2048
  2.3    .9893     .0036     .0107     2.3036
  2.4    .9919     .0027     .0081     2.4027
  2.5    .9938     .0020     .0062     2.5020
  2.6    .9954     .0014     .0046     2.6014
  2.7    .9966     .0010     .0034     2.7010
  2.8    .9975     .0007     .0025     2.8007
  2.9    .9982     .0005     .0018     2.9005
  3.0    .9987     .0003     .0013     3.0003

Figure 19.6. Tables of Cumulative and Loss Integral for a Standardized Normal Distribution.

Then, instead of (19), the value $u_s$ is determined by

(37)    $P_N(u_s) = R$,

and, instead of (17), the value of Q is found by

(38)    optimal $Q = \sqrt{\dfrac{2\left[KM + (\tfrac{1}{2}hM_L + M\pi)\sqrt{V_L}\; I_N(u_s)\right]}{h}}$.

Write down the expression corresponding to (21) for the minimal E[AC].
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The algorithm is 

Step 1. Let the initial trial value of Q be $\sqrt{2KM/h}$.

Step 2. Compute R in (33) using the current trial value for Q, and find a corresponding
trial value for $u_s$ from (37) and s from (32).

Step 3. Stop if the new trial s is approximately the same as the previous trial
value. Otherwise calculate a new trial value for Q according to (38) and return
to Step 2.

Notice in Step 3 you must specify a numerical tolerance defining when two trial
values for s are approximately the same.

To illustrate the method, consider the example

(39)    K = 32    h = 1    π = 9
        L = 1    $M = M_L = 16$    $V_L = 48$.

From Step 1,

(40)    initial Q = 32,

and for Step 2,

(41)    $R = 1 - \dfrac{32}{8 + 144} = \dfrac{120}{152} = .789$,

so that from Fig. 19.6 you have

(42)    trial $u_s = .8$ and trial $s = 16 + (.8)\sqrt{48} = 21.6$.

Then in Step 3,

(43)    trial $Q = \sqrt{(32)(32) + (16 + 288)\sqrt{48}\,(.12)} = 35.71$.

Returning to Step 2,

(44)    $R = 1 - \dfrac{35.71}{152} = .765$,

so that

(45)    trial $u_s = .72$ and trial $s = 16 + (.72)\sqrt{48} = 21.01$.

Then in Step 3

(46)    trial $Q = \sqrt{(32)(32) + (16 + 288)\sqrt{48}\,(.14)} = 36.24$.

Further iterations will affect only the decimal parts of s and Q in (45) and (46).
The associated expected cost is

(47)    E[AC] = 16c + 41.3.
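As a check on the numerical part of (47), the arithmetic can be laid out as below. The closed form in the first line is an assumption: it supposes that the minimal expected cost has the same structure as in the discrete-demand example, with $\sqrt{V_L}\,I_N(u_s)$ replacing the summation term, and it is a reconstruction rather than a quotation of the text's formula. The second line uses the fact that the square root equals $hQ$ when Q satisfies (38).

```latex
\begin{aligned}
E[AC] &\approx cM + \sqrt{2h\left[KM + \left(\tfrac{1}{2}hM_L + M\pi\right)\sqrt{V_L}\; I_N(u_s)\right]} + h\,(s - M_L) \\
      &= cM + hQ + h\,(s - M_L) \\
      &\approx 16c + 36.24 + (21.01 - 16) \;=\; 16c + 41.25 \;\approx\; 16c + 41.3 .
\end{aligned}
```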

Check your understanding of the method by finding an optimal policy for
M = 16, L = 5, $M_L$ = 80, $V_L$ = 240, and the other parameter values as given
in (39). (The optimal s is about 91.)
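The Normal-approximation algorithm can be sketched in Python as follows. This is a hedged illustration, not the text's procedure verbatim: the forms of (33) and (38) are inferred from the arithmetic of the worked example, `statistics.NormalDist` from the standard library stands in for the table lookup of (37), and the function names and the default tolerance of 0.01 are arbitrary choices.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def loss_integral(u):
    """Standardized Normal loss integral I_N(u) = phi(u) - u*(1 - P_N(u))."""
    return STD_NORMAL.pdf(u) - u * (1.0 - STD_NORMAL.cdf(u))


def normal_s_q(K, h, pi, M, M_L, V_L, tol=0.01, max_iter=100):
    """Steps 1-3 with the Normal approximation; tol is the Step 3 tolerance on s.

    Assumed forms, inferred from the worked example:
      (33) R = 1 - h*Q / (h*M_L/2 + pi*M)
      (38) Q = sqrt(2*[K*M + (h*M_L/2 + pi*M)*sqrt(V_L)*I_N(u)] / h)
    """
    coef = h * M_L / 2 + pi * M
    sd = math.sqrt(V_L)
    Q = math.sqrt(2 * K * M / h)                 # Step 1
    s_prev = None
    for _ in range(max_iter):
        R = 1 - h * Q / coef                     # Step 2, using (33)
        if not 0.0 < R < 1.0:
            raise ValueError("R must lie strictly between 0 and 1")
        u = STD_NORMAL.inv_cdf(R)                # (37), instead of a table lookup
        s = M_L + u * sd                         # (32)
        if s_prev is not None and abs(s - s_prev) < tol:
            break                                # Step 3: s has settled
        Q = math.sqrt(2 * (K * M + coef * sd * loss_integral(u)) / h)
        s_prev = s
    return s, Q


s, Q = normal_s_q(K=32, h=1, pi=9, M=16, M_L=16, V_L=48)
print(round(s, 1), round(Q, 1))
```

With the data of (39) the iterates should stay close to the trial values shown in (42) through (46); small differences arise because the sketch does not round $u_s$ and $I_N$ to table precision. Changing the arguments to `M_L=80, V_L=240` gives a reorder point near 91.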

Several optimal policies found by the approximation method are shown in
Fig. 19.7. Note that an increase in each parameter π, K, M, L, and $V_L$ tends to
Case   Varied Parameters     s             Q     E[AC]    EOQ

 1     None                  11            26    28.0     24
       π = 99                [illegible]   25    31.2     24
       K = 64                10            36    37.2     33.9
       π = 99, K = 64        14            35    40.2     33.9
       M = 16                19            34    37.4     32
       L = 5                 49            28    32.5     24
       V_L = 27              11            28    29.6     24

 2     None                  109           52    80.7     45.3
       π = 9                 88            56    64.2     45.3
       K = 32                111           38    69.3     32
       π = 9, K = 32         91            42    53.2     32
       M = 9                 65            39    59.0     33.9
       L = 1                 29            48    61.0     45.3
       V_L = 80              97            49    65.9     45.3

Nonvaried Parameter Values:

Case 1: h = 1   π = 9    K = 32   c = 0   M = 9    L = 1   (M_L = 9)    V_L = 9
Case 2: h = 1   π = 99   K = 64   c = 0   M = 16   L = 5   (M_L = 80)   V_L = 240

(The values for s and Q have been rounded to the nearest integer.)

Figure 19.7. Sensitivity Analysis for Normal Approximation.

increase both the optimal s and Q, except that an increase in π decreases Q, and an
increase in K decreases s. This sensitivity pattern holds provided that the probability
that $q_L$ is negative is negligible in the Normal approximation to $p_L(q_L)$, as
is true when [ratio illegible in this copy] is larger than 2. Also observe that the optimal Q can be
significantly larger than the deterministic EOQ formula value, but even when this
occurs, the values are found in only three or four iterations.

19.7 CONCLUDING REMARKS ON IMPLEMENTATION

Any detailed discussion of the steps required to implement a scientific inventory 
management system would go far beyond the intended scope of this chapter. But 
because the text mentioned that a model as simple as the one in the preceding
section has proved effective, the paragraphs below briefly indicate how in reality 
you would handle the approximating assumptions. Remember throughout that 
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the context of the discussion is the employment of a mathematical approach to 
stocking those items that are to be managed on a routine basis.

Stationarity. The foregoing dynamic continuous review model postulated
that the economic parameters as well as the demand distribution are unchanging 
over time. In an application, each time the policy is calculated for an item, you 
would use the current economic parameters and a probability distribution esti¬ 
mated by giving greatest weight to the most recent demand data. In many inven¬ 
tory systems, each policy is updated at most every three months, and often not 
until six months or a year elapses. If the economic parameters and demand dis¬ 
tribution are shifting so rapidly as to invalidate the approximations made in the 
above model, you then will have to resort to a dynamic programming approach, 
such as that given in Appendix II. 

Form of the economic functions. Ordering and holding cost 
functions as simple as those exhibited in this chapter are obviously approximations. 
Nevertheless, as such they are still capable of being estimated within an acceptable 
degree of accuracy. Ordinarily you would arrive at the setup cost element by 
looking at the cost of the total annual manpower and associated costs required to 
support the reordering function (consisting mainly of placing and receiving 
replenishment orders) and averaging this cost over the corresponding reorder 
activity level. Often the principal component of the inventory carrying charge 
is the firm’s cost of working capital. The additional holding costs due to warehous¬ 
ing and insurance are easily calculated in aggregate, and typically are prorated 
among the different items stocked. 

The shortage or penalty cost is the hardest economic factor to estimate. Typi¬ 
cally, you would first make a rough guess and then perform a sensitivity analysis 
to refine your estimate. A little more will be said about this aspect under the topic 
of “Validation” below. 

Form of the demand distribution. In an inventory system with a
multitude of items, you necessarily have to develop a routine procedure for 
estimating the demand distributions using historical data. Standard statistical 
approaches for estimating means and variances are usually employed. You must 
apply a certain amount of caution, however, in the data processing step. For 
example, you must be careful not to use data so old as to be irrelevant for future 
demand prediction. Similarly, you must be careful to cull out of the data so-called 
extreme values, which can exert a strong upward bias in the value of a sample 
mean, and which in fact represent occasions when a demand is so large that it 
has to be treated on a nonroutine basis. 

You will ordinarily assume a standard form for the probability distributions. 
For example, if you use the Normal approximation approach in the preceding 
section, then you need to estimate only the mean and variance of demand during 
lead time. Often, by examining a sample of items, you will find through statistical 
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regression analysis that you can employ an approximating relationship between 
the mean and variance. To illustrate, you may find there is a reasonably constant 
ratio between the mean and variance. In such a case, you would estimate the 
variance indirectly from an estimate of the mean. 
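The estimation routine described above might look like the following sketch. Everything specific in it is a hypothetical illustration: the function name `estimate_lead_time_demand`, the culling threshold `cull_above`, and the variance-to-mean ratio `var_to_mean` are assumed inputs that an analyst would have to choose, not quantities given in the text.

```python
from statistics import mean, variance


def estimate_lead_time_demand(history, cull_above=None, var_to_mean=None):
    """Estimate the mean and variance of lead-time demand from past observations.

    history      recent lead-time demand observations (old data already dropped)
    cull_above   observations above this level are treated as nonroutine and removed
    var_to_mean  if given, the variance is taken as this ratio times the mean
                 instead of being estimated directly from the sample
    """
    data = [d for d in history if cull_above is None or d <= cull_above]
    m = mean(data)
    v = var_to_mean * m if var_to_mean is not None else variance(data)
    return m, v


observed = [4, 6, 5, 5, 40]          # the 40 is an extreme, nonroutine demand
print(estimate_lead_time_demand(observed, cull_above=20))
print(estimate_lead_time_demand(observed, cull_above=20, var_to_mean=3))
```

In the first call the extreme value is removed and the sample variance is used directly. In the second call the variance is derived indirectly from the mean through an assumed constant ratio.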

Calculation of optimal policies. There is a variety of techniques for 
computing the specific optimal policies. The primary factor determining the 
appropriate method is whether an electronic computer is employed to maintain 
current inventory records. When a company utilizes a computer to keep an
up-to-date record of stock on hand, then ordinarily it also has the computer calculate
the policies according to formulas such as those in the preceding section. The 
electronic calculator uses recent demand data to update the policies, and as 
already mentioned, the policies are revised from one to four times a year. In the 
most sophisticated computer applications, as soon as the inventory level reaches 
the reorder point, the electronic calculator prints out a reorder slip to be mailed 
to the vendor. 

The scientific approach to inventory decisions has also been implemented in 
companies making only minor use of a computer for stock control. In such situa¬ 
tions, tables (or nomographs) are constructed so that an inventory manager is 
not required to perform any complex computations, but only needs to enter a 
modest amount of economic and demand data to find the optimal policy. The 
tables themselves are usually calculated by means of an electronic computer. 

Validation. By now, having seen a succession of simplifying assumptions 
piled one on top of another, you must be wondering how it is possible to tell 
whether the resultant recommended policies will really bring about an improve¬ 
ment in an actual inventory system. The real test is how well all of the policies 
will operate in the aggregate. 

The most commonly employed technique for performing this test is computer 
simulation, and several standard computer programs are available for just this 
purpose. Since the topic of simulation is taken up in detail in Chap. 21, only 
a few comments are given here. A simple, yet effective, approach to the validation 
phase is the following. You take a sample of items, use early historical data to 
estimate the statistical parameters, and thereby determine the recommended 
policies. Then, employing these policies, you simulate how the inventory system 
would have operated using more recent data. All of the calculations are done by 
an electronic computer, so little manual effort is involved in the simulation. 
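A bare-bones version of such a retrospective simulation is sketched below. It is only an illustration under stated assumptions: demand is processed one period at a time, an order of size Q is placed whenever the inventory position (net stock plus stock on order) is at or below s, deliveries arrive exactly `lead_time` periods later, and unfilled demand is backlogged. The function name `simulate_s_q` and the summary statistics it returns are hypothetical.

```python
def simulate_s_q(demands, s, Q, lead_time, start_level):
    """Replay a demand history under a reorder-point, order-quantity policy."""
    net = start_level        # stock on hand minus backlog
    pipeline = []            # outstanding orders as (arrival_period, quantity)
    orders = 0
    on_hand_total = 0
    backlog_total = 0
    for t, demand in enumerate(demands):
        # Receive any orders due this period, then fill (or backlog) demand.
        net += sum(q for arrival, q in pipeline if arrival == t)
        pipeline = [(arrival, q) for arrival, q in pipeline if arrival != t]
        net -= demand
        # Reorder while the inventory position is at or below s.
        position = net + sum(q for _, q in pipeline)
        while position <= s:
            pipeline.append((t + lead_time, Q))
            position += Q
            orders += 1
        on_hand_total += max(net, 0)
        backlog_total += max(-net, 0)
    n = len(demands)
    return {
        "orders": orders,
        "avg_on_hand": on_hand_total / n,
        "avg_backlog": backlog_total / n,
    }


recent_demand = [2, 2, 2, 2, 2, 2]
print(simulate_s_q(recent_demand, s=2, Q=4, lead_time=1, start_level=4))
```

The returned averages and order count are the kind of figures that would be set against the historical performance of the actual system.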

The simulated results are then compared against data showing the historical 
performance, looking at, for example, average inventory levels, frequency of 
orders, size of backlogs, etc. Usually you would perform a sensitivity analysis, 
especially with respect to the stockout parameters. The higher the penalty cost,
the higher the average inventory. 

In evaluating the simulation sensitivity results, the exercise of managerial 
judgment is far more important than technical expertise. The reason is that it is
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operating management who must accept the final responsibility for balancing 
low purchase and holding costs against maintaining customer service at a com¬ 
petitive level. 


REVIEW EXERCISES 

1 In each part below, discuss the elements of the situation that may make the inventory 

decisions nontrivial. Specifically, describe what you think might represent the appro¬ 
priate considerations relating to supply and demand, the replenishment economics, 

and the systems specifications, as illustrated in Sec. 19.3. 

(a) Refilling an auto’s gasoline tank. 

(b) Refilling a gasoline station’s storage tank. 

(c) Maintaining an adequate balance in a personal checking account. 

(d) Maintaining an adequate balance in a corporation’s checking account. 

(e) Determining an adequate level of cash reserves and liquid assets for a bank. For 
a savings and loan company. For a life insurance company. For a fire insurance 
company. 

(f) Determining the number of cans of tomato juice to keep in your kitchen cupboard. 
In a restaurant’s store room. On a grocer’s shelf. 

(g) Determining the numbers of new and used automobiles to be stocked by a car 
dealer. 

(h) Determining the number of buses and their replacement dates by a local transit 
authority. 

(i) Determining the number of spare flashlight batteries to keep at home. 

(j) Determining the number and size of auxiliary power generators to install in a 
hospital. 

(k) Determining the number of paper clips an executive should keep in his home and 
office desks. 

(l) Determining the number of spare jet engines an airline should keep at its different 
maintenance bases. 

(m) Determining the number of different size scratch pads to stock in a busy office, 

(n) Determining the number of different size boards of lumber to stock in a lumber 
yard. In a lumber mill. 

(o) Determining the number of towels a department store should stock at its annual 
white sale. Determining the number of bathing suits the store should stock at the 
beginning of the summer season. 

(p) Determining the total number of seats in a new movie theater. The number of 
loge seats. 

(q) Determining the number of first-class seats in a new supersonic jet aircraft.

(r) Determining the number of prizes to award in a state-run lottery. 

(s) Determining the number of men to enlist monthly into the armed services. 

(t) Determining the amount of ammunition to keep in a field-supply area. In a 
continental U.S. depot. 

(u) Determining the amount of funds to allocate each year to aid so-called disaster 
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areas. Determining the amount of funds to put in a Community Chest contingency 
fund. 

(v) Determining the amount of water to let spill over a dam. 

(w) Determining the number of life jackets and life boats on an ocean liner. Determin¬ 
ing the number of bottles of champagne to have in store for passengers. 

2 Suggest some commercial situations in which a one-period inventory model is applicable.
Discuss the circumstances under which it would be profitable to use an operations
research approach to derive an inventory policy for these situations.


3 (a) Consider the inventory policy (s, S) = (3, 10). What is the order quantity if
entering inventory equals 0? Equals 2? Equals 3? Equals 5? Equals 10? Equals
15? In each of these cases, what is the amount of inventory available to meet
demand after the order is placed? Assume no delivery lag.

(b) Let c(x) have the form given by (2) in Sec. 19.4, and let K = 5 and c = 2. Draw
a graph of c(x) for x = 0, 1, 2, ..., 5.

(c) Consider the expected cost function (3) in Sec. 19.4. Explain why it is reasonable
to assume that c + h > 0 and π > c. Show that these two inequalities imply
h + π > 0.

(d) Verify that the probability distribution in (4) yields the expression L(y) given in
(5) of Sec. 19.4.

(e) Let h = 5 and π = 10 in (5) of Sec. 19.4. Calculate L(y) and check your answers
with the values in Fig. 19.1. Also plot L(y).

(f) Rework part (e), except let π = 20.

(g) Rework part (e), except let π = 5.

(h) Rework part (e), except let h = 10. [Hint: make use of your answer in part (f).]

(i) Explain in detail why the holding cost h in (3) of Sec. 19.4 can be interpreted as a
net cost, where the salvage value has been subtracted out. What would be an
optimal policy if the salvage value v were greater than the unit purchasing cost c?

(j) Suppose entering inventory is 0, and optimal s > 0. Derive a formula for the
"cost of uncertainty" appropriate to (2) and (3) in Sec. 19.4.


4 Consider the critical ratio R, given by (6) in Sec. 19.4.

(a) Assume c = 0 and h = 1. What value of π yields each of the three specified values of R? [The three values are illegible in this copy.]

(b) Assume c = 1 and h = 1, and answer part (a).

(c) Assume c = 10 and h = 1, and answer part (a).

(d) Assume c = 10 and h = 10, and answer part (a).

(e) Assume c = 100 and h = 10, and answer part (a).

(f) Describe the optimal policy when c = h = 0.

(g) Describe the optimal policy when π ≤ c. Justify your answer.


5 Consider the critical ratio R, given by (6) in Sec. 19.4.

(a) Plot the region of values for π/c and h/c that yields R in the interval .8 ≤ R ≤ .9.

(b) Plot a region of values for π/h and c/h that yields R in this interval.

(c) Plot a region of values for c/π and h/π that yields R in this interval.
6 Consider the critical ratio R, given by (6) in Sec. 19.4. Suppose .8 ≤ R ≤ .9, and
find the corresponding interval for π, given that

(a) c = 0 and h = 1.

(b) c = 1 and h = 1.

(c) c = 10 and h = 1.

(d) c = 10 and h = 10.

(e) c = 100 and h = 10.

(f) Answer parts (a) through (e) for the interval .7 ≤ R ≤ .8.

(g) Answer parts (a) through (e) for the interval .85 ≤ R ≤ .95.

7 (a) Consider the cumulative distribution in Fig. 19.2. Determine an optimal S from
(7) in Sec. 19.4 for h = 5, π = 10, and c = 5. For R = .8. For R = .6.

(b) Plot cy + L(y), where h = 5, π = 10, and c = 5, and indicate the optimal S.

8 Consider the critical ratio R, given by (6) in Sec. 19.4. In each part below, find an
optimal S for values of R equal to j, |, and 

(a) p(q) = (q + 1)/15, for q = 0, 1, ..., 4.

(b) p{0) = ^(4) = i,p{l) = ^(3) = 1,^(2) = S. 

(c) p(q) = (5 - q)/15, for q = 0, 1, ..., 4.

(d) p(0) = pw = A, p{l) = ^(3) = A, P(2) = A. 

9 Consider the probability distribution (8) in Sec. 19.4 and assume that h = 5, π = 10,
and c = .1 in (2) and (3).

(a) Verify that the optimal S = 3.

(b) Plot L(y) for y = 0, 1, ..., 5.

(c) Let K = 4 and plot K + c(S - y) + L(S), where S = 3. Show graphically that
the optimal s = 2.

(d) Let K = 12 and answer part (c).

(e) Find a range of values for K such that s = 1 is optimal.

10 Assume p(q) = 1/10 for q = 1, 2, ..., 10. Let h = 5, π = 10, and c = .1 in the
one-period model (2) and (3) of Sec. 19.4.

(a) Find an optimal S.

(b) Find an optimal s for K = 1. For K = 2. For K = 5. For K = 10.

(c) Find a range of values for K such that s = 2 is optimal.

(d) Let c = 5 and answer parts (a) and (b).

*11 Consider the general description of the static model in Sec. 19.4. Assume p(q) = 1/10
for q = 1, 2, ..., 10. Let the expected cost be

^(j>| i) = < • (^ - i) + A • - 9)^>(9) + 2 - yypiv) ^ 0. 

g=0 q>y 

where y is the inventory available to meet demand after ordering, given that initial
inventory is i.

(a) Assume i = 0, h = 5, π = 10, and c = .1. Find an optimal y.

(b) Rework part (a), except let c = 5.
*12 Consider the example (16), (17) and (18) in the special material of Sec. 19.4.

(a) Using (16), explain how (18) indicates that the actual loss is the same π, no matter
by how much the actual demand q exceeds the available supply y.

(b) Verify the numerical expressions on the right of (18).

(c) Why does (20) contain the only policies that need to be considered in the
optimization process?

(d) Justify the calculations in (21) through (23).

(e) Translate (24) into a verbal description of the inventory policy.

(f) Check the results shown in (25), and translate (25) into a verbal description of the
inventory policy.

*13 (a) Consider the example in exercise 11, part (a). Find an optimal (s, S) policy where
the purchase cost function includes a setup cost K = 1.

(b) Do the same for K = 5.

*14 Justify in detail the derivation (i) through (xii) in the advanced material at the end of
Sec. 19.4.

*15 Justify in detail the derivation (xiii) through (xvii) in the advanced material at the end
of Sec. 19.4.

16 Consider the economic order quantity examples displayed in Figs. 19.3 and 19.4.
Assume Q = 45, s = -10, M = 30.

(a) Calculate S, the average number of setups per week, the average level of inventory
during the interval when inventory is positive, and the average level of inventory
per week.

(b) Suppose lead time L = 1 week. When should a replenishment be initiated?

(c) Suppose lead time L = 2 weeks. When should a replenishment be initiated?

(d) Answer parts (a), (b), and (c) for Q = 60.

(e) Answer parts (a), (b), and (c) for M = 15.

(f) Answer parts (a), (b), and (c) for s = -5.

17 (a) Verify the algebra leading to the formula for average cost per unit of time as given
by (6) in Sec. 19.5.

(b) Show that if S = Q, then (6) simplifies to (7).

(c) Show that if the square root formula for Q as given in (8) is used, then the minimal
average cost is that also given in (8).

*(d) Differentiate AC in (7) with respect to Q, set the derivative equal to 0, and solve
for Q; verify that the result is the formula in (8).

18 (a) Consider the lot-size formula (8) in Sec. 19.5. Determine the value for the ratio
K/h such that an order is for one week's demand. For two weeks'. For four weeks'.
For 12 weeks'. For 26 weeks'.

(b) Define the turnover ratio as the average inventory level divided by the rate of demand
per unit of time. Assume that you employ (8), and derive a formula for the turnover
ratio. Explain how the ratio varies with the demand rate. With the ratio
K/h.



19 (a) Verify from (12) in Sec. 19.5 the sensitivity analysis formula (13).

(b) Find a range for r in (13) such that VC(r)/VC(1) < 1.15.

(c) Explain why the ratio in (13) is the same for a given r and its reciprocal r* = 1/r.

(d) Draw a graph of the ratio in (13) for r in the range .1 ≤ r ≤ 10.

20 Consider the economic order quantity formula given by (8) in Sec. 19.5. Suppose M
is the true demand rate, but that you mis-estimate the rate to be r·M, where r > 0.
Let VC(r) be the variable cost, analogous to (12), assuming that r·M is used in (8),
instead of M.

(a) Derive a formula for VC(r)/VC(1).

(b) Find the interval of values for r such that VC(r)/VC(1) ≤ 1.1.

(c) Plot VC(r)/VC(1) for r in the range .1 ≤ r ≤ 10.

21 Perform the analysis in exercise 20, except suppose that the setup cost K is mis-estimated
to be r·K.

22 Perform the analysis in exercise 20, except suppose that the holding cost is mis-estimated
to be r·h.

23 Consider the average-backlog penalty cost model expressed by the cost function (15)
in Sec. 19.5.

(a) Verify the formula in (15).

*(b) Take the partial derivative of AC in (15) with respect to S, set the result equal to 0,
and solve for S; verify the formula in (16).

*(c) Verify the results in (17).

(d) What is the optimal time interval between successive orders?

(e) Simplify and interpret the results in (17) for the case h = π.

(f) Simplify and interpret the results in (17) for the case 3h = π.

24 Assume h = 1, π = 9, K = 32, c = 0, M = 9.

(a) Evaluate the formulas in (17) of Sec. 19.5.

(b) Let h = 4 and rework part (a).

(c) Let π = 36 and rework part (a).

(d) Let M = 36 and rework part (a).

25 Consider the formulas for an optimal policy, as given by (17) in Sec. 19.5. Suppose
that you actually use an order quantity that equals r times the order quantity in (17),
where r > 0. Let VC(r) be the associated variable cost (exclusive of cM).

(a) Derive a formula for VC(r)/VC(1).

(b) Assume h = 1, π = 9, K = 32, M = 9, and plot VC(r)/VC(1) for r in the range
.1 ≤ r ≤ 10.

26 Consider the formulas for an optimal policy, as given by (17) in Sec. 19.5. Suppose M
is the true demand rate, but that you mis-estimate the rate to be r·M, where r > 0.
Let VC(r) be the corresponding variable cost (exclusive of cM), assuming that r·M
is used in (17) instead of M.

(a) Derive a formula for VC(r)/VC(1).
(b) Assume h = 1, π = 9, K = 32, M = 9, and find the interval of values for r such
that VC(r)/VC(1) < 1.1.

(c) Plot VC(r)/VC(1) for r in the range .1 ≤ r ≤ 10.

27 Perform the analysis in exercise 26, except suppose that the setup cost K is mis-estimated
to be r·K.

28 Perform the analysis in exercise 26, except suppose that the holding cost h is
mis-estimated to be r·h.

29 Perform the analysis in exercise 26, except suppose that the penalty cost π is
mis-estimated to be r·π.

30 Consider the formulas for an optimal policy, as given by (17) in Sec. 19.5. Assume
h = 1, π = 9, K = 32, M = 9, unless stated otherwise below. Plot each of the
optimal quantities in (17), as a function of

(a) The holding cost h.

(b) The penalty cost π.

(c) The setup cost K.

(d) The demand rate M.

*31 (a) Exhibit all the intermediate algebraic steps to justify the derivations in (9) through
(15), leading to the formula for the expected average cost per unit time, given by
(16) in Sec. 19.6.

(b) Partially differentiate E[AC] with respect to Q, set the derivative equal to 0,
and solve for Q; verify that the result is that given by (17).

*(c) Derive the formula (19) that determines an optimal s.

(d) Substitute the optimal Q formula given in (17) into the E[AC] formula in (16),
and verify that the result is (21). What terms in (21) indicate the impact of
uncertainty?

32 (a) Verify the calculations in the numerical example (23) through (29) of Sec. 19.6.

(b) Find an optimal policy using the data in (23), except let π = 30.

33 Assume $p_L(q_L)$ = 1/10 for $q_L$ = 1, 2, ..., 10, where L = 1. Let h = 5, π = 10, and
c = .1. In each part below, find an optimal inventory policy using the algorithm in
Sec. 19.6.

(a) Let K = 1.

(b) Let K = 5.

(c) Let K = 10.

(d) Let π = 20 and K = 10.

(e) Let π = 50 and K = 10.

34 Assume $p_L(q_L)$ = 1/20 for $q_L$ = 0, 1, ..., 19, where L = 1. In each part below, apply
the algorithm in Sec. 19.6 to graph the optimal Q, S, and expected cost, as functions
of the indicated parameter. (Only consider cases where optimal s > 0. Suggestion:
select a few sample points for the indicated parameter, and, by eye, fit the curves
through the optimal policies for these sample points.)

(a) Let h = 1, π = 50, and let the parameter be K, where 0 < K ≤ 30.

(b) Let h = 1, K = 20, and let the parameter be π, where 0 < π ≤ 100.

(c) Rework parts (a) and (b), assuming that L = 2.
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*35 Consider the Normal approximation approach described at the end of Sec. 19.6. In 
each part below, find the approximately optimal Q, s, S, and the associated approximate
value for the expected cost.

(a) M = 16, L = 5, $M_L$ = 80, $V_L$ = 240, and all the other parameters are those
given by (39).

(b) Case 1 in Fig. 19.7.

(c) Case 2 in Fig. 19.7.


*36 Study the sensitivity analysis in Fig. 19.7. Give a plausible explanation of why an
increase in each parameter π, K, M, L, and $V_L$ tends to increase both the optimal s
and Q, except that an increase in π decreases Q and an increase in K decreases s.
(Trace the direct impact of the economic parameters on the policy variables and the
consequent indirect impact due to the interactive effect among the policy variables.)


37 Explain your understanding of the following terms: 


lead time (delivery lag) 
setup (reorder) cost 
inventory holding cost 
penalty cost (profit loss) 
static (single decision) model 
newsboy problem 
(s, S) policy
reorder point 
reorder level 

linear holding and penalty cost 
critical ratio 
backlogged demand 


*net inventory position
*lump-sum penalty cost
economic order quantity (lot-size) models
sawtooth pattern of inventory
size-of-backlog penalty cost
EOQ (lot-size formula, Wilson lot size)
average-backlog penalty cost
demand during lead time
service level
safety (buffer) stock
*standardized Normal loss integral.


FORMULATION AND COMPUTATIONAL EXERCISES 

38 The Sayure Prairie Airlines is ordering a fleet of new jet aircraft. Provisioning engine
spare parts in advance is less expensive than waiting until they are needed. Specifically,
suppose $c_1$ is the unit purchase price now, and $c_2$ the price if ordered subsequently,
where $c_1 < c_2$. Assume that any parts left over when the fleet becomes obsolete have
a salvage value v per unit. Let p(d) be the probability that d parts will be needed by
SPA over the lifetime of the fleet.

(a) Exhibit a formula for the expected total cost associated with purchasing Q spares
now.

*(b) Derive a formula for determining an optimal Q. (State any assumptions you are
making about the value of v.)

39 The Sayure Prairie Airlines must also decide how many first-class and tourist seats
to install in the airplanes. Assume that the total number of rows of seats is R. Let $r_1$
be the number of rows of first-class seats, and $r_2$ the number of rows of tourist seats.
There are four seats in each first-class row and six seats in each tourist row. Assume
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that p\{d{) is the probability that customers desire first-class scats on each flight. 
For simplicity, assume that if all the first-class scats arc filled, these customers call 
another airline. Let Vi be the profit value from selling a first-class scat. Similarly, let 
^ 2 (^ 2 ) be the probability distribution for tourist-class seats and the profit value per 
seat. Write the appropriate expression for expected profit, and explain how you would 
find an optimal seat allocation. 

(^» Q.) Inventory ReplenisKment Policy, Consider the one-period model with costs given by 
(2) and (3) in Sec. 19.4. Suppose that an order quantity must be an integer multiple 
where Q, is a prespecified number. (For example, Q, ^ ^ dozen.) 

*(a) Show that the form of an optimal policy is the following: if entering inventory is 
less than some number k, the smallest integral multiple of Q is ordered such that 
inventory after ordering is at least k; if entering inventory is k or greater, no order 
is placed. For example, suppose the policy is (k, Q) = (8, 5); if entering inventory 
is 8 or more, then no items are ordered; if entering inventory is 3, 4, ..., 7, then 
5 units are ordered; if entering inventory is -2, -1, ..., 2, then 10 units are 
ordered, etc. 
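
The ordering rule in the example above can be sketched in a few lines of Python. This is only an illustration of the stated policy form, not a solution to the exercise; the function name order_quantity and its argument names are assumptions introduced here.

```python
def order_quantity(entering_inventory: int, k: int, q: int) -> int:
    """Units ordered under a (k, Q) policy (hypothetical helper).

    If entering inventory is below k, order the smallest integer multiple
    of q that brings inventory up to at least k; otherwise order nothing.
    """
    if q <= 0:
        raise ValueError("Q must be a positive integer")
    if entering_inventory >= k:
        return 0
    shortfall = k - entering_inventory
    multiples = -(-shortfall // q)  # ceiling division on integers
    return multiples * q


if __name__ == "__main__":
    # Reproduce the (k, Q) = (8, 5) illustration from the text.
    for x in (9, 8, 7, 3, 2, -2):
        print(f"entering inventory {x:>2}: order {order_quantity(x, k=8, q=5)}")
```

Negative entering inventory is allowed, matching the example in which inventory of -2 through 2 leads to an order of two multiples of Q.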

(b) Given the validity of the policy in part (a), describe how to calculate an optimal k. 

(c) Apply your method in part (b) to the numerical example (4) and (5), with h = 5, 
π = 10, and c = .1. Find optimal policies for Q = 2 and Q = 4. 

(d) Assume p(q) = 1/10 for q = 1, 2, ..., 10. Let h = 5, π = 10, and c = .1. Apply 
your method in part (b) to find optimal policies for Q = 2 and Q = 4. 

*41 Consider a one-period inventory model with linear holding and penalty costs and 
demand described by a probability density function, as discussed in the special material 
at the end of Sec. 19.4. Suppose p(q) = m e^{-mq} for q ≥ 0, where m > 0 and 
E[q] = 1/m. 

(a) Derive an analytic expression for the optimal S. 

(b) Derive an expression for determining the optimal s. [Hint: the optimal s is the 
solution of the equation L(s) + cs = K + cS + L(S). Let D = S - s and use 
part (a) to express your answer in terms of D.] 

(c) Derive an approximation to the optimal D from the formula in part (b) by using 
a series expansion 1 + x + .5x^2. 

42 In each part below, assume that h = H·c and π = Π·c, where c is the unit purchasing 
cost. Discuss how the optimal policy depends on the value of c. 

(a) The lot-size model given by (7) and (8) in Sec. 19.5. 

(b) The lot-size model given by (17) in Sec. 19.5. 

(c) The stochastic demand model given by (17) through (20) in Sec. 19.6. 

43 Consider the lot-size models in Sec. 19.5 yielding the average cost formulas (6) and 
(15). 

(a) Show how to modify (6) and (15) if the purchase cost of Q is cQ?, instead of cQ. 
*(b) Derive the associated optimal lot-size formulas. 

44 Consider the lot-size model yielding formula (7) in Sec. 19.5 for the average cost per 
unit of time (where optimal s = 0). Suppose that the firm can request a fraction f 
of the order quantity to be delivered right away, and the remainder to be delivered 
after the quantity fQ is exhausted. 

(a) Suppose f is a given fraction. Derive the optimal Q analogous to (8). (Note: 
inventory holding cost is assessed against only the inventory on hand.) 

*(b) Suppose both f and Q can be determined by the firm. Find formulas for the 
optimal values of f and Q. What is the optimal time interval between the arrival 
of fQ and the remainder of the order? 

45 Optimal Production Lot-Size Model. Consider a lot-size model like that giving rise to the 
sawtooth pattern shown in Fig. 19.3. Suppose, however, that the order must be 
manufactured by the firm itself, and the production rate is N per unit of time, where 
N > M. Hence, the level of inventory no longer exhibits a vertical jump at the arrival 
of the lot of Q items. Rather, the inventory level increases at a slope of N - M until 
the lot Q has been produced, and then decreases at a slope -M (as before). 

(a) Suppose N = 75 and the rest of the data are as given in Fig. 19.3. Draw the resultant 
diagram for the inventory level. 

(b) Assuming s = 0 so that S = Q, derive the formula for the average cost per unit 
of time, analogous to (4) in Sec. 19.5, and determine the corresponding EOQ. 
Also determine the corresponding minimal average cost and the optimal time 
interval between successive production startups. 

(c) Show how to modify your answer in part (b) where you permit s to be negative 
and you assume an “average-backlog penalty cost.” Derive the formula for the 
average cost per unit of time, analogous to (15) in Sec. 19.5, and determine the 
corresponding EOQ, optimal s and S, minimal average cost, and the optimal 
time interval between successive production startups. 

(d) Apply your answers in parts (b) and (c) to the data h = 1, K = 32, c = 0, M = 9, 
N = 36, π = 9. 

46 Assume p_L(q_L) = 1/10 for q_L = 1, 2, ..., 10, where L = 1. Let h = 5, π = 10, and 
c = .1. In each case below, apply the algorithm in Sec. 19.6 to find an optimal policy. 
(Compare the results with those in exercise 10.) 

(a) Let K = 5. 

(b) Let K = 10. 

47 Assume p_L(q_L) = 1/20 for q_L = 0, 1, ..., 19, where L = 1. Let h = 1 and K = 20. 

(a) Find the values of s corresponding to P_L(s) = .75, .8, .85, .9, .95, according to 
(18), (19), and (20) in Sec. 19.6. 

(b) Calculate the corresponding values for Q using (17) and letting π = 0. Calculate 
the associated expected average inventory per unit of time using (12). 

(c) Summarize the results of the above parts by plotting a graph of expected average 
inventory as a function of the service level P_L(s). Give an economic interpretation 
of your results. 

48 Assume p_L(q_L) = 1/20 for q_L = 0, 1, ..., 19, where L = 1. Let h = 1 and K = 20. 

(a) Find the values of s corresponding to P_L(s) = .75, .8, .85, .9, .95, according to 
(18), (19), and (20) in Sec. 19.6. 

(b) For each s in part (a), find a value of π such that using this value and s in (17), 
the resultant value of R in (20) equals 

(c) Employing your answers in part (b), calculate the associated expected average 
inventory per unit of time using (12). 

(d) Summarize the results of the above parts by graphing expected average inventory 
as a function of the service level P_L(s). Also plot a graph of the service level as a 
function of π. Give an economic interpretation of your results. 

49 Let p(q) represent the probability of demand in a week, where q = 0, 1, ..., D. Let 
t(L) represent the probability that lead time is L weeks, where L = 1, 2, ... up to some 
maximum lead time. 
Assume weekly demands and lead times are completely independent random variables. 

(a) Derive a computational method for finding p_L(q_L), which represents the probability 
that demand during lead time is q_L. 

(b) Illustrate your method for p(q) = 1/4, q = 0, 1, 2, 3, and t(L) = 1/3, L = 1, 2, 3. 

(c) Using the data in part (b), calculate the probability of stockout if the reorder 
point is 5. Is 8. Is 10. 

*50 Consider Case 1 in Fig. 19.7. 

(a) Suppose you estimate the demand rate to be 6 and use the corresponding policy, 
but in fact the actual demand rate is 9. What is the expected average cost increase 
due to mis-estimation? Answer the same question if you incorrectly assume the 
demand rate is 12 when in fact it is 9. (Suggestion: calculate the actual expected 
average cost, the expected average cost if you had correctly estimated the parameter, 
and the expected average cost if the mis-estimated value of the parameter 
were truly correct.) 

(b) Suppose you estimate the variance of demand during lead time to be 6 and use 
the corresponding policy, but in fact the actual variance is 9. What is the expected 
average cost increase due to mis-estimation? Answer the same question if you 
incorrectly assume the variance is 12, but in fact it is 9. 

(c) Answer parts (a) and (b) where you mis-estimate both the demand rate and 
variance to be 6. To be 12. 

*51 Consider Case 2 in Fig. 19.7. Suppose you estimate the lead time to be 4 and use the 
corresponding policy, but in fact the actual lead time is 5. What is the expected 
average cost increase due to mis-estimation? Answer the same question if you incorrectly 
assume the lead time is 6, but in fact it is 5. (Suggestion: calculate the actual expected 
average cost, the expected average cost if you had correctly estimated the parameter, 
and the expected average cost if the mis-estimated value of the parameter were truly 
correct.) 

52 Consider the lot-size model yielding formula (7) in Sec. 19.5 for the average cost per 
unit of time (where optimal s = 0). In each part below, devise a computational 
technique for finding an optimal order quantity, given that the vendor offers quantity 
discounts for purchasing large-size lots. 

(a) Assume that c_j is the price per item if the order quantity Q satisfies q_{j-1} ≤ Q < q_j, 
where c_j > c_{j+1} ≥ 0. Also assume that h = H·c_j, where c_j is the unit cost incurred. 
Illustrate your procedure with the data K = 32, M = 9, H = 1, and c_1 = 16, 
for 0 ≤ Q < 7; c_2 = 9, for 7 ≤ Q < 75; and c_3 = 1, for 75 ≤ Q. Determine 
whether the solution differs if c_2 = 9, for 7 ≤ Q < 50; and c_3 = 1, for 50 ≤ Q. 
(Suggestion: explain your approach using a graph that contains for each c_j the 
average cost per unit of time as a function of Q.) 

(b) Assume that C(Q) is the total purchase cost and 

C(Q) = C(q_j) + c_{j+1}(Q - q_j) for q_j ≤ Q < q_{j+1}, 

where c_j > c_{j+1} ≥ 0, and C(0) = 0. Thus the discount is offered on an incremental 
basis. Illustrate your procedure using the data in part (a) and h = H·C(Q)/Q. 
(Suggestion: explain your approach using a graph that contains for each c_j the 
average cost per unit of time as a function of Q.) 

*(c) Determine in parts (a) and (b) whether an optimal Q can ever occur at a price 
break q_j. 

53 Consider a firm that purchases n different kinds of items from a single vendor and 
wants to determine the optimal time interval T between successive orders of all n 
kinds of items. For each kind of item j, let M_j be the demand rate, c_j the unit purchasing 
cost, h_j the unit holding cost. Assume that if n = 1, the appropriate average cost 
per unit of time would be given by (7) in Sec. 19.5 (where s = 0). For n > 1, continue 
to assume that the setup cost is K for the entire order, which now consists of n different 
kinds of items. 

(a) Find the optimal T and the associated order quantities Q_j for each kind of item j. 

(b) Apply your answer in part (a) to the data n = 2, each h_j = 1, c_j = 0, K = 32, 
and M_1 = 9 and M_2 = 36. 

(c) Suppose the firm has the option of separately ordering both items in part (b), 
except that the individual setup cost Kj = 32r, where r > 0. Find the value for r 
such that the firm would be indifferent between separate and joint ordering 
policies. 

54 Consider the example (23) in Sec. 19.6. Suppose there are two locations in the firm. 
Assume these data are applicable at each location, and that the demands at the two 
locations are completely independent. Suppose the firm has the option of either 
stocking the item at each location, thereby leading to the policy in (28) and twice the 
cost in (9), or combining the two sources of demand, and stocking in a single location. 

(a) Derive an optimal policy for the combined operation. What is the cost differential 
between this combined stocking and the independent stocking at two separate 
locations? [Hint: calculate the probability distribution of the sum of two independent 
random demands during a lead time, where the probability distribution 
for each demand is given by (23).] 

*(b) Perform the same analysis as in part (a) except use the data in (39) and the 
Normal approximation technique. (Hint: the variance of the sum of two 
independent random variables is the sum of the variances.) 

55 You can develop as follows a cruder approximate expected average cost per unit of 
time analogous to (16) in Sec. 19.6. Assume that, on the average, the lowest level of 
inventory is s - E[q_L]; so, as an approximation, the average inventory level is the sum 
(Q/2) + s - E[q_L]. 

(a) Derive the cost function analogous to (16). 

(b) Find formulas for the optimal order quantity and reorder point, analogous to 
(17) through (20), and compare the results. 

56 (a) Show how to modify the derivation in (9) through (16) in Sec. 19.6 if excess 
demand during a lead time is lost. (Hence, the level of inventory never falls 
below 0, as it does in Fig. 19.5.) Specifically, find the appropriate expected average 
cost per unit of time, analogous to (16), and derive formulas for the optimal 
order quantity and reorder point. Compare your results with those in (16) 
through (20). 

*(b) Derive a formula for the expected average cost per unit of time in part (a) by 
taking the ratio of the expected cost between successive reorders and the expected 
time between successive reorders. (Assume that lead time is constant and that there 
is never more than one order outstanding at any moment in time.) 


CHAPTER 20 

Waiting Line Models† 


20.1 INTRODUCTION 

Waiting lines, like stocks of inventory, are omnipresent. Think back for a moment 
on the times you waited for service during the past few days, perhaps at 
a cafeteria, bookstore, library, bank, gasoline station, dean’s office, and the like. 
Less apparent examples occur when you wait for a telephone operator to answer, 
for a traffic light to change, for the morning mail to be delivered, and for a midterm 
examination to be corrected and returned. Common to all of these cases 
are the arrivals of people or objects requiring service and the attendant delays 
when the service mechanism is busy. 

Operations research can very effectively analyze such queuing or congestion 
phenomena. But, as is true of all practical applications of operations research, 
the relevant decision problems must be of sufficient economic significance to 
warrant whatever expenditure of effort is needed to perform the scientific analysis. 
There are two important situations in which it has proved to be economically 
advantageous to use waiting line models; as you will quickly recognize, there is 
no clear line of demarcation between these two situations, and any particular 
application may fall somewhere in between. 

The first relates to a company that must design and operate many similar 
service facilities. Examples include deciding the number of 

• Checkout stands in each store of a large grocery chain. 

• Tellers’ windows in each local branch of a statewide bank. 

• Gasoline pumps and attendants in each station of a major oil company. 

• Trunk lines in each local telephone exchange. 

• Maintenance men to service leased photocopy equipment in each city. 

†This chapter requires a knowledge of elementary differential and integral calculus. 

Although the environmental data vary from store to store, branch to branch, 
station to station, and so forth, the same procedure can be used to analyze each 
service facility decision. Consequently, once a company develops the methodology 
for an operations research approach, the technique can be used repeatedly; for 
each separate application, the company need only apply the relevant data. 

The second category relates to a company that must make a facility decision 
on the number and capacity of automatic elevators to be installed in a large office 
building, or furnaces in a steel processing plant, or runways at an airport. 

To round out the discussion of applications, we mention two examples that 
combine the characteristics of both categories above: determining the number 
of check-in counters at an airport, and of fire engines at each local fire station of 
a metropolitan area. 

Once again, reflect on your activities during the past few days, and this time 
pick out those queuing situations that you think might actually warrant a scientific 
study. In which of the two categories above do they fall? What are the decisions 
that might be made as a result of such a study? How would you distinguish a 
good decision from a poor one? Try to answer these questions for several of the 
examples you picked. 

Perspective. What is said about birds is also true of waiting line models: 
their variety and number seem infinite. Even a treatise of several hundred pages 
would be too small for a complete survey of all the mathematical results of 
queuing models. And then it would have to be significantly enlarged within a 
year or so to encompass the continual growth in new research findings. Consequently, 
this chapter and Appendix III cannot possibly provide a survey of all 
waiting line models; instead this chapter highlights typical and critical insights 
to be garnered from queuing model analysis, and presents several models that are 
now fundamental in the literature of waiting line theory. 

This chapter and Appendix III deal with many waiting line models that are 
amenable to mathematical analysis. As you will see, the underlying waiting line 
systems in these models appear considerably simpler than most of the queuing 
situations you encounter in reality. What we demonstrate in this chapter is that 
you can sometimes employ these relatively simple models to provide qualitative 
and approximate quantitative information about the behavior of more complex 
structures. Then later in Chap. 21 you can learn how to apply computer simulation 
techniques to analyze intricate waiting line situations. 

In previous chapters, we have offered at least heuristic proofs of the most 
fundamental results. When the underlying mathematical intricacies were beyond 
the intended level of the text, we tried to present verbal arguments that could be 
made rigorous. Our rationale was that you would reach a better understanding 
of the results by having a “feel” for the way they are derived. There are some 
places below where we continue to follow this expository practice. But many 
mathematical derivations of waiting line results are not, in our opinion, very 
enlightening to a person intent on applying the theory. (The proofs are important, 
of course, to technical specialists who may want to apply similar approaches to 
solving new waiting line models.) Consequently, some of the formulas in this 
chapter are presented in “cookbook” fashion. 

We will mostly stress the insights to be gained from performing sensitivity 
analysis in waiting line models. In reading the material, you should pay special 
attention to the differing assumptions in the various models, and observe the 
qualitative effect these differences have on the results. You also should notice the 
impact of varying the numerical values for the parameters in models. As you will 
see, the scientific analysis of queuing phenomena can be complex; you will not 
be able to rely solely on unaided intuition to determine the impact of varying 
arrival rates, service rates, the number and arrangement of service facilities, etc. 

In sharp contrast to the rest of this text, the emphasis in this chapter and 
Appendix III is on a model’s operating characteristics, such as the 
average number of customers in a system, their average waiting time, the probability 
that all service facilities are busy, and similar measures of operating effectiveness. 
Once these characteristics have been obtained, you can proceed to 
construct an appropriate economic framework that encompasses them, and 
subsequently find optimal decisions. 

The optimizing task may be either easy or hard, depending on the complexity 
of the queuing system and the range of alternatives you wish to consider. For 
example, if you want to choose between having four versus five checkout stands 
at a grocery store, you can find the optimal decision by examining the effects of 
each separate alternative. But if you wish to design an air traffic control system 
at a busy airport, you may need a more sophisticated optimization technique than 
“examine each separate alternative”; the number of such alternatives may be 
unlimited. 

At present, there is no unified body of optimization theory for waiting line 
models. In most applications, the optimizing technique employed is ad hoc. But 
advanced research is underway in extending dynamic programming techniques 
analogous to those presented in Chap. 18 to assist in optimizing queuing system 
design. 


20.2 TAXONOMY OF WAITING LINE MODELS 

Before examining the solution of specific queuing models, you will find it useful 
to have an overview of such systems. This section provides a general framework, 
and also serves the purpose of building your vocabulary of the terms frequently 
employed in the description of waiting line models. An outline for the rest of the 
chapter appears at the end of this section. 

You can describe a queuing system by its input or arrival process, its queue 
discipline, and its service mechanism. Several possibilities for each of these are 
discussed in general terms below. In subsequent sections, more precise mathematical 
assumptions are specified in treating particular cases. 



Input process. The usual description of the pattern of arrivals into the 
system is given by the probability distribution of time between successive arrival 
events, and the number of individuals or units that appear at each of these events. 
For example, an arrival event at, say, a barber shop or a restaurant may occur 
on the average once in any 10-minute interval. For the barber shop, each arrival 
event consists of a single customer entering; for the restaurant, a party of one or more 
customers. (When more than one customer can enter the system at an arrival 
event, the situation is termed bulk arrivals.) Often, successive interarrival 
times are in fact statistically independent and stationary over long intervals of 
time, but, of course, they need not be in any particular situation. The two extreme 
assumptions about interarrival times are that they are predetermined or completely 
random, a notion to be made precise in the next section. 

Usually the source population from which the arrivals are drawn is considered 
as unlimited. To illustrate, such an assumption seems warranted in the case of 
train passengers seeking help in New York’s Grand Central Station. 

But in other situations, the source population is more appropriately modeled 
as being finite. One such example might be the number of drill presses breaking 
down and requiring service from a factory’s regular staff of repairmen. 

In some situations, a customer seeing a long line may balk, that is, not join 
the line. Depending on the circumstance, he may return later for service. Sometimes 
a customer cannot join the queue because there is no room left for him to 
wait in. Thus the input or arrival process may depend in part on the status of the 
queuing system. 

Queue discipline. This characteristic describes the order in which customers 
entering the system are eventually served. Frequently, the discipline is 
first come, first served. Certainly it is the easiest ordering to handle 
in mathematical models; it is also the ordering applied to customers who have 
entered a particular line. If you reflect on most of the waiting line situations you 
are familiar with, however, you will realize that many other disciplines are possible. 
Sometimes it is last come, first served. For example, consider 
what happens when you enter an empty elevator near the top floor of a multistory 
building, and as the elevator descends, it becomes crowded with additional passengers. 
The discipline in this situation may well be viewed as last come, first 
served, if you define “service” to be your exiting from the elevator when it reaches 
the ground floor. 

Sometimes the service order is virtually random. For example, teachers often 
try to use this discipline in calling upon students for recitation. And sometimes the 
discipline is governed by a priority system, such as “women and children first,” 
when waiting for a seat in a lifeboat on a sinking ship, or “age before beauty,” 
when waiting your turn in a revolving door. Finally, a customer may become 
impatient and decide to leave the system before being served; this behavior is 
termed reneging. 

Service mechanism. In common with the arrival process, a specification 
of the service mechanism includes a description of time to complete a service, and of 
the number of individuals whose requirements are satisfied at each service event. 
Referring again to the barber shop and restaurant, a service event is the departure 
of a single customer or of a party having several patrons. The time needed for 
service partly depends on the nature of the customer’s or patron’s requirements. 
But it may also depend on the state of the system; for example, the servers may 
hurry if many customers are waiting. Likewise, for each service facility, successive 
service times may, but need not, be described as independently and identically 
distributed random variables. Occasionally, it is appropriate to add a probabilistic 
contingency of a server breaking down over a limited period of time. 

The service mechanism also prescribes the number and configuration of 
servers or channels. When you arrive at a British airport, for example, 
you must go through Passport Control. There all passengers wait in a single line. 
When your turn comes, you will be directed to whichever official has just finished 
processing a passenger. In contrast, when you enter a bank during rush hours, 
you will find lines in front of each window. There you must select a single line to 
wait in. If you make a poor choice, you may wait longer to be served than someone 
who arrived after you did. (If the lines are not too crowded, you may be able to 
jockey into a line that is getting shorter than the one you are in.) 

Both the Passport Control and bank counter illustrations are examples of 
servers (or channels) in parallel. There are also numerous queuing 
systems in which the servers are arranged in series (tandem), and the customer 
must proceed from one server to the next, possibly waiting in line in front 
of each channel. One example is a job-shop, in which an order to be manufactured 
may be delayed at each of several machine facilities. Another example arises in 
driving an automobile from one end of the main downtown shopping street to the 
other, if the car has to wait at several stoplights. 

Waiting line analysis. Even with as sketchy a description of possibilities 
as the above, you can quickly appreciate that the number of conceivable waiting 
line models is enormous. Although it is easy to give precise mathematical formulations 
for these possibilities, they often do not yield usable mathematical results 
about the system’s characteristics. Consequently, in order to analyze most waiting 
line systems, you will find it standard practice to combine two approaches. The 
first is to use simple models, such as those given in this chapter, as rough approximations 
to the real system. Then, with these results for insight and guidance, 
you may go on to develop a computer simulation model that takes account of 
those facets that are important but hard to deal with by mathematical analysis. 
Since simulation is treated in Chap. 21, no more will be said about this approach 
here. 

Of course, the context of an application determines the particular system 
characteristics that are most important for design decisions. But usually you are 
interested in the probability distributions of the number of customers in the system 
and their waiting times, or at least in the long-run average values for these random 
variables. In addition, you sometimes want to know the probability of all the 
servers being idle or busy, the probability distribution for the length of the idle 
and busy periods for each server, the probability that the line length will exceed 
a specified number, and the probability distribution of the time between successive 
departures. If the queuing model is not too complicated, then explicit and easy-to-compute 
formulas can be derived for most of the above-mentioned quantities; the results 
in the present chapter are of this sort. But as soon as more complex assumptions 
are made, you may only be able to determine these quantities by so-called 
transforms, which are implicit formulas. A few such examples are given in Appendix 
III for the sake of illustration. 

The preceding section emphasized that an operations research approach to 
studying congestion systems is used to aid managerial decisions. Review the 
several examples in this and the previous section, and suggest a variety of managerial 
options that can be selected for the design of a system. You should find that 
in practically every instance, the decision-maker can affect all three aspects of the 
queuing system: the input process, the queue discipline, and the service mechanism. 
What is more, there are some intricate tradeoffs among the various decision 
alternatives. For example, the average time a customer spends in the system may 
be decreased by changing the arrival rate, adding servers, using faster servers, or 
reducing the variation in service time. The sections below will illustrate the relative 
effectiveness of each of these possibilities. 

A time to be born and a time to die. You can view the rest of this 
chapter as illustrating systems synthesis. We begin in the next section with 
a description of the birth or arrival process for customers entering a system. 
We examine the probabilistic characteristics of perhaps the most important 
birth process (in waiting line models, that is), namely, Poisson input. In Sec. 20.4, 
we perform a similar study of service times, or as it sometimes is gravely termed, a 
death process. A classic synthesis occurs in Sec. 20.5, which treats a system 
having a single server. The extension to a multiple-server situation is examined in 
Sec. 20.6. The advanced material at the end of the chapter presents a fairly 
general birth-and-death model which can be used to derive many of the earlier 
results as special cases. Appendix III is devoted to advanced mathematical 
techniques for systems synthesis. 

20.3 PROBABILITY DISTRIBUTIONS OF 
INTERARRIVAL TIMES 

From here on we assume that an arrival event is synonymous with a single 
individual coming to the system. (Bulk arrivals will only be treated in exercise 34.) 
The most convenient way to describe the arrival mechanism is to specify the 
probability distribution of interarrival times. 

Assume that the lengths of the intervals between arrivals are independently 
and identically distributed, and described by a continuous density function. This 
sort of input is an example of what is called a renewal process, and the 
succession of arrivals demonstrates what is termed a sequence of recurrent 
events. Let 

f(t) = density function for the time interval t between any two 
successive arrivals, 

where t ≥ 0, and also define 

1/λ = mean time between arrivals, 

so that 

λ = arrival rate per unit of time. 

Jot down the verbal description of 1/λ and λ (lambda). You can determine λ 
from f(t) by taking the mathematical expectation of t: 

(1)   \int_0^\infty t f(t)\,dt = \frac{1}{\lambda}   (mean time between arrivals). 

For example, if the unit of time is an hour and λ = 4 is the average number of 
arrivals per hour, then 1/λ equals one-fourth (.25) of an hour between arrivals 
(that is, one arrival on the average in any quarter of an hour). Similarly, if one 
arrival occurs on the average in any 10-minute interval, then the arrival rate λ is 
one-tenth (.1) per minute. 


Random arrivals. The most important example of an interarrival time 
distribution is that associated with completely random arrivals. Complete 
randomness means that the probability of an arrival occurring in any small 
interval of time (T, T + h) depends only on the length of the interval h and not 
on the interval’s starting point T or on the specific history of arrivals prior to T. 
In other words, the arrival process is both stationary, or as it is often called, 
homogeneous, and memoryless. In the special material below, we indicate 
how the assumption of completely random arrivals corresponds to 
postulating 

(2)   f(t) = \lambda e^{-\lambda t},   t \ge 0   negative exponential distribution 
      (mean = 1/λ, variance = 1/λ^2), 

where e = 2.71828... . Several examples of the (negative) exponential distributions 
for different values of λ are given in Fig. 20.1. 
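
The mean and variance quoted in (2) can be checked numerically. The following is a minimal sketch, assuming Python's standard random module as the source of exponential variates; the function name sample_interarrival_times, the seed, and the sample size are illustrative choices, not part of the text.

```python
import random
import statistics


def sample_interarrival_times(lam: float, n: int, seed: int = 0) -> list[float]:
    """Draw n exponential interarrival times with arrival rate lam (hypothetical helper)."""
    rng = random.Random(seed)
    return [rng.expovariate(lam) for _ in range(n)]


if __name__ == "__main__":
    lam = 4.0  # four arrivals per hour on the average
    times = sample_interarrival_times(lam, n=100_000)
    # Sample moments should be close to 1/lam and 1/lam**2 respectively.
    print("sample mean    :", statistics.fmean(times), " theory:", 1 / lam)
    print("sample variance:", statistics.pvariance(times), " theory:", 1 / lam**2)
```

With λ = 4 the sample mean should settle near .25 hour and the sample variance near .0625, up to sampling error.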

To check the memoryless property of the exponential distribution, suppose $t = 0$
represents the system’s starting point in time. Then the probability that no arrival
occurs in the interval $(0, T)$ is the same as the probability that the first arrival
occurs after $T$:

(3)  $P[t \ge T] = \int_T^\infty \lambda e^{-\lambda t}\,dt = e^{-\lambda T}.$

Now, the conditional probability that no arrival occurs in the interval $(0, T + h)$
given that no arrival occurs in the interval $(0, T)$ is, by definition,

(4)  $\frac{\int_{T+h}^\infty \lambda e^{-\lambda t}\,dt}{\int_T^\infty \lambda e^{-\lambda t}\,dt} = \frac{e^{-\lambda(T+h)}}{e^{-\lambda T}} = e^{-\lambda h} = P[t \ge h],$

which depends only on $h$. According to (4), the probability of no arrival in the
interval $(T, T + h)$ is the same regardless of whether there is no arrival in $(0, T)$
or whether an arrival occurs at $T$ and thereby “renews” the arrival process.
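
The memoryless property in (4) can be checked numerically. The following Python sketch is only an illustration: the function names and the values $\lambda = 4$ and $h = .25$ are assumptions chosen for the example and do not come from the text.

```python
import math


def survival(lam: float, t: float) -> float:
    """P[t >= T]: probability of no arrival in (0, T), as in (3)."""
    return math.exp(-lam * t)


def conditional_survival(lam: float, start: float, h: float) -> float:
    """P[no arrival in (0, start + h) | no arrival in (0, start)], as in (4)."""
    return survival(lam, start + h) / survival(lam, start)


lam = 4.0  # assumed arrival rate (arrivals per hour)
h = 0.25   # assumed additional interval length (hours)

# The ratio should equal exp(-lam * h) whatever the starting point is.
ratios = [conditional_survival(lam, start, h) for start in (0.0, 0.5, 1.0, 3.0)]
target = survival(lam, h)
```

Every entry of `ratios` should agree with `target` up to rounding error, which is the statement that the conditional probability depends only on $h$.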

There is another way of describing the completely random nature of the
exponential input process. The idea is given here in rough terms, but can easily
be made exact. Suppose there are $n$ arrivals in the interval $(0, T)$. Then if the
interarrival times are exponentially distributed, the $n$ arrival times are
independently and uniformly distributed over the interval $(0, T)$. This observation
provides the basis for several statistical tests to determine whether an exponential
distribution adequately describes an actual input process.

A complementary insight into the assumption of exponential interarrival times
is gained by expressing $e^{-\lambda h}$ in its Taylor series expansion

(5)  $P[\text{no arrival in any interval of length } h] = e^{-\lambda h} = 1 - \lambda h + \frac{(\lambda h)^2}{2!} - \frac{(\lambda h)^3}{3!} + \cdots.$

For a very small, but positive, value of $h$, the term $1 - \lambda h$ in (5) is relatively large
as compared to the remaining terms in the summation. Therefore, this value
can be used to approximate the probability in (5) when $h$ is very small. We use the
symbol ($\doteq$) to denote such an approximation. So we have, for very small $h > 0$,

(6)  $P[\text{no arrival in any interval of length } h] \doteq 1 - \lambda h$  ($h$ small).


A verbally inexact, but nevertheless helpful, way to explain the mathematical
manipulation below is to state that at most only one arrival occurs for a time
interval $h > 0$ sufficiently small. Since the approximate probability of no arrival
occurring in the interval of length $h$ is given by (6), the corresponding approximate
probability of one arrival occurring is

(7)  $P[\text{single arrival in any interval of length } h] \doteq \lambda h$  ($h$ small).

A more precise way of expressing the reasoning would be to display the exact
probability of a single arrival, in a manner similar to (5), and then show that
for very small $h$, the term $\lambda h$ is relatively large as compared to the remaining terms.
Throughout this chapter you should always interpret the symbol ($\doteq$) as meaning
that a quantity of relatively negligible magnitude is being ignored in the
approximation.

To illustrate, suppose $\lambda$ equals four arrivals per hour. Then the probability of
no arrival occurring in an interval of .01 hour is exactly .96079 from (5), and
approximately 1 − .04 = .96 from (6); the probability of an arrival is
approximately .04 from (7).
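
The arithmetic in this illustration can be reproduced with a few lines of Python. This is a minimal sketch; the variable names are assumptions made for the example.

```python
import math

lam = 4.0  # arrivals per hour, as in the illustration
h = 0.01   # interval length in hours

exact_no_arrival = math.exp(-lam * h)  # equation (5)
approx_no_arrival = 1.0 - lam * h      # equation (6)
approx_one_arrival = lam * h           # equation (7)

# What approximation (6) ignores; it is a little less than (lam * h)**2 / 2.
ignored = exact_no_arrival - approx_no_arrival

print(f"exact  P[no arrival]  = {exact_no_arrival:.5f}")
print(f"approx P[no arrival]  = {approx_no_arrival:.5f}")
print(f"approx P[one arrival] = {approx_one_arrival:.5f}")
print(f"ignored remainder     = {ignored:.6f}")
```

The ignored remainder is of the order of $(\lambda h)^2$, which is why it becomes negligible relative to $\lambda h$ as $h$ shrinks.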

Given that the density function for interarrival times is exponential (2), an
immediate consequence is that the density function of the total arrival time $y$ for
any $n$ consecutive arrivals is

(8)  $g(y) = \frac{\lambda(\lambda y)^{n-1} e^{-\lambda y}}{(n-1)!}, \quad y \ge 0$  (gamma distribution),

where $n$ is a positive integer. You may interpret $y$ as the sum of $n$ independent
values drawn from the same exponential density (2). Then

(9)  $P[\text{total interval for any } n \text{ consecutive arrivals} \le T] = \int_0^T g(y)\,dy = 1 - \sum_{j=0}^{n-1} \frac{(\lambda T)^j e^{-\lambda T}}{j!},$

as can be verified by repeatedly applying integration by parts.
Finally, you should note that assuming exponential interarrivals is tantamount
to postulating that the probability distribution of the number of arrivals $n$ in any
interval of length $T$ is Poisson:

(10)  $P[n \text{ arrivals in any interval of length } T] = \frac{(\lambda T)^n e^{-\lambda T}}{n!}$  for $n = 0, 1, 2, \ldots$  (Poisson distribution),

with

(11)  $E[n \mid T] = \lambda T$ and $\mathrm{Var}[n \mid T] = \lambda T$  (Poisson, interval of length $T$).

For this reason, a synonym for the term exponential arrivals is Poisson
input. (Sometimes the term Markovian is also used, and abbreviated by the
symbol M.)

From (9) and (10), you should verify that

(12)  $P[\text{total interval for any } n \text{ consecutive arrivals} \le T] = P[\text{number of arrivals in any interval } T \ge n].$

Actually, (12) is valid for any recurrent input process, not merely the Poisson, if
the interval starts right after an arrival.
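
Relations (9) and (12) lend themselves to a numerical check. The Python sketch below is illustrative only: the function names, the midpoint-rule integration, the truncation of the Poisson tail after 60 terms, and the values $\lambda = 4$, $T = 1$, $n = 3$ are all assumptions made for the example.

```python
import math


def gamma_density(y: float, n: int, lam: float) -> float:
    """Density (8) of the total time for n consecutive arrivals."""
    return lam * (lam * y) ** (n - 1) * math.exp(-lam * y) / math.factorial(n - 1)


def poisson_pmf(k: int, lam: float, T: float) -> float:
    """Probability (10) of exactly k arrivals in an interval of length T."""
    return (lam * T) ** k * math.exp(-lam * T) / math.factorial(k)


def total_time_cdf(T: float, n: int, lam: float) -> float:
    """Closed form on the right-hand side of (9)."""
    return 1.0 - sum(poisson_pmf(j, lam, T) for j in range(n))


def total_time_cdf_numeric(T: float, n: int, lam: float, steps: int = 20000) -> float:
    """Midpoint-rule integral of the density (8) from 0 to T."""
    width = T / steps
    return width * sum(gamma_density((i + 0.5) * width, n, lam) for i in range(steps))


def poisson_tail(n: int, lam: float, T: float, terms: int = 60) -> float:
    """P[number of arrivals >= n], truncating the infinite sum after `terms` terms."""
    return sum(poisson_pmf(k, lam, T) for k in range(n, n + terms))


lam, T, n = 4.0, 1.0, 3  # assumed illustrative values
left = total_time_cdf(T, n, lam)             # closed form in (9)
numeric = total_time_cdf_numeric(T, n, lam)  # direct integration of (8)
right = poisson_tail(n, lam, T)              # right-hand side of (12)
```

The three quantities should agree to within the accuracy of the numerical integration and the truncated sum.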

It is straightforward to apply the foregoing results to a pure-birth model.
Consider a system that starts at Time 0 with no customers. Assume that customer
arrivals obey a Poisson process, and that customers never depart from the system
after they have arrived. Then at Time $T$, the Poisson distribution in (10) gives
the probability of $n$ customers in the system. Similarly, (8) gives the density
function for the total arrival time of the first $n$ customers.


►The following discussion outlines how assuming certain properties of the arrival process
leads to the derivation of exponentially distributed interarrival times and Poisson input.
We postulate that

(A) The time intervals between successive arrivals are independently and identically
distributed; further, the probability of an arrival occurring in the time interval
between $T$ and $T + h$ depends only on the length $h$ of the interval and not on $T$.
The corresponding interarrival density function is designated as $f(t)$.

(B) In any interval of time $h > 0$, there is a positive probability of an arrival.

(C) In any sufficiently small interval of time, at most only one arrival can occur.

Suppose for simplicity that the system starts at Time 0 and the first arrival occurs
at Time $t$, where $t > 0$. Therefore $f(t)$ represents the density function for both the length
of the arrival intervals and the actual time for the first arrival.

Define

(i)  $g(T) = 1 - \int_0^T f(t)\,dt,$

so that

(ii)  $g(T) = P[\text{first arrival occurs after Time } T].$

Then given postulates (A) and (B),

(iii)  $g(T + h) = g(T)\,g(h)$  for all $T, h \ge 0$,

and it can be proved that the only function that satisfies (iii) is

(iv)  $g(T) = e^{-\lambda T},$

where $\lambda$ is a positive constant. Therefore,

(v)  $e^{-\lambda T} = 1 - \int_0^T f(t)\,dt,$

so that

(vi)  $f(T) = \lambda e^{-\lambda T},$

as was to be shown.

Define

(vii)  $P_n(T) = P[n \text{ arrivals occur in the interval } (0, T)].$

Let $t = x$ be the time of the first arrival event, and according to postulate (C), only one
customer enters at $x$. By postulate (A), we can write

(viii)  $P_n(T) = \int_0^T P_{n-1}(T - x) f(x)\,dx = \int_0^T P_{n-1}(y) f(T - y)\,dy$  for $n = 1, 2, \ldots,$

where $y = T - x$. Using (vi),

(ix)  $P_n(T) = \int_0^T P_{n-1}(y)\,\lambda e^{-\lambda(T - y)}\,dy.$

Differentiating with respect to $T$ yields

(x)  $\frac{dP_n(T)}{dT} = -\lambda P_n(T) + \lambda P_{n-1}(T)$  for $n = 1, 2, \ldots.$

Since $P_0(T) = g(T) = e^{-\lambda T}$, the equations in (x) can be solved recursively, starting with
$n = 1$, as each is a linear first-order differential equation with constant coefficients. The
complete solution is simply the Poisson distribution,

(xi)  $P_n(T) = \frac{(\lambda T)^n e^{-\lambda T}}{n!}$  for $n = 0, 1, 2, \ldots.$

Verify that this solution does in fact satisfy (x) by differentiating $P_n(T)$ in (xi). The same
line of reasoning applies to any interval $T$, thus yielding the result given in (10). ◄
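
The verification requested for (x) and (xi) can also be carried out numerically, as a complement to differentiating by hand. The following Python sketch is illustrative: the central-difference derivative, the function names, and the values $\lambda = 4$ and $T = .75$ are assumptions made for the example.

```python
import math


def poisson_pn(n: int, lam: float, T: float) -> float:
    """Candidate solution (xi): probability of n arrivals in (0, T)."""
    return (lam * T) ** n * math.exp(-lam * T) / math.factorial(n)


def derivative_pn(n: int, lam: float, T: float, eps: float = 1e-6) -> float:
    """Central-difference estimate of dP_n(T)/dT."""
    return (poisson_pn(n, lam, T + eps) - poisson_pn(n, lam, T - eps)) / (2.0 * eps)


def residual(n: int, lam: float, T: float) -> float:
    """Left side of (x) minus its right side; should be close to zero."""
    rhs = -lam * poisson_pn(n, lam, T) + lam * poisson_pn(n - 1, lam, T)
    return derivative_pn(n, lam, T) - rhs


lam, T = 4.0, 0.75  # assumed illustrative values
residuals = [residual(n, lam, T) for n in range(1, 6)]
```

Each residual should differ from zero only by the error of the finite-difference estimate.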

Taxicab Example. The following illustration indicates both the memoryless
property of the exponential distribution and the kind of surprising result that
can occur in queuing systems. Consider the illustration of hailing a taxicab
on a corner of Times Square in New York City. Assume, on the average, one empty
cab goes by the corner in any 30-second interval, that is, the mean time between
arrivals is $1/\lambda = 30$ seconds. Suppose you come to the corner at an arbitrary
instant. On the average, how long will you wait until the first empty cab arrives?
Most persons reply 15 seconds when asked this question. Did you? As you will
see below, that answer is correct only if an empty cab arrives at intervals of exactly
30 seconds. If there is any variability in the interarrival times, the answer is
always larger than 15 seconds.
It can be shown that if you examine the system at an arbitrary moment, then

(13)  $E[\text{time to first arrival}] = \frac{1}{2}\left[\frac{1}{\lambda} + \lambda\,(\text{variance of interarrival times})\right].$

Hence if the term “variance of interarrival times” is positive, then the average time
to the first arrival is larger than $\frac{1}{2}(1/\lambda)$. Note that when the interarrival
distribution is exponential, its variance is $1/\lambda^2$, so that the average time to the first arrival is
$1/\lambda$. And when the variance is larger than $1/\lambda^2$, the average time to the first arrival
is actually larger than the average interarrival time!
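
Formula (13) is easy to evaluate for the taxicab data. The Python sketch below is illustrative only; the function name and the third case, with a variance of four times $1/\lambda^2$, are assumptions made for the example.

```python
def mean_wait_for_first_arrival(mean_gap: float, variance_gap: float) -> float:
    """Equation (13): expected wait when the system is examined at an arbitrary moment."""
    lam = 1.0 / mean_gap
    return 0.5 * (mean_gap + lam * variance_gap)


mean_gap = 30.0  # seconds between empty cabs, on the average

regular = mean_wait_for_first_arrival(mean_gap, 0.0)                 # no variability
exponential = mean_wait_for_first_arrival(mean_gap, mean_gap ** 2)   # variance 1/lambda^2
bursty = mean_wait_for_first_arrival(mean_gap, 4.0 * mean_gap ** 2)  # assumed larger variance
```

With perfectly regular cabs the wait is 15 seconds, with exponential interarrivals it equals the full 30 seconds, and with the assumed larger variance it exceeds the mean interarrival time.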

Erlang arrivals. Another important example of an interarrival time
distribution is

(14)  $f(t) = \frac{(n\lambda)(n\lambda t)^{n-1} e^{-n\lambda t}}{(n-1)!}, \quad t \ge 0$  (Erlang distribution, order $n$),

where $n$ is a positive integer; the expectation and variance are

(15)  $E[t] = \frac{1}{\lambda}$  $\mathrm{Var}[t] = \frac{1}{n\lambda^2}$  (Erlang).

(Frequently, this distribution is denoted by the symbol $E_n$.) Replacing $n\lambda$ in (14)
by $\lambda$ yields the gamma distribution, shown in (8).

By varying $\lambda$ and $n$ appropriately, you can use the Erlang distribution to
provide good approximations for many different interarrival distributions; several
illustrative examples are given in Fig. 20.2. Be sure you notice that when $n = 1$,
the Erlang distribution is simply the exponential distribution. What is more,
when you let $n \to \infty$, so that $\mathrm{Var}[t] \to 0$ as a result, then the Erlang distribution
represents the case of periodic or regular arrival times, that is, a constant
interarrival interval of $1/\lambda$. Examples employing this distribution are given
in Appendix III.
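
The moments in (15) can be confirmed by integrating the density (14) numerically. This Python sketch is a rough check under stated assumptions: the midpoint rule, the integration limit of 20 time units, the function names, and the value $\lambda = 2$ are all choices made for the example.

```python
import math


def erlang_density(t: float, n: int, lam: float) -> float:
    """Erlang density (14) of order n with mean 1/lam."""
    rate = n * lam
    return rate * (rate * t) ** (n - 1) * math.exp(-rate * t) / math.factorial(n - 1)


def erlang_moments(n: int, lam: float, upper: float = 20.0, steps: int = 100000):
    """Mean and variance by midpoint-rule integration over (0, upper)."""
    width = upper / steps
    mean = second = 0.0
    for i in range(steps):
        t = (i + 0.5) * width
        weight = erlang_density(t, n, lam) * width
        mean += t * weight
        second += t * t * weight
    return mean, second - mean ** 2


lam = 2.0  # assumed arrival rate
mean_1, var_1 = erlang_moments(1, lam)  # exponential case
mean_2, var_2 = erlang_moments(2, lam)
mean_5, var_5 = erlang_moments(5, lam)
```

The mean stays at $1/\lambda$ for every order, while the variance falls in proportion to $1/n$, consistent with the approach to regular arrivals as $n$ grows.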


20.4 PROBABILITY DISTRIBUTIONS OF SERVICE TIMES 


The considerations in specifying the probability density for each service time
are very similar to those for interarrival times. We assume that each server, or
channel, processes one customer at a time. (We will not consider bulk service in
this book.) For a specified server, assume that successive serving times are
independently and identically distributed, and described by a continuous density
function. Let

(1)  $g(t)$ = density function for the length of time $t$ to serve any customer,

where $t \ge 0$, and also let

(2)  mean service time $= \int_0^\infty t\,g(t)\,dt = \frac{1}{\mu},$

so that

(3)  $\mu$ = service rate per unit of time that the server is busy.

Jot down the definitions in (2) and (3).

Figure 20.2 (a) Erlang Distribution for $\lambda = 1$.

To illustrate, if the unit of time is an hour and $\mu$ (mu) equals five services per
hour while the server remains busy, then the average service time $1/\mu$ is one-fifth
(.2) of an hour. Similarly, if a service on the average requires one-half an hour,
then the service rate $\mu$ is 2 per hour, during an interval when the server remains
busy.

Frequently, the service time distribution is assumed to be exponential:

(4)  $g(t) = \mu e^{-\mu t}, \quad t \ge 0.$

Mathematical convenience is the usual reason for such an assumption. But the
exponential service assumption can also provide guiding insight into the
operation of any system, since it represents the extreme case of service times that
are memoryless. When the service distribution is exponential, the probability of
completing a service to a customer in any subsequent interval of length $h$ is
independent of how much service time has already elapsed for that customer.

Accordingly, given assumption (4), if a customer is being served at Time $t$
and you observe the system at Time $t + h$, then

(5)  $P[\text{service is not completed in interval of length } h] = e^{-\mu h}.$

Consequently, for $h > 0$ very small,

(6)  $P[\text{service is not completed in interval of length } h] \doteq 1 - \mu h$  ($h$ small)

and

(7)  $P[\text{service is completed in interval of length } h] \doteq \mu h$  ($h$ small).

Consider the following pure-death model. Let $t = 0$ represent the
system’s starting point in time. Suppose that there are $M$ customers in the system
at Time 0, and that no more customers ever arrive. Assume that there is a single
server having an exponential time density (4), and let

(8)  $P_n(T)$ = probability that $n$ customers are in the system at Time $T$.

[The dependence of $P_n(T)$ on the value of $M$ has been suppressed in the notation.]
The formula for $P_n(T)$ can be derived using an exact approach analogous to the
derivation of (10) that we gave in the special material at the end of the preceding
section. But the result can also be obtained by employing (6) and (7) above, and
the approach is illustrated next because it is so useful for analyzing more complex
models.

As in the preceding section, we calculate approximate probabilities by ignoring
relatively small quantities. And in the same approximative vein, we say that at
most only one departure occurs during a very small interval of time $h > 0$. Hence,
when there are $n$ customers in the system at Time $T + h$, we consider only the
possibilities that at Time $T$ either there were $n$ customers and none have departed,
or there were $n + 1$ customers and one departed during the very small interval
of length $h$. (Actually, the other possibilities of there being more than $n + 1$
customers and accordingly more than one departure are relatively improbable
when $h > 0$ is very small.) Consequently, for $1 \le n < M$,

(9)  $P_n(T + h) \doteq (1 - \mu h)P_n(T) + \mu h\,P_{n+1}(T).$

The first term on the right is the approximate probability that no service occurred
in the interval of length $h$ and that $n$ customers were in the system at Time $T$.
Interpret the second term on the right of (9). Rearranging terms yields

(10)  $\frac{P_n(T + h) - P_n(T)}{h} \doteq -\mu P_n(T) + \mu P_{n+1}(T),$

so that on letting $h \to 0$,

(11)  $\frac{dP_n(T)}{dT} = -\mu P_n(T) + \mu P_{n+1}(T)$  for $1 \le n < M$.

The reason equation (11) holds exactly, instead of approximately, is that all the
terms of relatively small magnitude that were ignored in (10) actually disappear
in the process of letting $h$ approach 0 in the limit.

By a similar line of reasoning, you can determine that

(12)  $\frac{dP_M(T)}{dT} = -\mu P_M(T)$  for $n = M$.

The unique solution to the system of linear differential equations (11) and (12) is

(13)  $P_n(T) = \frac{(\mu T)^{M-n} e^{-\mu T}}{(M-n)!}$  for $n = 1, 2, \ldots, M$

(14)  $P_0(T) = 1 - \sum_{n=1}^{M} P_n(T)$  for $n = 0$.

The distribution in (13) and (14) is sometimes called a truncated Poisson.
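
The truncated Poisson in (13) and (14) can be tabulated directly. The following Python sketch is illustrative; the function name and the values $M = 5$, $\mu = 5$, and $T = .5$ are assumptions made for the example.

```python
import math


def pure_death_distribution(M: int, mu: float, T: float) -> list:
    """P_n(T) for n = 0..M, with M customers at Time 0 and one exponential server."""
    probs = [0.0] * (M + 1)
    for n in range(1, M + 1):
        served = M - n  # number of services completed by Time T
        probs[n] = (mu * T) ** served * math.exp(-mu * T) / math.factorial(served)  # (13)
    probs[0] = 1.0 - sum(probs[1:])  # (14): the remaining probability
    return probs


M, mu, T = 5, 5.0, 0.5  # assumed illustrative values
probs = pure_death_distribution(M, mu, T)
```

Here `probs[M]` is the probability that no service at all has been completed by Time $T$, namely $e^{-\mu T}$, and `probs[0]` collects the Poisson tail that is cut off because the system cannot hold fewer than zero customers.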

If the $M$th person is the last to be served, then the total time $y$ that he spends
in the system, including his own service time, has the density given by the sum of
$M$ exponentially distributed variables

(15)  $h(y) = \frac{\mu(\mu y)^{M-1} e^{-\mu y}}{(M-1)!}$  (gamma distribution),

with

(16)  $E[y] = \frac{M}{\mu}$  $\mathrm{Var}[y] = \frac{M}{\mu^2}$  (gamma).

Although an exponential service time distribution is by far the most
mathematically convenient form to assume, it is possible to derive usable mathematical
results for certain simple queuing models when the service distribution is assumed
to be Erlangian, as given by (14) in the previous section. An illustration appears
in Appendix III.


*Self-service model. Now suppose, instead of there being only a single
server, that at Time 0 each of the $M$ customers starts self-service. Assume that the
service time is the same exponential density for every customer, namely (4). This
assumption is reasonable enough in a self-service situation. Consider a very small
interval of time $h > 0$. Then because the servers are independent, you can apply
binomial probability calculations, using the approximate expression in (6), to
obtain

(17)  $P[\text{none of } n \text{ customers departs}] \doteq (1 - \mu h)^n \doteq 1 - n\mu h$  ($h$ small)

(18)  $P[\text{one of } n \text{ customers departs}] \doteq n\mu h$  ($h$ small).

Once again, the justification for (18) is that when interval $h > 0$ is small, we
can restrict attention to the events of no and one departure; the possibilities of
more departures have relatively negligible probability. Consequently, when there
are $n$ customers in the system at Time $T + h$, we consider only the possibilities
that at Time $T$ there were $n$ customers and none have departed, or there were
$n + 1$ customers and one departed, giving, for $0 \le n < M$,

(19)  $P_n(T + h) \doteq (1 - n\mu h)P_n(T) + (n + 1)\mu h\,P_{n+1}(T)$  ($h$ small).

Bringing $P_n(T)$ to the left-hand side of (19), dividing by $h$, and letting $h \to 0$
yields

(20)  $\frac{dP_n(T)}{dT} = -n\mu P_n(T) + (n + 1)\mu P_{n+1}(T)$  for $0 \le n < M$.

Similar reasoning gives

(21)  $\frac{dP_M(T)}{dT} = -M\mu P_M(T)$  for $n = M$.

As you can verify by substitution, the complete solution to (20) and (21) is

(22)  $P_n(T) = \binom{M}{n} \left(e^{-\mu T}\right)^n \left(1 - e^{-\mu T}\right)^{M-n}$  for $n = 0, 1, 2, \ldots, M$  (binomial distribution),

with

(23)  $E[n \mid T] = M e^{-\mu T}$  $\mathrm{Var}[n \mid T] = M e^{-\mu T}\left(1 - e^{-\mu T}\right).$
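
The binomial solution (22) and its moments (23) can be tabulated as follows. This Python sketch is illustrative; the function name and the values $M = 6$, $\mu = 2$, and $T = .4$ are assumptions made for the example.

```python
import math


def self_service_distribution(M: int, mu: float, T: float) -> list:
    """Binomial probabilities (22) that n of the M customers are still in service at Time T."""
    stay = math.exp(-mu * T)  # probability that one given customer has not yet finished
    return [math.comb(M, n) * stay ** n * (1.0 - stay) ** (M - n) for n in range(M + 1)]


M, mu, T = 6, 2.0, 0.4  # assumed illustrative values
probs = self_service_distribution(M, mu, T)
stay = math.exp(-mu * T)

# Moments computed from the distribution, for comparison with (23).
mean = sum(n * p for n, p in enumerate(probs))
variance = sum(n * n * p for n, p in enumerate(probs)) - mean ** 2
```

The computed `mean` and `variance` should match $M e^{-\mu T}$ and $M e^{-\mu T}(1 - e^{-\mu T})$ up to rounding error.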


*Queuing model nomenclature. In reading the queuing theory literature,
you will find that a standardized terminology is employed. (It is often called
Kendall’s notation.) Each model is signified by three symbols: the first
designates the input process; the second, the service distribution; and the third,
the number of servers. The standard symbols for the probability distributions are

M = exponentially distributed interarrival or service time (M is an abbreviation
for Markovian)

D = deterministic (or constant, regular) interarrival or service time
$E_n$ = Erlangian distribution of order $n$ for interarrival or service time

[some books also use the symbol and then employ the formula 
for a gamma distribution (15)] 

GI = general independent distribution of interarrival time
G = general distribution of service time.

To illustrate, the model of Poisson input, exponential service, and single server
is denoted by M/M/1. If the input were deterministic instead, then the notation
would be D/M/1; further, if there were $S$ servers, the notation would be D/M/S.

20.5 SINGLE-SERVER MODEL WITH POISSON 
INPUT AND EXPONENTIAL SERVICE 

Most waiting line systems are composed of several servers, and more often than 
not, the queue discipline is quite complex. For example, a woman choosing a 
checkout stand at a supermarket looks at the number of women already in line 
and the number of items in their carts. She also considers which lines are closest 
to the aisle she is emerging from, as well as which checkout clerks are the fastest. 
Similar considerations are relevant when a man selects a line in front of a teller’s 
window at a bank, or when a driver chooses a toll booth on a turnpike. But sometimes
the discipline is on a strict first come, first served basis. One example is a
bakery that gives each entering customer a number, and then serves the customers 
in numerical order. Other examples occur at gasoline stations, camera shops, 
shoeshine stands, and in a crowded but orderly cafeteria line. 

As we have stated earlier, developing an operations research model for an 
actual situation inherently involves making approximations. This is so regardless 
of whether, in the case of a waiting line model, you analyze the problem using 
either mathematics or a computer simulation or both. Often you can obtain
approximate information about a complex system’s operating characteristics by
examining certain extreme or limiting cases. One such approximate approach is
to consider a queuing system with multiple servers as being comprised of a collection
of separate and independent single-server systems. For example, suppose the
system consists of five servers and the input process indicates that 20 customers 
arrive per hour. Then as an approximation, the system can be viewed as five separate 
single-server systems, each with an arrival rate of 4 per hour. The approach is 
approximate for two reasons: a customer is assumed to join each line with 
equal probability, regardless of the length of the several lines; and once a customer 
joins a line, he is assumed to stay there. The expected number of customers that 
are in each system, summed over all the separate systems, and the average time a 
customer spends in a system will usually be larger than in most actual multiple- 
server systems. The reason is that if this queuing system were to operate in real 
life, then there would be more times than you would expect when one server is 
idle and yet customers are waiting in line for other servers.

If you also consider the approximation that there is only a single line in a 
multiple-server system and that customers are processed on a first come, first
served basis, then you may obtain smaller than actual values for the expected 
number of customers and the average time each spends in the system. 

Fortunately, single-server systems often do yield to mathematical analysis, as 
do many cases of multiple servers with a single line and a first come, first served 
discipline. We therefore begin here with a simple single-server queuing system,
and treat a simple multiple-server model in Sec. 20.6. More complex single-server
models are studied in Appendix III.

Model description. The simplest single-server model having both input
and service processes described probabilistically is one with exponential
interarrival and service times (denoted by M/M/1). Specifically, assume

(1)  exponential interarrival density: $f(t) = \lambda e^{-\lambda t}$
     exponential service time density: $g(t) = \mu e^{-\mu t}.$

Define the number of customers $n$ in the system at any point in time as including
the persons waiting in line plus those in service. Let $t = 0$ represent the system’s
starting point in time, and define

(2)  $P_n(T)$ = probability that $n$ customers are in the system at Time $T$.

Actually $P_n(T)$ depends on the number of customers $i$ in the system at Time 0, but
this dependence is suppressed in the notation.

Let $h > 0$ be a very small interval of time. If there are $n > 0$ customers in the
system at Time $T + h$, then we consider only the possibilities that there were
either $n - 1$, $n$, or $n + 1$ customers at Time $T$. Any other possibilities are of
relatively insignificant probability. Consequently, for $n > 0$

(3)  $P_n(T + h) \doteq (\lambda h)(1 - \mu h)P_{n-1}(T) + (1 - \lambda h)(1 - \mu h)P_n(T) + (\lambda h)(\mu h)P_n(T) + (1 - \lambda h)(\mu h)P_{n+1}(T)$  ($h$ small).

The first term on the right corresponds to the event of one arrival and no departure
when $n - 1$ customers are in the system at Time $T$. The second and third terms
refer to the events of no arrival and no departure, and of one arrival and a departure
when $n$ customers are in the system at Time $T$. And the final term relates to
the event of no arrival and one departure when there are $n + 1$ customers in the
system at Time $T$. As the symbol ($\doteq$) indicates, the expression in (3) is approximate;
it can be made exact by adding probability terms with coefficients $h^k$ where
$k \ge 2$.

Bringing the term $P_n(T)$ to the left-hand side of (3), dividing by $h$, and letting
$h \to 0$, yields

(4)  $\frac{dP_n(T)}{dT} = \lambda P_{n-1}(T) - (\lambda + \mu)P_n(T) + \mu P_{n+1}(T)$  for $n > 0$.

This expression is exact because the terms neglected in (3) become 0 as $h \to 0$.
Similarly, you can show that

(5)  $\frac{dP_0(T)}{dT} = -\lambda P_0(T) + \mu P_1(T)$  for $n = 0$.

With moderately advanced analysis it is possible to solve the system of linear
differential equations (4) and (5) for each $P_n(T)$. (To do so, you must also state
the number of customers $i$ in the system at Time 0.) The result is called the
transient solution, since it depends directly on the value of $T$.
Suppose, however, we examine the values of $P_n(T)$ as $T \to \infty$. If $P_n(T)$
approaches a limiting value, say $P_n$, and if $E[n]$ is finite for this limiting distribution,
then we will say that the system reaches statistical equilibrium. We
denote the resulting values $P_n$ as equilibrium or stationary
probabilities. The label “stationary” derives from the property that if the number
of customers in the system at any Time $t$ is given according to the probability
distribution $P_n$, then for any $h > 0$, $P_n$ is also the probability that $n$ customers are
in the system at Time $t + h$. The value of $P_n$ can also be interpreted as the limiting
fraction of an arbitrarily long period of time during which the queue contains $n$
customers.

Provided

(6)  $\rho = \frac{\lambda}{\mu} < 1$  (traffic intensity assumption),

the stationary probabilities always exist; the symbol $\rho$ (rho) in (6) is frequently
called the traffic intensity. Give a verbal interpretation of the assumption
in (6).

You can find the equilibrium solution $P_n(T) = P_n$, for all $T$, by using the
consequence that each $dP_n/dT$ must equal 0, if the solution is indeed independent
of $T$. Hence to obtain $P_n$ all you need to do is set the time derivatives in (4)
and (5) equal to 0, yielding

(7)  $0 = \lambda P_{n-1} - (\lambda + \mu)P_n + \mu P_{n+1}$  for $n = 1, 2, 3, \ldots$

(8)  $0 = -\lambda P_0 + \mu P_1$  for $n = 0$.

The system of difference equations (7) and (8) is easily solved recursively,
starting with (8),

(9)  $P_1 = P_0\left(\frac{\lambda}{\mu}\right) = P_0\,\rho$

and proceeding to (7) for $n = 1, 2, \ldots,$

(10)  $P_n = P_0\,\rho^n.$

[Verify that $P_n$ in (10) does satisfy (7).] Given (6),

(11)  $1 = \sum_{n=0}^{\infty} P_n = P_0 \sum_{n=0}^{\infty} \rho^n = \frac{P_0}{1 - \rho},$

and it follows that $P_0 = 1 - \rho$, so that

(12)  $P_n = (1 - \rho)\rho^n$  for $n = 0, 1, 2, \ldots$  (geometric distribution)

with

(13)  $E[\text{number of customers in system}] = E[n] = \frac{\rho}{1 - \rho} = \frac{\lambda}{\mu - \lambda}$
      $\mathrm{Var}[n] = \frac{\rho}{(1 - \rho)^2}$  $P[n \ge N] = \rho^N.$

Notice the probability distribution in (12) depends only on the traffic intensity
ratio ($\lambda/\mu = \rho$). Since $\rho$ ($= 1 - P_0$) is also the fraction of time the server is busy,
the quantity $\rho$ is sometimes called the system’s utilization factor. It is
significant that this interpretation of $\rho$ remains valid even when both the
interarrival and service time distributions are general (that is, for the model GI/G/1).
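
The verification suggested for (10), together with the summary measures in (13), can be carried out numerically. The Python sketch below is illustrative only: the function name, the truncation of the geometric distribution at $n = 400$, and the rates $\lambda = 8$ and $\mu = 10$ are assumptions made for the example.

```python
def mm1_stationary(lam: float, mu: float, n_max: int) -> list:
    """Stationary probabilities (12) for n = 0..n_max; requires lam < mu as in (6)."""
    rho = lam / mu
    if rho >= 1.0:
        raise ValueError("traffic intensity must be below 1")
    return [(1.0 - rho) * rho ** n for n in range(n_max + 1)]


lam, mu = 8.0, 10.0  # assumed rates, so that rho = .8
rho = lam / mu
probs = mm1_stationary(lam, mu, n_max=400)

# Residuals of the balance equations (8) and (7); each should be essentially zero.
residual_0 = -lam * probs[0] + mu * probs[1]
residuals = [lam * probs[n - 1] - (lam + mu) * probs[n] + mu * probs[n + 1]
             for n in range(1, 50)]

mean = sum(n * p for n, p in enumerate(probs))  # compare with rho / (1 - rho)
tail_5 = sum(probs[5:])                         # compare with rho ** 5
```

Truncating at 400 terms leaves out a probability mass of about $\rho^{401}$, which is negligible for the assumed $\rho$.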


►Letting $P_n(T|i)$ denote the transient solution to (4) and (5) when $i$ customers are in
the queue at $t = 0$, it can be shown that

(i)

Consequently, $P_n(T|i)$ approaches $P_n$ at least exponentially by a factor proportional to $T$.
Note, however, that the proportionality factor, which can be written as
approaches 0 as $\lambda$ approaches $\mu$. Hence $T$ may have to be very large before $P_n(T|i)$ and
$P_n$ are nearly equal; this is especially so when $\rho$ is large and $i$ is small. ◄


Operating characteristics. The expected line length can be found by
noting that

(14)  line length $= \begin{cases} \text{number in system} & \text{if } n = 0 \\ \text{number in system} - 1 & \text{if } n > 0, \end{cases}$

so that

(15)  $E[\text{line length}] = 0 \cdot P_0 + \sum_{n=1}^{\infty} (n - 1)P_n = \sum_{n=0}^{\infty} nP_n - \sum_{n=1}^{\infty} P_n = \frac{\rho}{1 - \rho} - \rho = \frac{\rho^2}{1 - \rho}.$

Next consider the time intervals when the server is idle. Since these begin when
a service terminates and end when a new arrival occurs, the length of the idle
periods has the same distribution as the interarrival time, that is, exponential
with mean $1/\lambda$. Let the period of length $T$ be so long that we can safely utilize
expected values. Then the server is idle for [$TP_0 = T(1 - \rho)$] units of time, and
[$T(1 - \rho)/(1/\lambda) = \lambda T(1 - \rho)$] is the number of separate idle periods during $T$.
Because idle and busy periods alternate, $\lambda T(1 - \rho)$ is also the number of separate
busy periods during $T$, and $\rho T$ is the total duration of all busy periods.
Consequently,

(16)  $E[\text{length of busy period}] = \frac{\rho T}{\lambda T(1 - \rho)} = \frac{1}{\mu(1 - \rho)} = \frac{1}{\mu - \lambda}$

and

(17)  $E[\text{number of customers served per busy period}] = \mu\,E[\text{length of busy period}] = \frac{1}{1 - \rho}.$

The relations (16) and (17) are actually valid for any service time distribution
(that is, for the model M/G/1).

Now turn to the probability density for the time a customer spends in the
system, which is defined as the interval a customer waits in line plus his time in
service. Suppose the system is in statistical equilibrium, so that when a new
customer arrives, he finds $n$ customers in the system ahead of him with probability
$P_n$ given by (12). Assume that the queue discipline is first come, first served. Then
his total time spent in the system is composed of the sum of $n + 1$ independent
and identically distributed exponential random variables, and has a gamma
density

(18)  $\frac{\mu(\mu w)^n e^{-\mu w}}{n!}$  for $w \ge 0$,

as you already saw in (15) of Sec. 20.4. Hence the density of time that a customer
who arrives at an arbitrary instant spends in the system is given by

(19)  $h(w) = \sum_{n=0}^{\infty} (1 - \rho)\rho^n\,\frac{\mu(\mu w)^n e^{-\mu w}}{n!} = \mu(1 - \rho)e^{-\mu(1 - \rho)w}$  (exponential distribution),

with

(20)  $E[\text{time in system}] = E[w] = \frac{1}{\mu(1 - \rho)} = \frac{1}{\mu - \lambda}$

(21)  $E[\text{time in line}] = E[\text{time in system}] - E[\text{service time}] = \frac{1}{\mu}\left(\frac{\rho}{1 - \rho}\right) = \frac{\lambda}{\mu(\mu - \lambda)}.$

For fixed $\rho$, the expected times in the system and in line vary inversely with the
service rate $\mu$.


► Suppose you look at only the time that a customer waits in line, where you exclude
his service time as well as that of any customers who arrive when the server is idle. Then
it can be proved that the conditional probability density of the time spent in line by those
customers who do have to wait in line is also given by (19) for any interarrival
distribution, that is, for the model GI/M/1. Consequently, $E[w]$ in (20) is also the conditional
expected time waiting in line given that a customer does have to wait in line.

Sensitivity analysis. Various operating characteristics of this simple
queuing system are displayed in Fig. 20.3 for different values of traffic intensity $\rho$
and service rate $\mu$.

Notice that as the traffic intensity $\rho$ increases, the expected values of the number
of customers, the line length, the time in the system, and the time in line [formulas
(13), (15), (20), and (21) above] all increase rapidly. Although these quantities
can be made arbitrarily large for sufficiently large $\rho < 1$, the system may take
an accordingly long time to reach steady-state equilibrium. For a given service
rate $\mu$, when intensity $\rho$ is small, most of the expected time a customer spends in
the system is due to the average service time $1/\mu$; but as intensity $\rho$ increases
(that is, as the arrival rate $\lambda$ increases), most of the expected time spent in the
system is due to waiting in line.

For the sake of illustration, suppose the unit of time in Fig. 20.3 is an hour (or
60 minutes) and that $\rho = .8$. Then on the average, the server is idle .2 hour (or
12 minutes per hour) and there are four persons in the system. If $\mu = 10$, so that the
service rate is 10 per hour (or at the rate of six minutes per customer), then the
average time a customer spends in the system is .5 hour (or 30 minutes), and .4
hour (or 24 minutes) of this is due to waiting in line. If $\rho$ remains .8 but both the
arrival rate and service rate double, so that $\mu = 20$, then the average times spent
in the system and in waiting are cut in half.

Figure 20.3 Operating Characteristics of M/M/1 System ($\mu$ = service rate per unit of time; $\rho = \lambda/\mu$ = traffic intensity).
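
The figures quoted in this illustration follow from formulas (13), (15), (16), (17), (20), and (21). The Python sketch below collects them in one function; the function name and dictionary keys are assumptions made for the example, and the rates correspond to $\rho = .8$ with $\mu = 10$ and $\mu = 20$.

```python
def mm1_characteristics(lam: float, mu: float) -> dict:
    """Steady-state operating characteristics of the M/M/1 model, for lam < mu."""
    if lam >= mu:
        raise ValueError("requires lam < mu")
    rho = lam / mu
    return {
        "rho": rho,
        "idle_fraction": 1.0 - rho,                 # P_0
        "number_in_system": lam / (mu - lam),       # (13)
        "line_length": rho * lam / (mu - lam),      # (15)
        "busy_period": 1.0 / (mu - lam),            # (16)
        "served_per_busy_period": mu / (mu - lam),  # (17)
        "time_in_system": 1.0 / (mu - lam),         # (20)
        "time_in_line": lam / (mu * (mu - lam)),    # (21)
    }


base = mm1_characteristics(lam=8.0, mu=10.0)      # rho = .8, service rate 10 per hour
doubled = mm1_characteristics(lam=16.0, mu=20.0)  # same rho, both rates doubled
```

Comparing `base` with `doubled` shows the counts (number in system, line length) unchanged while both expected times are halved.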

Secretaries example. The following hypothetical example will indicate 
briefly how the model’s operating characteristics, such as those shown in Fig. 20.3, 
can be used to assist decision-making. Noah Peale, the manager of a law office, 
is deciding whether to employ two or four secretaries. To simplify matters, sup¬ 
pose he faces only two alternatives: to hire two experienced legal secretaries who 
can each type at the rate of 20 documents a day, or four novice secretaries who 
can each type at the rate of only 10 documents a day. On the average, there are 
36 documents to be typed per day. Copy these data on a piece of paper for easy 
reference, since this example will be used again later in the chapter. 

Suppose the manager uses, as an approximation, the simple model above. In 
the case of the two faster typists, each will be assigned documents at the rate 
= 18 per day, and in the case of the four slower typists, at the rate 
^ = 9 per day. Verify that in either case, traffic intensity p == .9. 
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Then the average number of documents on each girl’s desk is 9, according to 
(13), so that there are an average of 18 documents in the entire system having two 
secretaries and 36 in the system having four secretaries. The average time for a 
document to be in the system is y [= 1/(20 — 18)] day for the two faster secretaries 
as compared to 1 [= 1/(10 — 9)] day for the four slower secretaries, according to 
(20). By the same token, each experienced secretary will be busy over intervals 
that average one-half a day, whereas, each novice secretary will be busy over 
intervals that average a full day. 
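
The arithmetic of the secretaries example can be checked with a short script. This is only a sketch: the function name `mm1_metrics` is hypothetical, and the closed forms ρ/(1 - ρ) and 1/(μ - λ) are assumed to be what the text cites as (13) and (20), since they reproduce the quoted values of 9 documents and 1/(20 - 18) day.

```python
def mm1_metrics(lam, mu):
    """Steady-state averages for one server with Poisson input and
    exponential service. Assumed closed forms: rho/(1-rho) and 1/(mu-lam)."""
    if lam >= mu:
        raise ValueError("need lam < mu for statistical equilibrium")
    rho = lam / mu
    return {
        "rho": rho,
        "number_in_system": rho / (1 - rho),
        "time_in_system": 1 / (mu - lam),
        "time_in_line": rho / (mu - lam),
    }

total_arrivals = 36  # documents per day, from the example

# (number of secretaries, typing rate of each secretary)
for servers, mu in [(2, 20), (4, 10)]:
    per_desk = mm1_metrics(total_arrivals / servers, mu)
    print(
        f"{servers} secretaries: rho={per_desk['rho']:.2f}, "
        f"per desk={per_desk['number_in_system']:.2f}, "
        f"whole office={servers * per_desk['number_in_system']:.2f}, "
        f"days in system={per_desk['time_in_system']:.2f}"
    )
```

Each desk is treated as its own independent single-server system, exactly as the manager's approximation assumes.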

In what ways do you think this simple model approximates the real situation? 
Why would the average delay for a document tend to be smaller than (20) in 
actuality? If you were the office manager, what else would you take into account in 
deciding between these two alternatives? 


*Finite queue. So far no limitation has been imposed on the total number
of customers present in the system at any time. Now suppose that at most M
customers are allowed to be in the entire system, and therefore no more than
M - 1 persons are permitted to wait in line at any instant. (An example is a
one-pump gasoline station with a small driveway leading into a busy street.)
If a customer arrives when M persons are already in the system, then he is restricted
from entering, and is said to be lost from the system. (Consequently, this model
is sometimes termed a combined loss-delay system.) An important difference
between the previous model and this finite-queue model is that
here statistical equilibrium is reached for any value of the intensity ratio λ/μ.
Give a reason why.

The steady-state difference equations (7) and (8) still apply for n = 0, 1, . . .,
M - 1, but the equation for n = M is

(22)    $0 = \lambda P_{M-1} - \mu P_M.$

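The corresponding solution for n = 0, 1, . . ., M is

(23)    $P_n = \begin{cases} \dfrac{(1-\rho)\,\rho^n}{1-\rho^{M+1}} & \text{for } \lambda \neq \mu \\[1ex] \dfrac{1}{M+1} & \text{for } \lambda = \mu. \end{cases}$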


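Of course, when λ < μ and M → ∞, $P_n$ in (23) agrees with (12). By elementary
calculations it can be shown that

(24)    $E[\text{number of customers in system}] = \dfrac{\rho}{1-\rho}\left[\dfrac{1-(M+1)\rho^M+M\rho^{M+1}}{1-\rho^{M+1}}\right] = \dfrac{\rho}{1-\rho} - \dfrac{(M+1)\rho^{M+1}}{1-\rho^{M+1}}$    for λ ≠ μ.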


Observe when λ < μ, the expected number of customers in this system is smaller
than that for the previous case of an unlimited line length (13). Similarly, it can
be proved that

(25)    $E[\text{number of customers in system}] = \dfrac{M}{2}$    for λ = μ.


A delicate question arises in defining the amount of time a customer spends in 
the system (and waiting in line). If a customer arrives when there are already M 
persons in the system, he does not enter, and consequently he literally spends no 
time in the system. Hence average time spent in the system can be defined so as 
to refer either to all customers who arrive, regardless of whether they enter, or 
only to those customers who are permitted to enter. We adopt the latter, since 
in most situations the interest in delay time is only for those who actually do enter 
the system. So, given that a customer arriving at an arbitrary moment does join 
the system, and that the discipline is first come, first served, it can be shown that 


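(26)    $E[\text{time in line}] = \dfrac{\rho}{\mu(1-\rho)} - \dfrac{M\rho^M}{\mu(1-\rho^M)}$

and

        $E[\text{time in system}] = \dfrac{1}{\mu(1-\rho)} - \dfrac{M\rho^M}{\mu(1-\rho^M)}$    for λ ≠ μ,

(27)    $E[\text{time in system}] = E[\text{time in line}] + \dfrac{1}{\mu} = \dfrac{M+1}{2\mu}$    for λ = μ.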
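
The finite-queue results can also be obtained numerically, which is a convenient check on the closed forms. The sketch below builds the stationary probabilities of (23) and then computes the averages directly. Two points are assumptions rather than statements from the text: the function name `finite_queue` is hypothetical, and the time in the system for entering customers is obtained by dividing the mean number present by the rate λ(1 - P_M) at which customers actually enter.

```python
def finite_queue(lam, mu, M):
    """Stationary results for one exponential server when at most M
    customers may be present (arrivals finding M present are lost)."""
    rho = lam / mu
    if abs(rho - 1.0) < 1e-12:
        probs = [1.0 / (M + 1)] * (M + 1)          # case lam = mu
    else:
        norm = (1 - rho) / (1 - rho ** (M + 1))    # case lam != mu
        probs = [norm * rho ** n for n in range(M + 1)]
    mean_number = sum(n * p for n, p in enumerate(probs))
    entering_rate = lam * (1 - probs[M])           # assumption: only entrants count
    time_in_system = mean_number / entering_rate
    return {
        "probs": probs,
        "lost_fraction": probs[M],
        "number_in_system": mean_number,
        "time_in_system": time_in_system,
        "time_in_line": time_in_system - 1 / mu,
    }

# Hypothetical data: 8 arrivals per hour, 10 services per hour, room for 5.
result = finite_queue(8, 10, 5)
print(f"lost fraction    = {result['lost_fraction']:.4f}")
print(f"number in system = {result['number_in_system']:.4f}")
print(f"time in system   = {result['time_in_system']:.4f} hour")
```

Because the state space is finite, the same function also runs when λ exceeds μ, which illustrates why equilibrium is reached for any value of λ/μ.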


►By advanced analysis, it is also possible to derive dreadful-looking, but nevertheless 
computable, formulas for the transient solutions to the finite queue model. These are 
given here only so that you can see how intricate the mathematical form of the transient 
solution turns out to be. Let i denote the number of customers in the system at Time 0. 
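Then for any λ and μ,

(i)    $P_n(T \mid i) = P_n + \dfrac{2\rho^{(n-i)/2}}{M+1} \sum_{j=1}^{M} \dfrac{e^{-(\lambda+\mu)T + 2T\sqrt{\lambda\mu}\,\cos[j\pi/(M+1)]}}{1 - 2\sqrt{\rho}\,\cos\dfrac{j\pi}{M+1} + \rho} \left[\sin\dfrac{ij\pi}{M+1} - \sqrt{\rho}\,\sin\dfrac{(i+1)j\pi}{M+1}\right] \left[\sin\dfrac{nj\pi}{M+1} - \sqrt{\rho}\,\sin\dfrac{(n+1)j\pi}{M+1}\right],$

where the $P_n$ are given by (23). As T → ∞, the summation term approaches 0.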

For the case of M = 1, in which a customer is allowed to enter only if the server is free,
and no customer is in the system at Time 0, (i) simplifies to

(ii)    $P_0(T \mid i = 0) = \dfrac{\mu}{\lambda+\mu} + \dfrac{\lambda}{\lambda+\mu}\,e^{-(\lambda+\mu)T}, \qquad P_1(T \mid i = 0) = \dfrac{\lambda}{\lambda+\mu} - \dfrac{\lambda}{\lambda+\mu}\,e^{-(\lambda+\mu)T}$

for all λ and μ. Thus as T → ∞, the transient corrections to the stationary probabilities
die out exponentially. ◄


Arbitrary service distribution. Other versions of single-server
models in which the interarrival and service distributions are not exponentially
distributed will be explained in the advanced material in Appendix III. But it
is possible to state in simple terms the expected number of customers in the system
and the average time spent in the system for the case in which the interarrival
distribution is exponential and the service distribution is arbitrary (that is, for the
model M/G/1).

Let

(28)    $V = \text{Var}[\text{service time}] = \int_0^{\infty} \left(t - \dfrac{1}{\mu}\right)^2 g(t)\,dt,$

where, as before, 1/μ is the mean service time and g(t) is the service time density
function. Then


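(29)    $E[\text{number of customers in system}] = \rho + \dfrac{\lambda^2 V + \rho^2}{2(1-\rho)}$

(30)    $E[\text{line length}] = \dfrac{\lambda^2 V + \rho^2}{2(1-\rho)}$

(31)    $P[\text{server is idle}] = P_0 = 1 - \rho,$

where the intensity factor ρ = λ/μ < 1, as usual.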

Assuming that the system operates on a first come, first served basis, then 
the average time a customer spends in the system is given by 


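(32)    $E[\text{time in system}] = E[\text{time in line}] + E[\text{service time}] = \dfrac{1}{\lambda}\left[\dfrac{\lambda^2 V + \rho^2}{2(1-\rho)}\right] + \dfrac{1}{\mu} = \dfrac{1}{\lambda}\left[\rho + \dfrac{\lambda^2 V + \rho^2}{2(1-\rho)}\right].$

[Frequently, (29) and (32) are referred to as Pollaczek-Khintchine
formulas.]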

Verify that when g(t) is exponential, so that V = 1/μ², then (29), (30), and
(32) reduce to (13), (15), and (20), respectively. The above formulas also give
the expectations when service time is constant 1/μ, since then V = 0 (the resultant
model is designated as M/D/1). Observe that all the averages vary linearly with
V, and depend only on the arrival rate λ, the traffic intensity ρ, and the service
variance V, and not on any other parameters of the input and service distributions.
Note further the equilibrium relation

(33)    E[number of customers in system] = λ E[time in system].

Actually, (33) holds under much more general conditions, including many
multiple-server models. Give a verbal interpretation of (33) that makes the relation
plausible.

To check your understanding of these formulas, consider once again the above 
example of hiring either two experienced legal secretaries or four slower novice 
secretaries. Recall a rapid secretary can complete an average of 20 documents a 
day, whereas the corresponding figure for a slower secretary is 10. Since an average 
of 36 documents must be typed each day, the arrival rate for each rapid secretary 
is 18 as compared to 9 for a slower secretary. 

When both the interarrival and service time distributions were assumed to be
exponential, there were on the average, 9 documents on each girl’s desk. The
average time for a document to be in the system was one-half day for a faster
secretary as compared to one day for a slower secretary.

Now suppose instead of exponential service times, the service times are constant
(V = 0). Then verify in (29) that there will be an average of only 4.95 documents
on each girl’s desk and in (32) that the average time for a document to be in the
system is .275 of a day for the faster secretary and .55 of a day for the slower one.
Hence, elimination of service time variations nearly cuts the averages in half as
compared to the exponential case.

In contrast, suppose that the variance of service times is double that of the
exponential case: (1/200) for a faster typist and (1/50) for a slower typist. Check
that there will be an average of 13.05 documents on each girl’s desk, and that the
average time for a document to be in the system is .725 of a day for a faster
secretary, and 1.45 days for a slower one. Therefore, doubling the service time
variability as compared to the exponential case increases these averages by only about
50%. Work out the impact of tripling the variability.
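
The three variance cases above (and the tripled case left as an exercise) follow mechanically from (29) and (32). The sketch below evaluates them; the function name `mg1_metrics` and the `multiple` loop variable are hypothetical, and the service variance is expressed as a multiple of the exponential value 1/μ².

```python
def mg1_metrics(lam, mu, var):
    """Expected number in system and time in system for Poisson input
    and an arbitrary service distribution with mean 1/mu and variance var."""
    rho = lam / mu
    if rho >= 1:
        raise ValueError("need rho < 1")
    line_length = (lam ** 2 * var + rho ** 2) / (2 * (1 - rho))
    number_in_system = rho + line_length
    time_in_system = number_in_system / lam   # relation (33)
    return number_in_system, time_in_system

# (arrival rate per desk, service rate) for a faster and a slower secretary
desks = {"faster": (18, 20), "slower": (9, 10)}

# multiple = service variance as a multiple of the exponential variance 1/mu^2
for multiple in (0, 1, 2, 3):
    for name, (lam, mu) in desks.items():
        number, days = mg1_metrics(lam, mu, multiple / mu ** 2)
        print(f"variance x{multiple}, {name}: "
              f"{number:.2f} documents, {days:.3f} day(s)")
```

Both averages grow linearly in the variance multiple, which is the linearity in V noted in the text.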


► We outline a proof of (29) to illustrate an application of the proposition, “the expectation
of a sum of random variables is the sum of the expectations.” To keep the argument
simple, we assume that the system is examined at those instants in time when a service 
has just been completed and the associated customer has departed. If the system is in 
equilibrium, the formula in (29) also holds for an arbitrary moment in time, but a proof 
substantiating this assertion is much more difficult. 

Let n be the number of persons in the system just after a departure at Time T, and n' 
the number of persons just after the next departure at Time T'. Let j be the number of 
arrivals during this next interval of service. Then 


(i)    $n' = \begin{cases} j & \text{if } n = 0 \\ n - 1 + j & \text{if } n > 0. \end{cases}$


The equality for n = 0 follows because the first person to arrive after Time T is also the 
one who departs at Time T'; consequently, the customers remaining are just those who 
arrived during the interval of his service. The equality for n > 0 holds because the next 
service interval begins immediately at Time T, and hence the departure at Time T' is 
one of the n customers in the system at Time T.

Relation (i) can be expressed more conveniently by introducing the so-called Heaviside
Unit Function or Dirac Delta Function

(ii)    $\delta(n) = \begin{cases} 0 & \text{if } n = 0 \\ 1 & \text{if } n > 0, \end{cases}$

for then according to (i),

(iii)    $n' = n - \delta(n) + j$    for all n.


As a preliminary to finding E[n], recall that for a given length of service time t, the
number of arrivals j has a Poisson distribution with mean λt. Therefore

(iv)    $E[j] = \int_0^{\infty} E[j \mid t]\,g(t)\,dt = \int_0^{\infty} \lambda t\,g(t)\,dt = \dfrac{\lambda}{\mu} = \rho,$

where E[j | t] is the conditional expectation of j given t, and g(t) is the service time density.
Also

(v)    $E[j^2] = \int_0^{\infty} E[j^2 \mid t]\,g(t)\,dt = \int_0^{\infty} [\lambda t + (\lambda t)^2]\,g(t)\,dt = \dfrac{\lambda}{\mu} + \lambda^2\left\{\text{Var}[\text{service time}] + \dfrac{1}{\mu^2}\right\} = \rho + \lambda^2 V + \rho^2.$

It can be shown (see Appendix III.2) that

(vi)    $P_0 = 1 - \rho$ so that $E[\delta] = E[\delta^2] = \rho.$

Now square both sides of (iii) and take expectations:

(vii)    $E[(n')^2] = E[n^2] + E[\delta^2] + E[j^2] - 2E[n\delta(n)] + 2E[nj] - 2E[\delta(n)j]$
         $= E[n^2] + \rho + E[j^2] - 2E[n] + 2E[n]E[j] - 2\rho E[j],$

due to (vi), nδ(n) = n, and n and j being independent. Using the equilibrium condition
$E[(n')^2] = E[n^2]$ and (iv), we can write (vii) as


(viii)    $E[n] = \dfrac{E[j^2] + \rho - 2\rho^2}{2(1-\rho)}.$

Substituting (v) into (viii) and simplifying yields

(ix)    $E[n] = \rho + \dfrac{\lambda^2 V + \rho^2}{2(1-\rho)}.$


Go back over the argument and indicate where we applied the proposition, “the expectation
of a sum equals the sum of the expectations.”

The proof of (32) is usually based on first establishing (33), and then dividing E[n]
by λ. ◄

*Priority discipline. In many real situations, the queuing discipline is
not according to a first come, first served basis. Consider, for example, a young
executive who returns to his office after being away several days on a business trip.
He may find a number of telephone messages awaiting him, including one from
his boss. Most likely, he will return his boss’s call before any others.

Suppose the input to a queuing system can be classified into r distinct types, and
priorities from 1 to r are assigned to these types in decreasing order of importance,
so that Type 1 has the highest priority and Type r the lowest. As soon as the
server finishes with a customer, he always goes on to serve the next customer in
line who has the highest priority. (If several individuals in line have the same
priority number, then they are served on a first come, first served basis.) In some
situations, an additional assumption is made that the service of a customer being
waited on is preempted whenever a customer with a higher priority enters the
system. The results given below, however, are for the nonpreemptive discipline.

Assume that each customer Type k arrives according to a Poisson input process
with mean rate $\lambda_k$. Also suppose that the service distribution for each Type k is
arbitrary with density $g_k(t)$, where

(34)    $\dfrac{1}{\mu_k} = E[\text{service time} \mid \text{Type } k], \qquad V_k = \text{Var}[\text{service time} \mid \text{Type } k].$

Define

        $\sigma_k = \sum_{j=1}^{k} \dfrac{\lambda_j}{\mu_j}, \qquad \sigma_0 = 0,$

and assume $\sigma_r < 1$ to ensure the system reaches statistical equilibrium.

Consider the system at an instant in time just after a service of a customer has 
been completed. Then if a Type k customer just arrives, the expected time he 
waits in line is given by 


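(35)    $E[\text{time in line} \mid \text{Type } k] = \dfrac{\sum_{j=1}^{r} \lambda_j \left(V_j + \dfrac{1}{\mu_j^2}\right)}{2(1-\sigma_{k-1})(1-\sigma_k)}.$

Verify that when r = 1, the expression in (35) reduces to that for E[time in line]
in (32). Since $\lambda_k / \sum_j \lambda_j$ equals the probability that an entering customer is of Type k,
you can find the expected time in line for an arbitrary customer by calculating

(36)    $E[\text{time in line}] = \dfrac{\sum_{k=1}^{r} \lambda_k\,E[\text{time in line} \mid \text{Type } k]}{\sum_{k=1}^{r} \lambda_k}.$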

The following numerical example will illustrate the impact of imposing a
priority discipline. Suppose the arrival rate for the system is λ = 18 and the
service rate is μ = 20 without a priority discipline. Then, as you saw in Fig. 20.3
and for the example of the legal office typists, the expected time in line is .45. Suppose
instead that the arrivals are classified into Type 1 and Type 2, having the
respective arrival rates $\lambda_1$ and $\lambda_2$, where $\lambda_1 + \lambda_2 = \lambda = 18$. Assume that the
service rate μ applies to both types and that the service distribution is exponential.
Then verify that according to (35) and (36),

(37)    $E[\text{time in line} \mid \text{Type } 1] = \dfrac{2(9)\left[\dfrac{1}{20^2} + \dfrac{1}{20^2}\right]}{2(1)\left(1 - \dfrac{\lambda_1}{20}\right)} = \dfrac{.045}{1 - \dfrac{\lambda_1}{20}}$

        $E[\text{time in line} \mid \text{Type } 2] = \dfrac{E[\text{time in line} \mid \text{Type } 1]}{1 - \dfrac{\lambda_1}{20} - \dfrac{\lambda_2}{20}} = \dfrac{E[\text{time in line} \mid \text{Type } 1]}{.1}$

        $E[\text{time in line}] = .45.$


Hence, the average time in line for an arbitrary customer is still .45, but the
expected time is always less than .45 for a Type 1 customer and always greater
than .45 for a Type 2 customer. Further, the smaller the value of $\lambda_1$, the smaller
the expected wait in line for both Types 1 and 2, even though the average time in
line for an arbitrary customer remains unaffected. To illustrate, suppose $\lambda_1 = \lambda_2$
= 9; check in (37) that the expected time in line for a Type 1 customer is (9/110 =
.0818) and for a Type 2 customer is .8181. What happens when $\lambda_1 = 3$? When
$\lambda_1 = 15$?

In some instances, the several types of customers require different lengths of
service. Then assigning priorities according to the mean service rates (the highest
rate having the highest priority) will always reduce the overall average time in
line. To illustrate, consider again the above example with λ = 18 and μ = 20;
the associated utilization factor is (ρ = 18/20 = .9), and the expected time in line is .45.
Suppose in a two-priority-discipline scheme that $\lambda_1 = \lambda_2 = 9$, and $\mu_1 = 30$ and
$\mu_2 = 15$, so that the utilization factor [$\sigma_2 = \frac{9}{30} + \frac{9}{15} = .9$] remains the same.

Verify from (35) and (36) that the expected time in line for a Type 1 customer is
(1/14 = .0714) and for a Type 2 customer is (5/7 = .7142), which are smaller
values than the corresponding averages for $\mu_1 = \mu_2 = \mu = 20$. Now the
expected time in line for an arbitrary customer is .3928, which is smaller than .45
in the system without priorities or with $\mu_1 = \mu_2 = 20$.
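
The two priority examples can be reproduced with a few lines of code. The sketch below assumes the nonpreemptive formula in the form "sum over all types of λ_j(V_j + 1/μ_j²), divided by 2(1 - σ_{k-1})(1 - σ_k)", with σ_k the cumulative sum of λ_j/μ_j; that form is consistent with every number quoted above. The function name `priority_waits` is hypothetical.

```python
def priority_waits(lams, mus, variances):
    """Expected time in line for each type (index 0 = highest priority)
    under a nonpreemptive priority discipline, plus the overall average."""
    # Numerator: arrival-rate-weighted second moments of service time.
    numerator = sum(l * (v + 1 / m ** 2)
                    for l, m, v in zip(lams, mus, variances))
    sigma = [0.0]                       # sigma[0] = 0 by definition
    for l, m in zip(lams, mus):
        sigma.append(sigma[-1] + l / m)
    if sigma[-1] >= 1:
        raise ValueError("need sigma_r < 1 for statistical equilibrium")
    waits = [numerator / (2 * (1 - sigma[k - 1]) * (1 - sigma[k]))
             for k in range(1, len(lams) + 1)]
    overall = sum(l * w for l, w in zip(lams, waits)) / sum(lams)
    return waits, overall

# Equal service rates (exponential service, so V = 1/mu^2).
same, same_avg = priority_waits([9, 9], [20, 20], [1 / 400, 1 / 400])
# Faster-served type given priority: mu_1 = 30, mu_2 = 15.
mixed, mixed_avg = priority_waits([9, 9], [30, 15], [1 / 900, 1 / 225])

print("equal rates:", [round(w, 4) for w in same], round(same_avg, 4))
print("mixed rates:", [round(w, 4) for w in mixed], round(mixed_avg, 4))
```

Changing the first argument to `[3, 15]` or `[15, 3]` answers the questions about other splits of the arrival rate.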


20.6 MULTIPLE-SERVER MODEL WITH POISSON INPUT 
AND EXPONENTIAL SERVICE 


Certainly most queuing systems are composed of several servers, and hence 
waiting line models with more than one server are important. In this section, we 
generalize the results in the preceding section. The queuing discipline for this 
model is relatively simple as compared to most realistic situations. Nevertheless,
the results can be used at least as an initial approximation to the behavior
characteristics of more complex systems.

Let

(1)    S = number of servers

and assume that

(2)    interarrival density:  $\lambda e^{-\lambda t}$
       service time density for each server:  $\mu e^{-\mu t}$,

where the individual service times are all mutually independent (that is, for each
server as well as among the servers).

The appropriate difference equations [analogous to those for the single-server
case (7) and (8) in the preceding section] that determine the stationary
probabilities are:

(3)    $0 = \lambda P_{n-1} - (\lambda + n\mu)P_n + (n+1)\mu P_{n+1}$    for 1 ≤ n < S
       $0 = \lambda P_{n-1} - (\lambda + S\mu)P_n + S\mu P_{n+1}$    for n ≥ S.

(These equations are derived in the next section.) The associated solution is

(4)    $P_n = \dfrac{\rho^n}{n!} \cdot P_0$    for 0 ≤ n ≤ S
       $P_n = \dfrac{\rho^n}{S!\,S^{n-S}} \cdot P_0$    for n ≥ S,

where ρ = λ/μ and

(5)    $P_0 = \left[\sum_{n=0}^{S-1} \dfrac{\rho^n}{n!} + \dfrac{\rho^S}{S!\left(1 - \dfrac{\rho}{S}\right)}\right]^{-1}.$

In order for the system to have the stationary probability distribution given in
(4) and (5), you must assume that λ < μS (or ρ < S). What do you think would
happen if λ > μS? [Check that when S = 1, (4) and (5) simplify to (12) in Sec.
20.5.]

In the case of an unlimited number of servers (for example, a self-service
situation), the first formula for $P_n$ in (4) applies for all n. Consequently, $P_n$ is then a
Poisson distribution, with E[n] = ρ. (This model is designated as M/M/∞. When
S = ∞, the conclusion that $P_n$ is Poisson actually holds for any general service
distribution, that is, for the model M/G/∞.)

► The formulas in (4) also apply to the case with a finite limit M ≥ S on the total number
of customers in the entire system. Then n ≤ M and $P_0$ is determined from

(i)    $\sum_{n=0}^{M} P_n = 1.$

Further, the stationary probabilities (4) exist even if λ > μS. ◄


Operating characteristics. By elementary algebraic manipulations
involving $P_n$, you can easily derive most of the system’s operating characteristics.
A useful quantity for this purpose is the probability that all the servers are busy:

(6)    $P[\text{busy period}] = P[n \geq S] = \sum_{n=S}^{\infty} P_n = \dfrac{\rho^S \mu S}{S!\,(\mu S - \lambda)} \cdot P_0.$

Some authors call the quantity in (6) the probability of a delay; actually a better
term might be virtual delay. The value P[n ≥ S + 1] represents the fraction
of time that customers are actually being delayed in the system.

Define the symbols

(7)    $\text{Poisson}(x = j \mid \rho) = \dfrac{e^{-\rho}\rho^j}{j!}$  and  $\text{Poisson}(x < j \mid \rho) = \sum_{x=0}^{j-1} \dfrac{e^{-\rho}\rho^x}{x!}.$

You can compute (6) by the formula

(8)    $P[\text{busy period}] = \dfrac{\text{Poisson}(x = S \mid \rho)}{\text{Poisson}(x = S \mid \rho) + \left(1 - \dfrac{\rho}{S}\right)\text{Poisson}(x < S \mid \rho)}.$

Table 1 on page A76 provides values of P[busy period] for selected values of
ρ = λ/μ and S.

Then

(9)    $E[\text{line length}] = P[\text{busy period}] \cdot \dfrac{\rho}{S - \rho}$

(10)    $E[\text{number of customers in service}] = \sum_{n=0}^{S-1} nP_n + \sum_{n=S}^{\infty} SP_n = \rho$

(11)    $E[\text{number of customers in system}] = E[n] = E[\text{line length}] + E[\text{number of customers in service}].$

For this model,

(12)    $\lambda E[\text{time in system}] = E[\text{number of customers in system}],$

so that dividing (11) by λ yields

(13)    $E[\text{time in system}] = \dfrac{E[\text{line length}]}{\lambda} + \dfrac{1}{\mu},$

the first term on the right of (13) being E[time in line], and the second being
E[time in service].

A significant property about this system is that the distribution of departures
over an interval T is Poisson with mean λT per unit of time. Consider, then, a
large-scale queuing system composed of groups of servers in tandem, that is,
arranged so that the output of one group becomes the input to the next. If each
group can be described by a multiple-server model like that above, the system
averages can be easily found by first analyzing each group in isolation, assuming
the same input rate λ, and afterwards adding the results.


Sensitivity analysis. Once again, consider the example in Sec. 20.5 of
the manager of the legal office, Noah Peale, who is about to hire either two
experienced typists or four novice typists. Recall that the arrival rate for documents
is 36 per day, and each faster secretary can type 20 documents per day as compared
to 10 per day for the newer secretaries. You found in Sec. 20.5 that if each typist
operates separately and independently, then there will be an average of 9
documents on each girl’s desk, and a document will spend an average of half a day on
the desk of a faster secretary and a full day for a slower secretary. When the service
time is a constant (variance = V = 0), the average number of documents on
each girl’s desk falls to 4.95 and the expected time to .275 of a day for a faster
typist, and .55 of a day for a slower one.

Now suppose that the secretaries do not operate separately and independently. 
Instead let the system be composed of a “typing pool.” In this system, a single 
stack of documents waits to be typed; when any girl completes one document, she 
goes to the stack and takes the next one in the pile. Accordingly, using Table 1 
on page A76, you can find that 


(14)    $P[\text{busy period}] = \begin{cases} .7877 & \text{for } S = 4 \text{ and } \mu = 10 \\ .8526 & \text{for } S = 2 \text{ and } \mu = 20 \end{cases}$    (λ = 36).

Therefore, from (9), (10), and (11),

(15)    $E[\text{number of documents in system}] = \begin{cases} 7.0887 + 3.6 = 10.6887 & \text{for } S = 4 \text{ and } \mu = 10 \\ 7.6741 + 1.8 = 9.4741 & \text{for } S = 2 \text{ and } \mu = 20, \end{cases}$

and from (13),

(16)    $E[\text{time in system}] = \begin{cases} .1969 + .1 = .2969 & \text{for } S = 4 \text{ and } \mu = 10 \\ .2132 + .05 = .2632 & \text{for } S = 2 \text{ and } \mu = 20. \end{cases}$


Notice that although the probability of a document being delayed before typing 
is smaller when there are four secretaries, the corresponding slowness of the novice 
typists is more than offsetting in terms of the average number of documents as well 
as the average time spent in the system. Pooling the secretaries into a single system 
substantially reduces the average number of documents in the entire system and the 
average time each document spends in the system. In fact, the average time is 
almost that for the constant-service-time case. Also there is not much difference 
between having two faster or four slower typists, given that their combined service 
rate in the system is Sμ = 40.

The systems you have considered so far are the cases of S = 2 or S = 4 combined
with the possibilities of each secretary operating separately or all S secretaries
operating in a typing pool. Turn now to an intermediate possibility having two
secretarial pools, each operating independently of the other, and consisting of two
novice typists. Then, for each pool, λ = 36/2 = 18, and, for S = 2, μ = 10,

(17)    P[busy period] = .8526

(18)    E[number of documents in each pool] = 9.4741

(19)    E[time in system] = .4264 + .1 = .5264.

Consequently, the probability of a busy period and the average number of
documents in each pool are the same as for the case of the two faster secretaries; but
the two-pools arrangement results in twice as many documents on the average
being in the entire system. Further, the expected time a document spends in the
system doubles as compared to the case of S = 2 and μ = 20.
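
If the table of busy-period probabilities is not at hand, the pooled-system figures can be recomputed directly. The sketch below evaluates the stationary solution of Sec. 20.6 with ρ = λ/μ; the function name `mms_metrics` is hypothetical, and the line length is taken as P[busy period]·ρ/(S - ρ), a form that agrees with the figures quoted in (15). Expect agreement with the text to about three decimals only, since the printed values come from a rounded table.

```python
from math import factorial

def mms_metrics(lam, mu, S):
    """Operating characteristics for S parallel exponential servers
    fed by one Poisson stream (rho = lam/mu, requires rho < S)."""
    rho = lam / mu
    if rho >= S:
        raise ValueError("need lam < S * mu for statistical equilibrium")
    head = sum(rho ** n / factorial(n) for n in range(S))
    tail = rho ** S / (factorial(S) * (1 - rho / S))
    p0 = 1 / (head + tail)
    busy = tail * p0                       # probability all servers are busy
    line_length = busy * rho / (S - rho)
    return {
        "busy": busy,
        "line_length": line_length,
        "number_in_system": line_length + rho,
        "time_in_line": line_length / lam,
        "time_in_system": line_length / lam + 1 / mu,
    }

cases = {
    "one pool, 4 novices": (36, 10, 4),
    "one pool, 2 experienced": (36, 20, 2),
    "each of two pools, 2 novices": (18, 10, 2),
}
for label, (lam, mu, S) in cases.items():
    m = mms_metrics(lam, mu, S)
    print(f"{label}: busy={m['busy']:.4f}, "
          f"number={m['number_in_system']:.4f}, "
          f"time={m['time_in_system']:.4f}")
```

Setting S = 1 reduces the function to the single-server results of Sec. 20.5, which is a quick sanity check on the implementation.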

The various comparisons in the above illustration are valid for other numerical
values of the arrival rate λ, the service rate μ, and the number of servers S.
In particular, given values for λ and μS, the P[busy period] increases for smaller
values of S, as does the expected number waiting in line and average time spent
waiting. But the expected number in the entire system as well as average time
spent in the system decreases for smaller values of S.

Depending on the application, you may be able to affect the values of λ, μ,
and S and therefore select them optimally. If the associated objective function is
very simple, you may be able to derive an analytic formula or construct tables and 
nomographs to aid in the optimization search. You may also be able to apply 
certain advanced algorithmic techniques, similar to those in Chap. 18. 


*Priority discipline. The end of Sec. 20.5 shows how imposing a priority
discipline affects the average time in line for customers of different types. Recall
that Type k arrives according to a Poisson process with rate $\lambda_k$, and that the
priority numbers are assigned so that k = 1 is the highest and k = r the lowest.
Here we assume that the service time distribution is exponential with rate μ for
every type of customer. Define, as before,


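(20)    $\sigma_k = \sum_{j=1}^{k} \dfrac{\lambda_j}{\mu}, \qquad \sigma_0 = 0.$

Assume $\sigma_r < S$ to ensure the system reaches equilibrium. Then

(21)    $E[\text{time in line} \mid \text{Type } k] = \dfrac{S}{\mu\,(S - \sigma_{k-1})(S - \sigma_k)} \cdot P[\text{busy period}].$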

The average time in the system for an arbitrary arrival remains the same as in a
system without priorities having an arrival rate $\lambda = \sum_{k=1}^{r} \lambda_k$.

To illustrate, consider the example of λ = 36, S = 2, and μ = 20. In a system
without any priorities, you saw in (16) that the expected time in line is .2132.
Suppose now that there are two priorities with $\lambda_1 = \lambda_2 = 18$. Verify from (21)
that the expected time in line for Type 1 is (.8526/22 = .0387) and for Type 2 is
(.8526/2.2 = .3875). The smaller the value of $\lambda_1$, the smaller will be the average
wait for both types.
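
The two-priority figures for the typing pool can be checked numerically. The sketch below is built on an assumed reading of the priority formula, namely S·P[busy period] divided by μ(S - σ_{k-1})(S - σ_k) with σ_k the cumulative sum of λ_j/μ; this reading reproduces the divisors 22 and 2.2 used in the text. The function names are hypothetical.

```python
from math import factorial

def busy_probability(lam, mu, S):
    """Probability that all S exponential servers are busy (rho = lam/mu)."""
    rho = lam / mu
    if rho >= S:
        raise ValueError("need lam < S * mu")
    head = sum(rho ** n / factorial(n) for n in range(S))
    tail = rho ** S / (factorial(S) * (1 - rho / S))
    return tail / (head + tail)

def multi_server_priority_waits(lams, mu, S):
    """Expected time in line per type (index 0 = highest priority),
    nonpreemptive discipline, common exponential service rate mu."""
    busy = busy_probability(sum(lams), mu, S)
    waits = []
    sigma_prev = 0.0
    for lam_k in lams:
        sigma = sigma_prev + lam_k / mu
        waits.append(S * busy / (mu * (S - sigma_prev) * (S - sigma)))
        sigma_prev = sigma
    return waits

waits = multi_server_priority_waits([18, 18], 20, 2)
print([round(w, 4) for w in waits])

# Arrival-rate-weighted average over both types.
average = sum(l * w for l, w in zip([18, 18], waits)) / 36
print(round(average, 4))
```

The weighted average printed last can be compared with the no-priority time in line from (16) to confirm that the overall average is unaffected by the priority rule.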


* 20.7 BIRTH-AND-DEATH PROCESS 

Here we present a unified treatment of the special cases you have seen so far, 
and, at the same time, develop a model that assists in analyzing other cases as 
well. This model is termed a birth-and-death process. 

To begin, assume the probability transition laws for the system do not change 
over time; that is, postulate the process to be time-homogeneous. The system 
is considered to operate continuously without any specific starting point, and is 
characterized at Time t by the number of customers n in the system (n = 0, 1,
2, . . .). We will want to designate a particular instant of time, however, to serve
as a reference point. It is convenient to let t = 0 be this point, and to let i be the
number of customers in the system at Time 0.

Define

(1)    $P_{in}(T)$ = probability that the system contains n customers at Time T,
       given that it contains i customers at Time 0.

Because the system is time-homogeneous, the value of $P_{in}(h)$ for h > 0 not only
gives the probability defined in (1) at Time h, but also the probability that the
system contains n customers at Time T + h, given that it contains i customers at
Time T.

We postulate that the $P_{in}(T)$ satisfy what are known as the Chapman-Kolmogorov
equations for a time-homogeneous Markov process:

(2)    $P_{in}(T + h) = \sum_{m=0}^{\infty} P_{im}(T)\,P_{mn}(h)$    (h > 0)

for every i and n. You can interpret (2) as stating that the probability of the system
having n customers at Time T + h, given that it has i customers at Time 0,
can be found by adding, over all possible m, the joint probabilities that there are
m customers in the system at Time T, given the system has i customers at
Time 0, and the probability that there are n customers in the system at Time
T + h, given that the system has m customers at Time T. The so-called
Markov property relates simply to the assumption that the only relevant information
about the system at Time T is the number of customers in the system, namely, m,
and not the detailed history of the process leading up to Time T. In this sense, the
system is memoryless, and hence (2) represents a nontrivial assumption. [Think
of an example or two in which (2) does not apply.]

The birth-and-death process is a special case of (2). Assume that for a very
small interval of time, h > 0, we need only consider that at most one customer
enters or leaves the system, and that

(3)    $P_{m,m+1}(h) = (\lambda_m h)(1 - \mu_m h) \approx \lambda_m h$    for m = 0, 1, 2, . . .
       $P_{m,m-1}(h) = (1 - \lambda_m h)(\mu_m h) \approx \mu_m h$    for m = 1, 2, . . . ,

so that

(4)    $P_{mm}(h) = 1 - P_{m,m+1}(h) - P_{m,m-1}(h) \approx 1 - (\lambda_m + \mu_m)h$    for m = 1, 2, . . . ,
       $P_{00}(h) = 1 - P_{01}(h) \approx 1 - \lambda_0 h$    for m = 0,

where $\lambda_m \geq 0$ for all m, and $\mu_m > 0$ for m ≥ 1; all other $P_{mk}(h) = 0$. In other
words, for a very small interval of time, h, the probability that the number of
customers increases from m to m + 1 is approximately $\lambda_m h$, and that it decreases
from m to m - 1 is approximately $\mu_m h$. Given our approximation, the probability
that the number of customers remains at m can be written as approximately 1
minus these two values. The approximations can be made exact by adding terms
that have coefficients $h^k$, where k ≥ 2.

The values $\lambda_m$ and $\mu_m$ may be thought of as arrival and departure rates,
respectively, when there are m customers in the system. It is important that you
realize no assumption has been made about the number of servers or the queue
discipline in the system. The postulates refer only to probabilities of an arrival and
a departure.

Substituting (3) and (4) into (2) yields for n ≥ 1

(5)    $P_{in}(T+h) = P_{i,n-1}(T)P_{n-1,n}(h) + P_{in}(T)P_{nn}(h) + P_{i,n+1}(T)P_{n+1,n}(h)$
       $\approx P_{i,n-1}(T)\lambda_{n-1}h + P_{in}(T)[1 - (\lambda_n + \mu_n)h] + P_{i,n+1}(T)\mu_{n+1}h$    (h small).

Bringing the term $P_{in}(T)$ to the left-hand side of (5), dividing by h, and letting
h → 0, yields the so-called forward system of differential equations

(6)    $\dfrac{dP_{in}(T)}{dT} = \lambda_{n-1}P_{i,n-1}(T) - (\lambda_n + \mu_n)P_{in}(T) + \mu_{n+1}P_{i,n+1}(T).$

The relation (6) is exact because the terms neglected in (5) drop out as h → 0.
You can also use (6) to represent the case of n = 0 by letting $\lambda_{-1} = 0$ and $\mu_0 = 0$;
write the equation for n = 0. Thus (6) represents a system of equations
corresponding to n = 0, 1, 2, . . ., where i is taken as a given initial condition.

► The term “forward” refers to the fact that the equations were derived by looking at
the second interval of time (T, T + h) and using (3) and (4) to approximate the
transition probabilities in this small interval. You can also derive backward equations by
interchanging T and h on the right of (2), and then using (3) and (4) to approximate $P_{im}(h)$.
In the resulting differential equations for $dP_{in}(T)/dT$, the value of n at Time T is taken as
given, and i at Time 0 is considered as the variable, so that i = 0, 1, 2, . . . ◄

Review. Let us pause here to see that (6) really does include queuing
models that you have already examined. Consider the pure-birth model with
Poisson input, treated in Sec. 20.3. For that situation, $\lambda_n = \lambda$ and $\mu_n = 0$, for all
n. Assuming that i = 0 at Time 0, (6) becomes

(7)    $\dfrac{dP_{0n}(T)}{dT} = \lambda P_{0,n-1}(T) - \lambda P_{0n}(T).$

The solution for any i is

(8)    $P_{in}(T) = \dfrac{e^{-\lambda T}(\lambda T)^{n-i}}{(n-i)!}$    for n ≥ i,

which agrees with (10) and (xi) of Sec. 20.3 for i = 0.
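
One way to convince yourself that the forward equations reproduce the Poisson solution is to integrate them numerically. The sketch below is illustrative only: it truncates the state space at a hypothetical largest state N (allowing no births out of state N, so probability is conserved) and uses a classical fourth-order Runge-Kutta step, which is a numerical technique chosen here and not one discussed in the text. The names `forward_rhs` and `integrate_forward` are hypothetical.

```python
from math import exp, factorial

def forward_rhs(p, birth, death):
    """Right-hand side of the forward equations for states 0..N."""
    N = len(p) - 1
    out = []
    for n in range(N + 1):
        from_below = birth(n - 1) * p[n - 1] if n >= 1 else 0.0
        from_above = death(n + 1) * p[n + 1] if n < N else 0.0
        lam_n = birth(n) if n < N else 0.0   # truncation: no births out of N
        mu_n = death(n) if n >= 1 else 0.0   # no departures from an empty system
        out.append(from_below - (lam_n + mu_n) * p[n] + from_above)
    return out

def integrate_forward(i, T, birth, death, N=40, steps=1000):
    """Approximate P_in(T) for n = 0..N, starting with i customers."""
    p = [0.0] * (N + 1)
    p[i] = 1.0
    h = T / steps
    for _ in range(steps):
        k1 = forward_rhs(p, birth, death)
        k2 = forward_rhs([a + 0.5 * h * b for a, b in zip(p, k1)], birth, death)
        k3 = forward_rhs([a + 0.5 * h * b for a, b in zip(p, k2)], birth, death)
        k4 = forward_rhs([a + h * b for a, b in zip(p, k3)], birth, death)
        p = [a + h / 6 * (b + 2 * c + 2 * d + e)
             for a, b, c, d, e in zip(p, k1, k2, k3, k4)]
    return p

# Pure-birth check with hypothetical values lam = 2 and T = 1.5.
lam, T = 2.0, 1.5
numeric = integrate_forward(0, T, lambda n: lam, lambda n: 0.0)
for n in range(5):
    exact = exp(-lam * T) * (lam * T) ** n / factorial(n)
    print(n, round(numeric[n], 6), round(exact, 6))
```

Passing other `birth` and `death` functions gives transient probabilities for the remaining models reviewed in this section.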

Next, consider the pure-death model in Sec. 20.4 having a single server with
exponential service times and M customers in the system at Time 0. For that
situation, $\lambda_n = 0$, for all n, and $\mu_n = \mu$, for 1 ≤ n ≤ M, so that (6) becomes

(9)    $\dfrac{dP_{Mn}(T)}{dT} = -\mu P_{Mn}(T) + \mu P_{M,n+1}(T)$    for 1 ≤ n < M

(10)    $\dfrac{dP_{MM}(T)}{dT} = -\mu P_{MM}(T)$    for n = M,

which has the solution $P_{Mn}(T) = P_n(T)$ as given by (13) in Sec. 20.4. Suppose,
instead of there being a single server, each of the M customers can start service
at Time 0. Then $\mu_n = n\mu$ for 0 ≤ n ≤ M; check that (6) gives the same results
as (20) and (21) in Sec. 20.4.

Finally, consider the single-server model with Poisson input and exponential
service treated in Sec. 20.5. Verify that for $\lambda_n = \lambda$, for n ≥ 0, and $\mu_n = \mu$, for
n ≥ 1, the system (6) is the same as (4) and (5) in Sec. 20.5. Recall that the
subscript i in (6) designating the number of customers in the system at Time 0 was
suppressed in (4) and (5) of Sec. 20.5, since the emphasis there was on obtaining
stationary probabilities. [To obtain an explicit transient solution to (6), you must
use the fact that $P_{ii}(0) = 1$, and all other $P_{in}(0) = 0$.]

Steady-state solution. The forward equations (6) depend on both i
and T. Now suppose T → ∞, and assume that in the limit the system reaches
statistical equilibrium. For the limiting process, the influence of the value of i at
Time 0 vanishes, and hence the subscript i can once again be suppressed in the
probability notation. As in Sec. 20.5, you can solve for the equilibrium or stationary
values of $P_n$ by setting the time derivatives in (6) equal to 0:

(11)    $0 = \lambda_{n-1}P_{n-1} - (\lambda_n + \mu_n)P_n + \mu_{n+1}P_{n+1}$    for n ≥ 1

(12)    $0 = -\lambda_0 P_0 + \mu_1 P_1$    for n = 0.

Starting with n = 0, this system of difference equations can be solved recursively,
yielding the general result (assuming all $\mu_n > 0$, for n ≥ 1)

(13)    $P_n = \dfrac{\lambda_{n-1}}{\mu_n} \cdot \dfrac{\lambda_{n-2}}{\mu_{n-1}} \cdots \dfrac{\lambda_0}{\mu_1} \cdot P_0$    for n ≥ 1.

[If there is a number N such that $\mu_N = 0$ and $\mu_n > 0$ for n > N, and if i ≥ N
at Time 0, then $P_n = 0$, for n < N, and (13) is easily modified by adding N to all
the subscripts.]

The value of $P_0$ is determined from

(14)    $1 = P_0 + P_0\left(\dfrac{\lambda_0}{\mu_1} + \dfrac{\lambda_1\lambda_0}{\mu_2\mu_1} + \dfrac{\lambda_2\lambda_1\lambda_0}{\mu_3\mu_2\mu_1} + \cdots\right),$

where we postulate for all the models to be considered in this text that the infinite
series on the right of (14) converges to a finite value. For example, in the
single-server, Poisson input, exponential service model of Sec. 20.5,

(15)    $1 + \dfrac{\lambda_0}{\mu_1} + \dfrac{\lambda_1\lambda_0}{\mu_2\mu_1} + \cdots = 1 + \rho + \rho^2 + \cdots = \dfrac{1}{1-\rho}$    for ρ < 1,

so that

(16)    $P_0 = 1 - \rho.$

An easily verified sufficient condition ensuring that the scries on the right of (14) 
converges is that for n large enough, say n > all X„ < where 0 < r < 1. 
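The sufficiency of this condition can be sketched by comparison with a geometric series. In the lines below, $a_n$ is a shorthand introduced only for this sketch (it does not appear in the text) for the n-th term of the series in (14).

```latex
\begin{align*}
a_n &= \frac{\lambda_{n-1}\lambda_{n-2}\cdots\lambda_0}{\mu_n\mu_{n-1}\cdots\mu_1},
\qquad
\frac{a_{n+1}}{a_n} = \frac{\lambda_n}{\mu_{n+1}} \le r < 1 \quad \text{for } n \ge N, \\
\sum_{n=N}^{\infty} a_n &\le a_N \left(1 + r + r^2 + \cdots\right) = \frac{a_N}{1 - r} < \infty .
\end{align*}
```

Since only finitely many terms precede $a_N$, the whole series is finite.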

In the case of a finite limit M on the total number of persons in the system, you
can let $\lambda_n = 0$, for $n \ge M$ in (11) and (12), so that $P_n = 0$, for $n > M$ in (13).
Consequently, $P_0$ can always be easily determined from a finite-valued summation
analogous to (14).
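The recursion (13) together with the normalization (14) is easy to carry out numerically when the number of states is finite. The following is a minimal sketch under that assumption; the function name `stationary_probabilities` and its arguments are illustrative choices, not notation from the text.

```python
from typing import Callable, List


def stationary_probabilities(
    birth_rate: Callable[[int], float],
    death_rate: Callable[[int], float],
    max_state: int,
) -> List[float]:
    """Return [P_0, ..., P_max_state] for a birth-and-death process.

    The unnormalized weight of state n is the product in (13),
    (lambda_{n-1}/mu_n) * ... * (lambda_0/mu_1); P_0 then follows from
    the normalization (14) restricted to the states 0..max_state.
    """
    weights = [1.0]  # weight of state 0
    for n in range(1, max_state + 1):
        mu_n = death_rate(n)
        if mu_n <= 0:
            raise ValueError("formula (13) assumes mu_n > 0 for n >= 1")
        weights.append(weights[-1] * birth_rate(n - 1) / mu_n)
    total = sum(weights)
    return [w / total for w in weights]


# Single server with a finite limit of 3 persons in the system:
# lambda_n = 1 for n < 3 and 0 afterwards, mu_n = 2 (so rho = 0.5).
probs = stationary_probabilities(
    birth_rate=lambda n: 1.0 if n < 3 else 0.0,
    death_rate=lambda n: 2.0,
    max_state=5,
)
print([round(p, 4) for p in probs])
```

Because the birth rate is zero from state 3 onward, every state above 3 receives probability zero, as the paragraph above states.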


Examples. We now illustrate the usefulness of the birth-and-death process 
equations. 


Case i. Suppose customers arrive according to a Poisson process with input rate
$\lambda$, but each customer serves himself, according to an exponential distribution with
rate $\mu$. (This situation may be conceived of as a system with an infinite number
of servers, and is designated accordingly as M/M/∞.) Then $\lambda_n = \lambda$ and $\mu_n = n\mu$;
recall this value for $\mu_n$ was established in (17) of Sec. 20.4. Consequently,

(17)    $1 + \dfrac{\lambda_0}{\mu_1} + \dfrac{\lambda_1\lambda_0}{\mu_2\mu_1} + \dfrac{\lambda_2\lambda_1\lambda_0}{\mu_3\mu_2\mu_1} + \cdots = 1 + \rho + \dfrac{\rho^2}{2 \cdot 1} + \dfrac{\rho^3}{3 \cdot 2 \cdot 1} + \cdots = e^{\rho},$

so that

(18)    $P_0 = e^{-\rho}$ and $P_n = \dfrac{e^{-\rho}\rho^n}{n!}$   (Poisson distribution).

(Another system in which $\mu_n = n\mu$ consists of a single server that speeds up its
service rate proportionately as the number of persons in the system gets larger.)
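The Poisson result in (18) can be checked numerically. The sketch below builds the weights $\rho^n/n!$ from the rates $\lambda_n = \lambda$ and $\mu_n = n\mu$ and compares the normalized values with the Poisson probabilities. The truncation at 60 states and the value $\rho = 1.5$ are arbitrary choices made for this illustration.

```python
import math


def self_service_probabilities(rho: float, max_state: int) -> list:
    """Stationary probabilities for lambda_n = lambda and mu_n = n * mu.

    The weights rho^n / n! are built recursively and normalized over the
    states 0..max_state, a truncation of the infinite sum in (17).
    """
    weights = [1.0]
    for n in range(1, max_state + 1):
        # lambda_{n-1} / mu_n = lambda / (n * mu) = rho / n
        weights.append(weights[-1] * rho / n)
    total = sum(weights)
    return [w / total for w in weights]


rho = 1.5
probs = self_service_probabilities(rho, max_state=60)
poisson = [math.exp(-rho) * rho**n / math.factorial(n) for n in range(61)]
largest_gap = max(abs(p - q) for p, q in zip(probs, poisson))
print(f"P_0 = {probs[0]:.6f}, exp(-rho) = {math.exp(-rho):.6f}")
print(f"largest gap to the Poisson probabilities: {largest_gap:.2e}")
```

With this many states retained, the truncation error is far below floating-point precision, so the two sets of probabilities agree to rounding error.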

Case ii. Suppose there is a single exponential server so that $\mu_n = \mu$, for $n \ge 1$.
But customers who are about to enter the system are discouraged by a long line
in such a way that $\lambda_n = \lambda/(n + 1)$. Verify that this assumption also leads to the
results given in (17) and (18).
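One way to carry out the verification is sketched below, assuming $\rho = \lambda/\mu$ as elsewhere in this section and using k as a running index introduced only for this sketch.

```latex
\begin{align*}
\frac{\lambda_{k-1}}{\mu_k} &= \frac{\lambda / k}{\mu} = \frac{\rho}{k}
  \quad \text{for } k \ge 1, \\
P_n &= P_0 \prod_{k=1}^{n} \frac{\rho}{k} = \frac{\rho^n}{n!}\, P_0, \\
1 &= P_0 \sum_{n=0}^{\infty} \frac{\rho^n}{n!} = P_0\, e^{\rho}
  \;\Longrightarrow\; P_0 = e^{-\rho} .
\end{align*}
```

The ratios $\lambda_{k-1}/\mu_k$ are the same as in Case i, which is why the two models share the same stationary distribution.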


Case iii. Consider the multiple-server model in Sec. 20.6. There

(19)    $\lambda_n = \lambda$   for $n \ge 0$.

Applying the result (17) in Sec. 20.4, we may conclude that the departure rate
is $n\mu$ when there are n customers in the system, and $S\mu$ if $n \ge S$, so that

(20)    $\mu_n = \begin{cases} n\mu & \text{for } 1 \le n < S \\ S\mu & \text{for } n \ge S. \end{cases}$

You can verify that the difference equations (3) in Sec. 20.6 agree with (11) and
(12) above, as does the corresponding solution (4) and (5) with the formulas (13)
and (14) above.


Case iv. So far, all the examples assumed, in effect, an unlimited population 
from which arrivals occur. Now consider a model in which the source population 
is finite, and therefore the rate of arrivals decreases as the number in the system 
grows. An industrial situation of some practical significance that meets this description
is a factory containing a group of machines that occasionally break down
and require repair. (In queuing theory literature such a problem is sometimes 
referred to as a machine interference model.) 

Let 

M = number of machines and R = number of repairmen. 

Assume that if a machine is operative, then the time at which it breaks down is
described by an exponential probability distribution that has rate $\lambda$ and is independent
of the operating behavior of the other machines. Suppose further that
any nonfunctioning machine requires the attention of only a single repairman, and
that the service time probability distribution is exponential with the same rate $\mu$
for every repairman and every machine.

Suppose n machines have broken down, and hence $M - n$ are operative. Then
in a very small interval of time, $h > 0$, the approximate probability that one of the
operative machines breaks down can be calculated as usual:

(21)    $P[\text{one out of } M - n \text{ machines breaks down in an interval } h] = (M - n)\lambda h.$

By the same token,

(22)    $P[\text{one out of } n \text{ inoperative machines completes service in an interval } h] = n\mu h$   for $1 \le n \le R$.


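State why the probability in (22) is $R\mu h$, for $R < n \le M$. Then in terms of a
birth-and-death process, you can define

(23)    $\lambda_n = \begin{cases} (M - n)\lambda & \text{for } 0 \le n \le M \\ 0 & \text{for } n > M \end{cases}$

        $\mu_n = \begin{cases} n\mu & \text{for } 1 \le n \le R \\ R\mu & \text{for } R < n \le M. \end{cases}$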


Then the difference equations in (11) and (12) become

(24)    $0 = -M\lambda P_0 + \mu P_1$   for $n = 0$

        $0 = (M - n + 1)\lambda P_{n-1} - [(M - n)\lambda + n\mu]P_n + (n + 1)\mu P_{n+1}$   for $1 \le n < R$

        $0 = (M - n + 1)\lambda P_{n-1} - [(M - n)\lambda + R\mu]P_n + R\mu P_{n+1}$   for $R \le n \le M$,

and letting $\rho = \lambda/\mu$, (13) can be written as

(25)    $P_n = \dbinom{M}{n}\rho^n P_0$   for $0 \le n \le R$

        $P_n = \dbinom{M}{n}\dfrac{n!}{R!\,R^{n-R}}\,\rho^n P_0$   for $R < n \le M$,

where $\binom{M}{n} = M!/[n!(M - n)!]$ is the standard binomial coefficient and $P_0$ is computed
from the relation

(26)    $\sum_{n=0}^{M} P_n = 1.$

There are no simple expressions for the expected number of nonoperative
machines, but in any specific case, the value of E[n] can easily be computed directly
using the values for $P_n$ in (25).
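As an illustration of such a direct computation, the sketch below evaluates the weights in (25), normalizes them as in (26), and then sums $nP_n$ to obtain E[n]. The function name `machine_interference` and the numerical values (four machines, two repairmen, $\lambda = 1$, $\mu = 2$) are assumptions made for this example only.

```python
from math import comb, factorial


def machine_interference(M: int, R: int, lam: float, mu: float):
    """Equilibrium probabilities and E[n] for M machines and R repairmen.

    Returns (probs, expected_down), where probs[n] is the probability that
    n machines are inoperative and expected_down is the mean of n.
    """
    rho = lam / mu
    weights = []
    for n in range(M + 1):
        w = comb(M, n) * rho**n
        if n > R:
            # extra factor n! / (R! * R**(n - R)) once all repairmen are busy
            w *= factorial(n) / (factorial(R) * R ** (n - R))
        weights.append(w)
    total = sum(weights)  # normalization: the probabilities must sum to 1
    probs = [w / total for w in weights]
    expected_down = sum(n * p for n, p in enumerate(probs))
    return probs, expected_down


probs, expected_down = machine_interference(M=4, R=2, lam=1.0, mu=2.0)
print([round(p, 4) for p in probs])
print(round(expected_down, 4))
```

The same function can be rerun with different values of R to compare staffing levels, since only the weights for $n > R$ change.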


*Model interpretation. The probabilistic structure of each queuing
model in the preceding sections was described in terms of interarrival and service
time distributions. For the birth-and-death process model, no ambiguity arises
in interpreting the corresponding interarrival and service distribution assumptions
insofar as the model is applied to situations in which the input is a Poisson process,
and the service time for each server is an exponential distribution. But when $\lambda_n$
is defined, for example, as in Case ii above, there is no simple way of providing a
specified form for the interarrival distribution. The reason is that the time between
successive arrivals depends on the number of customers in the system, which in
turn depends on the service events. A similar remark holds with regard to defining
a service time distribution. (For example, such difficulty occurs when $\mu_n$ is specified,
as at the end of Case i, to be the service rate for a single server and dependent
on the number of persons in the system.) Therefore only in special cases are you
able to interpret the above birth-and-death model as being a combination of
Poisson input and exponential service times.

By the same token, it is difficult to work out the probability distribution and 
expectations of the time a customer spends in the system, even when the discipline 
is first come, first served. 

* 20.8 OTHER QUEUING MODELS 

Most of the results in this chapter have pertained to queuing models in which
arrivals are described by a Poisson input process and the service times for each
server are given by an exponential distribution. All of the models have assumed
there is a single waiting line, and customers are processed on a first come, first
served basis. (The situation of multiple waiting lines was treated approximately
as several systems operating side by side and independently of each other.)

It is not too difficult to extend such results to other cases containing minor
variations on the assumptions. For example, it is fairly easy to encompass the
phenomenon of balking (a tendency for a customer not to enter the system as the
line gets longer) and impatience or reneging (a tendency for a customer to leave
the system before being served). (The required approach makes only minor modifications
in the specification of $\lambda_n$ and $\mu_n$ in the birth-and-death model of Sec.
20.7.) In Appendix III we treat several examples in which the interarrival and
service times are other than exponentially distributed. (Some of the techniques
that will be illustrated can also be used to analyze cases in which there are bulk
arrivals or bulk service.) It is also possible to obtain usable formulas to handle
different, but simple, queuing disciplines, such as random service or last come,
first served. Such variations, however, are not considered in this book.

In any particular industrial application, it is the exception rather than the rule 
that any of these models precisely represents reality. Therefore you should use 
these models primarily for general insights and guidance in determining how 
sensitive a system’s operations may be to the relevant decision alternatives. If the 
resultant approximate analysis shows that the economic impact of a wrong decision
may be serious, you should then augment your analysis by applying computer
simulation to explore in further detail the specific questions of system design. 
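To suggest what such a simulation might look like in its simplest form, the sketch below simulates a birth-and-death process event by event and records the fraction of time spent in each state. It is only an illustration: the function name `simulate_birth_death`, the rates, the horizon, and the seed are all assumptions made here, and a study of a real system would need a far more detailed model.

```python
import random


def simulate_birth_death(birth_rate, death_rate, start, horizon, seed=0):
    """Simulate a birth-and-death process up to time `horizon`.

    Returns a dict mapping each visited state to the fraction of time
    the process spent in it.
    """
    rng = random.Random(seed)
    time_in_state = {}
    n, clock = start, 0.0
    while True:
        lam = birth_rate(n)
        mu = death_rate(n) if n > 0 else 0.0  # no departures from an empty system
        total_rate = lam + mu
        if total_rate > 0:
            holding = rng.expovariate(total_rate)  # exponential time to next event
        else:
            holding = float("inf")  # absorbing state
        if clock + holding >= horizon:
            time_in_state[n] = time_in_state.get(n, 0.0) + (horizon - clock)
            break
        time_in_state[n] = time_in_state.get(n, 0.0) + holding
        clock += holding
        # the next event is an arrival with probability lam / (lam + mu)
        n += 1 if rng.random() < lam / total_rate else -1
    return {state: t / horizon for state, t in sorted(time_in_state.items())}


# Single server with lambda = 1 and mu = 2, so rho = 0.5.
fractions = simulate_birth_death(
    birth_rate=lambda n: 1.0,
    death_rate=lambda n: 2.0,
    start=0,
    horizon=200_000.0,
    seed=1,
)
print(f"simulated fraction of time the system is empty: {fractions[0]:.3f}")
```

For these rates the analytical value of the idle fraction is $1 - \rho = 0.5$, which gives a check on the simulation before it is extended to features that the analytical models cannot handle.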


REVIEW EXERCISES 

1 Suggest two or three queuing situations that you think might warrant a scientific study.
State in which of the two categories described at the beginning of Sec. 20.1 your
suggested applications might fall. Explain the decisions that might be made as a
result of a scientific study, and how you would distinguish a good decision from a poor
one.

2 In each part below, suggest one or two queuing situations that illustrate the indicated
characteristic. (Be imaginative and do not use the examples already given in the
text.)

(a) Customers arriving singly. 

(b) Bulk arrivals. 

(c) Predetermined arrivals. 

(d) Completely random arrivals. 

(e) Unlimited source population. 

(f) Finite source population. 

(g) Customers that always join the system when they arrive. 

(h) Customers that may balk when they arrive. 

(i) First come, first served discipline. 

(j) Last come, first served discipline. 

(k) Random service discipline. 

(l) Special priority service discipline. 

(m) Customers that always remain in the line after joining it.

(n) Customers that may renege after joining a line. 

(o) Single waiting line.

(p) Parallel waiting lines. 

(q) Customers that may jockey for position. 

(r) Servers in parallel. 

(s) Servers in series (tandem).

(t) Service time mechanism that is independent of the line length. 

(u) Service time mechanism that is influenced by the line length. 

3 Consider three or four of the queuing situations that you suggested in exercise 2. 

(a) State what you think are the four or five most important operating characteristics 
to measure in assessing the performance of these systems. 

(b) Also suggest a variety of managerial options that can be selected in designing 
these systems to operate efficiently. (Be sure to define what you mean by 
“efficient.”) 

4 (a) Consider the negative exponential distribution given by (2) in Sec. 20.3. Show
that the mean equals $1/\lambda$ and the variance $1/\lambda^2$.

(b) Verify the mathematical manipulations that establish in (3) and (4) the memoryless
property of the exponential distribution.

(c) Verify that the sum of n independent values drawn from the same exponential 
density (2) has a gamma distribution, as given by (8). 

(d) Verify the probability statement in (9) by repeatedly applying integration by 
parts. 

(e) Use (10) to state the probability of one arrival in an interval of length T = h. 
Then apply a Taylor series expansion and verify the approximation in (7). 

(f) Show that the mean and variance of a Poisson distribution are as given in (11). 

(g) Verify the equality in (12) from (9) and (10). Give a verbal interpretation of (12). 

(h) Show that replacing $\lambda n$ in the Erlang distribution (14) by $\lambda$ yields the gamma
distribution (8).

5 Compare Figs. 20.2a and 20.2b and discuss how the shape and concentration of
probability mass shift with variations in n and $\lambda$.

6 Consider the pure-death model in Sec. 20.4. 

(a) Derive (12) by using reasoning analogous to that employed in obtaining (11). 

(b) Show that the probabilities (13) and (14) satisfy the differential equations (11) 
and (12). 

(c) Determine how $P_n(T)$ behaves as T grows without bound, for n = 1, 2, ..., M.
For n = 0. [Suggestion: use the example M = 2 and $\mu = 1$, and draw a graph
of each $P_n(T)$.]

*7 Consider the self-service model in Sec. 20.4.

(a) Justify in detail the approximations in (17) and (18). 

(b) Justify in detail the approximation in (19), and show how to obtain (20). 

(c) Verify that the binomial probability distribution (22) satisfies the differential 
equations (20) and (21). 

(d) Determine how $P_n(T)$ behaves as T grows without bound, for n = 0, 1, ..., M.
[Suggestion: use the example M = 2 and $\mu = 1$, and draw a graph of each
$P_n(T)$.]

(e) Suppose M = 1. Compare the probabilities in (13) and (14) with those in (22)
for n = 0, 1.

8 Consider the single-server model with Poisson input and exponential service, discussed 
in Sec. 20.5. 

(a) Perform the detailed algebraic steps to derive (4) from (3). Also show how to 
obtain (5). 

*(b) Devise a situation in which each $P_n(T)$ approaches a limiting value as $T \to \infty$
but E[n] is not finite. Comment on why the term statistical equilibrium is not
applied to such a case. [Hint: consider $\lambda > \mu$.]

(c) Give a verbal interpretation of the assumption in (6). 

(d) Verify that the stationary probabilities in (10) satisfy the difference equations 
in (7). 

*(e) Perform the detailed algebraic steps to derive the formulas E[n], Var[n], and
$P[n \ge N]$ shown in (13) for the geometric distribution.

(f) Provide the detailed justification and show the intermediate algebraic steps for
each equality in (15) that gives E[line length].

(g) Perform the detailed algebraic steps to derive the formula for the waiting time
density h(w) in (19).

(h) Perform the detailed algebraic steps to derive the formulas for E[time in line] in
(21).

9 Consider the secretaries example in Sec. 20.5. How does this simple model approximate
the real situation? Why would the average delay for a document tend to be
smaller in actuality than the delay given by (20)? If you were the office manager,
what else would you take into account in deciding between the two alternatives?

10 Consider the secretaries example in Sec. 20.5. In each part below, determine the
average number of documents on each girl’s desk and in the entire system. Calculate
the average time for a document to be in the system and the average length of the
busy periods.

(a) S = 1 and $\mu = 40$.

(b) S = 10 and $\mu = 4$.

11 Suppose in the secretaries example in Sec. 20.5, there are an average of 38 documents 
to be typed per day. For each alternative in the text, determine the average number 
of documents on each girl’s desk and in the entire system. Calculate the average time 
for a document to be in the system and the average length of the busy periods. 

*12 Consider the finite-queue model discussed in Sec. 20.5.

(a) Give a plausible explanation of why statistical equilibrium is reached for any
value of the intensity ratio $\lambda/\mu$.

(b) Verify that when $\lambda < \mu$ and $M \to \infty$, $P_n$ in (23) agrees with (12).

*(c) Derive E[number of customers in system] using the probabilities $P_n$ for $\lambda \ne \mu$,
as given in (23). Do the same for $\lambda = \mu$.

(d) For $\rho = .7, .8, .9, .95$, find the corresponding value of M such that E[number of
customers in system] in (24) is within .01 of the same operating characteristic as
given by (13).

(e) Determine the limiting value of E[number of customers in system] in (24) as $\rho$
grows arbitrarily large.

(f) Show that $P_n$ for $\lambda = \mu$ in (23) is the limiting value of $P_n$ for $\lambda \ne \mu$ as $\rho$ approaches 1.

(g) Let M = 3. Calculate E[number of customers in system] for $\rho = 1, 2, 5, 10$.

13 Consider the single-server model with Poisson input and arbitrary service distribution, 
as described in Sec. 20.5. 

(a) Verify that when the service time is exponential, then (29), (30), and (32) reduce 
to (13), (15), and (20), respectively. 

(b) Assume $\mu = 10$. Calculate the operating characteristics (29), (30), and (32),
for $\rho = .7, .8, .9$, and $.95$ with V = 0.

(c) Rework part (b) with V = .01. 

(d) Rework part (b) with V = A. 

(e) Rework part (b) with V = 1.


*14 Consider the single-server model with priority discipline, as discussed at the end of
Sec. 20.5.

(a) Verify that when r = 1, (35) reduces to E[time in line] in (32).

(b) Verify the sample calculations in (37).

(c) Suppose $\lambda_1 = \lambda_2 = 9$ in (37). Find the expected time in line for Types 1 and 2
customers. Determine what happens when $\lambda_1 = 3$ and when $\lambda_1 = 15$ (remember
$\lambda_1 + \lambda_2 = 18$).

(d) Suppose $\lambda_1 = \lambda_2 = 9$, $\mu_1 = 30$, $\mu_2 = 15$, and the service distributions are exponential.
Determine the expected time in line for Types 1 and 2 customers.

(e) Suppose $\lambda_1 = \lambda_2 = 9$, $\mu_1 = \mu_2 = 20$, and let $V_1 = 0$ and $V_2 = 1/20^2$. Determine
the expected time in line for Types 1 and 2 customers.

(f) Rework part (e), except let $V_1 = 1/20^2$ and $V_2 = 0$.

(g) Suppose $\lambda_1 = \lambda_2 = 9$, $\mu_1 = 30$, $\mu_2 = 15$, and let $V_1 = 0$ and $V_2 = 1/15^2$.
Determine the expected time in line for Types 1 and 2 customers.

(h) Rework part (g), except let $V_1 = 1/30^2$ and $V_2 = 0$.

(i) Suppose there are three classifications, where $\lambda_1 = \lambda_2 = \lambda_3 = 6$. Assume that
the service rate $\mu = 20$ applies to all three types and that the service distributions
are exponential. Determine the expected time in line for each of the three types of
customers.

(j) Rework part (i), except let /ii = 33i and fJ >2 = 

(k) Rework part (i), except let $\mu_1 = 40$, $\mu_2 = 20$, and $\mu_3 = 13\tfrac{1}{3}$.

15 Consider the multiple-server model in Sec. 20.6. 

(a) Verify that the probabilities in (4) and (5) satisfy the difference equations (3). 

(b) Check that when S = 1, the probabilities in (4) and (5) reduce to the formula in 
(12) of Sec. 20.5. 

(c) Give the intermediate algebraic steps to justify the statement that when the number
of servers is unlimited (self-service), then $P_n$ in (4) is a Poisson distribution.
(Hint: use a series expansion for $e^{\rho}$.)

16 Consider the multiple-server model in Sec. 20.6. In each part below, show the detailed
algebraic steps justifying the formula that is cited in the text.

(a) P[busy period] in (6).

(b) P[busy period] in (8).

(c) E[line length] in (9).

(d) E[number of servers] in (10).

17 Consider the secretaries example where the girls work in a typing pool, as in Sec. 20.6.
Suppose the arrival rate is $\lambda = 38$. Calculate the P[busy period], E[number of documents
in system], and E[time in system], for each case below.

(a) S = 4 and $\mu = 10$.

(b) S = 2 and $\mu = 20$.

(c) S = 1 and $\mu = 40$.

(d) S = 10 and $\mu = 4$.

18 Consider the multiple-server model with priority discipline discussed at the end of 
Sec. 20.6. In each part below, apply (21) to find the expected time in line for each 
Type k. 

(a) Let k = 2, $\lambda_1 = \lambda_2 = 18$, $\mu = 10$, and S = 4.

(b) Rework part (a), except let S = 1 and $\mu = 40$.

(c) Rework part (a), except let S = 10 and $\mu = 4$.

(d) Let k = 3, $\lambda_1 = \lambda_2 = \lambda_3 = 6$, $\mu = 20$, and S = 2.

19 Consider the birth-and-death process in Sec. 20.7. 

(a) Substitute the approximate probabilities in (3) and (4) into the Chapman-Kolmogorov
equations (2) and verify the result in (5). Also exhibit the intermediate
steps yielding the differential equations (6).

(b) Write the equation in (6) corresponding to n = 0.

(c) Examine the pure-birth model for an arbitrary i, and write the exact expression
for (6). [For i = 0, (6) is given by (7).] Show that (8) satisfies the differential
equations.

(d) Show that $P_{Mn}(T) = P_n(T)$, as given by (13) in Sec. 20.4, is the solution to
(9) and (10) for the pure-death model with a single server.

(e) Examine the pure-death model where all M customers start service at Time 0.
Check that (6) gives the same results as (20) and (21) in Sec. 20.4.

(f) Examine the single-server model with Poisson input and exponential service in
Sec. 20.5. Verify that the system (6) is the same as (4) and (5) in Sec. 20.5.

(g) Check that the stationary probabilities in (13) satisfy the difference equations
(11) and (12).

(h) Explain why $P_n = 0$ for $n < N$, where N is a number such that $\mu_N = 0$ and
$\mu_n > 0$, for $n > N$, assuming that $i \ge N$ at Time 0.

(i) Verify that if there is an N such that $\lambda_n \le r\mu_{n+1}$, where $0 < r < 1$, for $n \ge N$,
then the series on the right of (14) converges.

*20 Consider the birth-and-death process discussed in Sec. 20.7. 

(a) Examine Case ii and show that the probabilities are given by (17) and (18).

(b) Examine Case iii and show that the difference equations (3) in Sec. 20.6 agree
with (11) and (12), as does the solution (4) and (5) with the formulas (13) and
(14).

(c) Examine Case iv, the machine interference model. Explain why the probability
in (22) is $R\mu h$, for $R < n \le M$. Verify that substituting (23) into the difference
equations (11) and (12) yields (24). Also verify that (13) leads to the solution (25).


21 Explain your understanding of the following terms:


operating characteristics 
input process 
bulk arrivals 
balk 

reneging (impatience) 
queue discipline 
first come, first served 
last come, first served 
random service order 
servers (or channels) 
jockey 

servers in parallel 

servers in series (tandem)

systems synthesis 

birth (arrival) process 

death process 

renewal process 

recurrent events 

completely random arrivals 

stationary (homogeneous) time process 

memoryless property 

interarrival time

exponential arrivals (Poisson input, 
Markovian input) 


periodic (regular, constant) arrivals 
service time 
exponential service 
pure-death model 
service rate $\mu$
*self-service model
*Kendall's notation
single-server model 
multiple-server model 
transient solution 
statistical equilibrium 
equilibrium (stationary) probabilities 
traffic intensity $\rho$
utilization factor $\rho$
difference equations 

*finite-queue model
*loss-delay system
Pollaczek-Khintchine formulas
*priority discipline
*preemptive priority
busy period
virtual delay
*birth-and-death process
*Chapman-Kolmogorov equations
pure-birth model
arrival rate $\lambda$
Erlang arrivals
*time-homogeneous Markov process
*forward system of differential equations
*machine interference model.


FORMULATION AND COMPUTATIONAL EXERCISES 

22 Suppose an item held in inventory is replenished according to the rule “when the 
inventory level falls below 1, order 10 items.” Assume that the current level of inventory 
is 3, that items are demanded one at a time, and that the time between demands is 
exponentially distributed with a mean rate of two per week. 

(a) What is the probability distribution of the number of items demanded in a two-week
period? (Give both the form of the distribution and the values of its parameters.)

(b) What is the probability that no replenishment will take place during the coming 
week? During the coming two weeks? (Write the detailed formulas for these 
probabilities; calculate the numerical values if you have access to the appropriate 
tables.) 

23 Consider a production process with two machines, A and B, in tandem. The output of
items from the first machine, A, is Poisson, with the rate of 10 per hour. The second
machine, B, processes each item with exponential service time at the rate of 12 per
hour. There is a congestion problem when more than two items are waiting to be
processed on Machine B.

(a) Calculate the fraction of time that the system is congested.

*(b) Suppose Machine A is turned off whenever more than two items are waiting to
be processed by Machine B. Explain why your answer in part (a) overestimates
the fraction of time that Machine A is turned off. Calculate the correct probability.
(Be sure to indicate the specific queuing model you use.)

*24 The P. Kaboo Camera Store tries to keep in stock only one unit of a very expensive 
lens. When the lens is sold, the store reorders a new one. Assume that, on the average, 
the store has one customer in four weeks who wants to buy such a lens; if the lens is 
out of stock, the customer goes elsewhere. What is the probability of the lens being out 
of stock if the average replenishment time is one week? Two weeks? Four weeks? 
(Assume the customer interarrival and replenishment time probability distributions 
are exponential.) 

25 Consider a single-server model with Poisson input and a customer arrival rate of 30 
per hour. Currently, service is provided by a mechanism that takes exactly 1.5 minutes. 
Suppose the system can instead be served by a mechanism that has an exponential 
service time distribution. What must be the mean service time for this mechanism in 
order to ensure the same average time a customer spends in the system? The same
average number of customers in the system? 

26 Let the value of M denote the “degree of service” offered by a single server. The value
M = 1 represents a minimal degree of service, and larger values of M represent more
thorough and longer-lasting service. Suppose that the form of the service time distribution
is gamma, given by (16) in Sec. 20.4, where $\mu = 1$. Note that both the mean
and variance of the service time distribution depend linearly on the “degree of service”
parameter M. Suppose the arrival rate is $\lambda = .8$, for M = 1, and assume that for
M > 1, the arrival rate decreases; interarrival times are exponentially distributed.
The reason for this effect on the arrival rate is that as the degree of service increases,
such service is required less often. In each part below, determine the arrival rate such
that the expected number of customers in the system is the same as for M = 1 and
$\lambda = .8$.

(a) M = 2.

(b) M = 4.

(c) M = 8.

27 The Schmerz Auto Rental Company must choose between operating one of two types
of service shops for maintaining its cars. It estimates that cars will arrive at the maintenance
facility on the average of one every 40 minutes (assume this rate is virtually
independent of which facility is chosen). In the first type of shop, there are dual
facilities operating in parallel; each facility can service a car in 30 minutes on the
average. In the second type, there is a single facility, but it can service a car in 15
minutes on the average. Assume that each minute a car must spend in the shop reduces
contribution to profit by one monetary unit. Let $C_1$ and $C_2$ be the cost per minute of
operating the first and second type facilities, respectively. Find the value of $C_1/C_2$
such that Schmerz would be indifferent between operating the two shops.

28 Dusty Page, the Head Librarian of the Free Kowt College, is considering staffing an 
extra checkout desk in the main library. Up to now, two desks have been in operation. 
On the average, one student arrived every two minutes, and the total checkout time 
(that is, time in line plus in service) averaged a minute. The College has just become 
co-educational, and with the influx of female students, Page expects an increased birth
rate in the system. Determine the value of the arrival rate such that three checkout 
desks still ensure an average total checkout time of a minute. Assume that the multiple- 
server model in Sec. 20.6 can be used to yield good approximations. 

29 The Gabby Company has leased three WATS (“wide area telephone service”) lines 
to enable its executives to make toll-free long distance calls. Let c be the cost per hour 
per WATS line. Suppose that the frequency of calls per hour is described by a Poisson 
distribution with mean 10, and the lengths of the calls are exponentially distributed 
with mean 15 minutes. Let w denote the cost per hour of executive waiting time (that 
is, time waiting until one of the lines is free). Let c = 1 and determine the range of values 
for w that will make the decision to lease three WATS lines optimal.

30 The manager of the Hippon Drug Store is remodeling his facility and is considering 
three ways of organizing his branded-items business. The first way is to employ a fast 
clerk to wait on all customers. Assume such a clerk serves customers exponentially at 
an average rate of two minutes per customer. The second way is to employ two 
moderately fast clerks to wait on all customers. Assume each serves customers exponentially
at an average rate of four minutes per customer. The third way is to use a
self-service system in which each customer waits on himself. Self-service, at an average
rate of six minutes, is slower than clerk service. The manager wants to calculate the
average number of customers in the store, the average time each spends in the store,
and the average time each spends waiting for service, under each way of organizing.
Assume that customers arrive completely randomly, one at a time, at the rate of
15 per hour. Calculate the operating characteristics of interest to the manager. Also
determine these characteristics assuming that the fast clerk could service each customer
in exactly two minutes' time.

31 The Paine N. B. Heind Infirmary of Ye Owl College has noticed that each day two 
students, on the average, arrive in need of hospital care. Such a student requires, on 
the average, a three-day confinement to bed. Assume that interarrival times and 
service times are both exponentially distributed.

(a) What is the probability distribution of the number of students occupying hospital 
beds? What is the associated average number? (Be sure to indicate which model 
in the text you are applying to derive your answer.) 

(b) Suppose there are five students in the hospital at the start of a day. How many of 
these students, on the average, will remain in the hospital at the same time 
tomorrow? Three days from today? Seven days from today? 

*32 In each part below, apply (13) and (14) in the birth-and-death model in Sec. 20.7
to find the equilibrium probabilities $P_n$.

(a) Xq = 1, X., = X 2 = = 0, for a > 3; = 1, for all n ^ 1. 

(b) Xq = Xj = Ij ~ fo*^ w ^ 3; fin = 1> for all w > 1. 

(c) $\lambda_n = 1$, for n = 0, 1, 2, and $\lambda_n = 0$, for $n \ge 3$; $\mu_1 = 1$, $\mu_2 = 2$, $\mu_3 = 4$.

(d) $\lambda_n = 1$, for n = 0, 1, 2, and $\lambda_n = 0$, for $n \ge 3$; $\mu_1 = 4$, $\mu_2 = 2$, $\mu_3 = 1$.

*33 The Nockov and Weldon Machine Shop operates three delicate pieces of equipment,
each of which breaks down on the average of once every 2.5 hours. A single repairman
is needed to service a broken machine and requires 45 minutes, on the average, to
repair the equipment. Assume the machine interference model (Case iv) in Sec. 20.7
applies. In each part below, calculate the equilibrium probability that all the machines
are in good working order, and the expected number of machines that are inoperative.

(a) The shop employs only one repairman.

(b) The shop employs two repairmen.

(c) Suggest the various costs that you would measure to determine an optimal choice
between one and two repairmen. Indicate the appropriate operating characteristics
you would calculate to ascertain the expected cost for each alternative, and
display a formula for the expected cost.


MIND-EXPANDING EXERCISES 

Exercises 34 through 39 refer to the birth-and-death model in Sec. 20.7. In
each exercise, define the appropriate values for $\lambda_n$ and $\mu_n$, and exhibit the linear
difference equations that yield the equilibrium probabilities. You will find it
necessary in a few of these exercises to adapt the logic used to develop the difference
equations in Sec. 20.7 in order to accommodate a more complex description
of the states of the system.

34 Consider a single-server model with exponential interarrival times and exponential
service. Suppose, however, that there are bulk arrivals, that is, when an arrival event
occurs, r customers enter the system with probability $q_r$, $r = 1, 2, \ldots, R$, where R is
the largest group that ever enters.

35 Consider the following single-server system with Poisson input and exponential service.
When a customer arrives at the system and sees n persons present, he joins the system
with probability $r_n$, and balks with probability $1 - r_n$. Also, when n persons are in the
system, the rate of departure due to reneging is $g_n$ (that is, in a relatively small interval
of time, $h > 0$, the probability of one customer in line departing prior to service is
$g_n h$); note $g_0 = g_1 = 0$.

36 Consider Case iv in Sec. 20.7 and suppose M = R.

(a) Check that the formula for $P_n$ in (25) for $0 \le n \le R = M$ is appropriate.

(b) Show how $P_n$ can be written as a binomial distribution with parameter p. Exhibit
a formula for the mean number of inoperative machines that explicitly indicates
the dependence on the failure and service rates.

37 Consider the following single-server preemptive priority system. Two types of customers 
arrive at the system: Type 1 at the rate λ_1 and Type 2 at the rate λ_2. Type 1 has 
complete priority over Type 2. Service will be terminated on a Type 2 customer 
(implying there are no Type 1 customers waiting in the system) when a Type 1 
customer arrives; service will then commence immediately on the Type 1 customer. 
Service on the “bumped” customer resumes after the system no longer contains any 
Type 1 customers. Service rates for Types 1 and 2 are μ_1 and μ_2, respectively. (Hint: 
let the state of the system be indicated by both the number of Type 1 and the number 
of Type 2 customers in the system.) 

38 Consider a two-server model with Poisson input (rate λ) and exponential output, 
except let the rate for the first server be μ_1 and for the second server be μ_2, where 

^ (Suggestion: when the number of customers n in the system equals 0 or is 
greater than 2, let the probability P_n have its usual meaning; when the number of 
customers equals 1, let P_01 denote the state that the first server is free but the second is 
busy, and let P_10 denote the opposite condition.) 

39 Consider the following system of two channels in tandem (series). Arrivals to Channel 1 
are at the rate λ (Poisson input distribution). Service rates at Channels 1 and 2 
are μ_1 and μ_2, respectively, and service times are exponentially distributed. When a 
customer finishes service in Channel 1, he then requires service at Channel 2. If 
Channel 2 is occupied with another customer, the customer in Channel 1 has to wait 
where he is, and hence blocks the service of any customer that is waiting in the line in 
front of Channel 1. (Suggestion: designate each possible state of the system as follows. 
Let 0 denote no customers in the entire system and let 1 denote a customer being 
served in Channel 2 and no other customers in the entire system. Define i, for i = 
1, 2, . . . , as the number of customers waiting in line plus those in service at Channel 1. 
Let i, 0 denote the level i at Channel 1 and no one being served at Channel 2; let i, 1 
denote the level i at Channel 1 and customers being served at both Channels 1 and 2; 
let i, 2 denote the level i at Channel 1, and a customer being blocked in Channel 1 
due to a customer being served in Channel 2.) 

Management Systems 


21.1 WHEN ALL ELSE FAILS . . . 

If you have patiently proceeded from one chapter to the next, you have studied 
a perhaps bewildering variety of operations research models and techniques. 
Students often ask, in effect, “Is this arsenal of tools powerful enough to encompass 
all the important managerial decision problems requiring data analysis?” The 
answer is no, not by a long shot. To see why, reflect on the kinds of problems that 
you know can be effectively analyzed by the operations research tools presented 
thus far. As you become aware of gaps, you will see more clearly why so many 
significant types of decision-analysis problems are still not solvable by these 
approaches, and therefore must be attacked in other ways. In the next few para¬ 
graphs we summarize the limitations as well as the strengths of operations research 
tools including linear and dynamic programming, inventory and queuing theory. 

You have already learned that linear programming models are most successful 
in aiding the planning efforts of corporate enterprises. If the planning horizon 
is 10 years or longer, a corresponding multiperiod linear programming model 
typically deals only with annualized data. The effects of the resultant plan on 
week-to-week and month-to-month operations are left implicit. Analogously, if the 
planning horizon is much shorter, say three months to a year, the corresponding 
model usually ignores the day-to-day and week-to-week variations. Thus, for the 
most part, linear programming analysis falls short of prescribing rules that 
translate a recommended plan into operating procedures for time spans shorter 
than the periods in the model. 

A second limitation of linear programming analysis relates to uncertainty about 
the future. Imprecise forecasts to some degree exist in all planning studies. 
Frequently, this uncertainty is not really the essence of the planning problem, or it 
reflects a lack of knowledge about only a few parameters in the model. In such 
cases, sensitivity analysis, as discussed in Chap. 5, suffices to determine the impact 
of uncertainty. But on other occasions uncertainty pervades the entire model, 
and standard sensitivity analysis is too clumsy and computationally burdensome 
for analyzing the impact of uncertainty. 

To illustrate, consider a chemical manufacturing company that seeks a long- 
range strategy for the development and marketing of new products. Substantial 
research and investment costs are associated with each product, and the actual 
size of the product’s market is uncertain. Furthermore, most of the profits that are 
generated from a successful product will be used to finance the research and 
development of new products. A linear programming model that manages to 
capture the dynamic elements of this situation, but treats the uncertainty aspects 
by simply using average values, is not likely to yield a good strategy. 

In contrast, dynamic programming models can analyze multiperiod planning 
problems containing uncertainty, and so can be used to determine optimal 
strategies. But, as compared with linear programming applications, these dynamic 
programming models in practice can treat only drastically simplified systems. 
As you learned in Chaps. 10 and 17, unless the underlying system is characterized 
by only a few state variables, the computational task of solving a dynamic pro¬ 
gramming model is horrendous. 

A similar limitation holds for those dynamic probabilistic models that are 
amenable to mathematical analysis, such as the inventory and queuing phenomena 
you studied in Chaps. 19 and 20. To solve these models, you not only must restrict 
yourself to a small-scale system, but you also must simplify the way the system 
can operate. To illustrate, a realistic analysis of waiting lines in a job-shop is intract¬ 
able using mathematical queuing theory like that presented in Chap. 20 and 
Appendix III. Those models serve only as rough approximations to realistic 
queuing phenomena. 

Thus, despite the great diversity of applications of mathematical programming 
and probabilistic models, many important managerial decision-making problems 
must be analyzed by other kinds of techniques. 

Challenge remaining. The expanding scientific literature on operations 
research bears witness that there is steady progress in finding techniques to overcome 
the above-mentioned limitations. But for now and the foreseeable future, the 
approaches given in the preceding chapters cannot be relied on to provide a 
complete analysis of managerial decision-making problems pertaining to: 

(i) Choice of Investment Policies for Strategic Planning. A major corporation’s 
investment policy, to be comprehensive, should include provisions relating to research 
and development of new products, expansion into new markets, choice of 
selection criteria for major projects, measurement and evaluation of risk, means of 
financing by debt and equity, reinvestment of earnings, disposition of liquid 
assets, evaluation of mergers and acquisitions, and divestment of assets. A 
full-fledged operations research model for the analysis of alternative policies must 
recognize the impact of the uncertain and dynamic nature of investments, as well 
as provide a means for screening the enormous variety of investment decisions 
that face an organization. 

(ii) Selection of Facilities in Operations Planning. Several examples in this category 
were already discussed in Sec. 20.1. They included the determination of the 
number of checkout stands in a supermarket, the number of gasoline pumps at a 
service station, and the number of elevators in a new office building. There are 
numerous other examples dealing with personnel staffing, plant layout, and 
machine capacity decisions. Typical facilities selection questions are of the form: 
“How many?” “How large?” “Where located?” 

(iii) Design of Information-Feedback Scheduling and Operations Rules. Illustrations 
of decision problems in this category are equally numerous, although you may 
not think of them right away, unless you have had some previous work experience. 
An important example is the design of scheduling rules for a job-shop manufactur¬ 
ing plant, or an equipment repair facility, or a computer center. Such rules for 
a manufacturing plant take account of promised due dates to customers, the 
requirements for, and the availabilities of, machine capacities, the deployment of 
skilled labor, and the provisioning for raw materials. As information on new orders 
arrives, and as completed orders leave the system, the shop schedule has to be 
updated and revised. 

Another example of an information-feedback system is a scheduling procedure 
for routing transport facilities. To illustrate, a freighter shipping company, in 
making a schedule of its oceangoing equipment for several months ahead, must 
take into account cargo demands at various ports, ship capacities and speeds, 
uncertainties in sailing times due to vagaries in the weather, and delays due to 
port congestion. Many shipping lines that own a large fleet of vessels must 
reschedule daily as they receive more accurate information about uncertain 
events. Similar problems arise in the scheduling of patients into a hospital, and 
the timing of traffic lights on a major thoroughfare. 

What makes the three types of problems described above so difficult to analyze? 
It is the combined effect of uncertainty, the dynamic interactions between deci¬ 
sions and subsequent events, the complex interdependencies among the variables 
in the system, and, in some instances, the need to use finely divided time intervals. 
Such total systems problems are too big and too intricate to handle with linear and 
dynamic programming models, or standard probabilistic models. 

Frequently, actual decisions arising from these three types of problems involve 
spending at least several hundred thousand dollars, and vitally affect the future 
operating costs and efficiencies of a company. Thus, management is highly 
motivated to employ a systematic approach to improve on intuitive, or 
“seat-of-the-pants,” analysis. So far, the best operations research approach available is 
digital computer simulation. 

Simulation approach. Our main concern in this chapter will be to 
describe simulation and the kinds of problems you encounter in employing this 
technique. We do not show you in detail how to design and run simulations. 
Such instructions are in texts devoted to simulation and in manuals distributed by 
computer manufacturers to explain special simulation programming languages. 

In brief, the simulation approach starts by building an experimental model of 
a system. Then various specific alternatives are evaluated with reference to how 
well they fare in test runs of the model. 

If you think about it, you will recall occasions when you have been involved in 
a simulated environment. For example, an amusement park, like Disneyland, 
offers you many attractions, such as the jungle boat-ride and the Matterhorn 
bobsled, that try to simulate actual experience. Less frivolous examples are 
planetarium shows and the environments in a museum of natural history. You 
may have learned how to drive an automobile in a mock-up mechanism with a 
steering wheel and gas and brake pedals. And if you have been in the armed 
services, you will remember that boot camp or basic training consists mainly of 
simulated exercises. 

It is usually too inconvenient and expensive to solve managerial decision prob¬ 
lems by environmental analogue simulations, such as the field combat war games 
that are used in boot camp and basic training. Rather, it is preferable to represent 
a complex system by a computerized mathematical model. In a computer, the 
only thing that can be shot is an electronic circuit. 

The uncertainties, dynamic interactions, and complex interdependencies are 
all characterized by formulas stored in the memory of the high-speed digital 
electronic computer. The system simulation begins at a specified starting state. 
The combined effects of decisions, and of controllable and uncontrollable events, 
some of which may be random, cause the system to move to another state at a 
future instant in time. The evolutionary process continues in this fashion until the 
end of the horizon. Frequently, the time intervals are finely divided and extend 
over a fairly long horizon. As a consequence, the simulation experiments involve 
a vast number of calculations, rapidly performed by the computer. This feature of 
years of history evolving in a few minutes on a computer is termed time com¬ 
pression. 

The only game in town. Most operations research analysts look upon 
digital computer simulation as a “method of last resort”—hence the title of this 
section, “When All Else Fails. . . .” There are two reasons for this gloomy attitude. 

The first reason is the nature of most simulation results. When the model 
includes uncertain events, the answers stemming from a particular simulation 
must be viewed only as estimates subject to statistical error. For example, a 
simulated queuing model yields only an estimate of a waiting line’s average length or 
the associated probability of a delay. Therefore, when you draw conclusions 
about the relative merit of different specific trial policies as tested by a simulation 
model, you must be careful to assess the accompanying random variations. 

The second reason for diffidence about simulation involves the nature of the 
applications themselves. If a system is so complicated that it is beyond the reach 
of such operations research tools as linear and dynamic programming or standard 
probability analysis, then the required model-building effort and the subsequent 
analysis of the simulated results are likely to be difficult. Many an unwary analyst 
has found, to his chagrin, that the simulated world is as unfathomable as the real 
world he hoped to approximate—he allowed so much to go on in the model that 
it hampered his finding any insightful information. 

The above two reasons also suggest why electronic computers are indispensable 
in performing simulations. To obtain sufficient statistical accuracy for reliable 
decisions, a considerable number of simulation runs are usually necessary. Each 
experiment is so complicated that it would be virtually impossible to perform the 
simulation manually in a reasonable period of time. It is not surprising, then, that 
computer simulation is often an expensive way to study a complex system. 

21.2 SIMULATION IN PERSPECTIVE 

As you read in the preceding section, many important managerial decision 
problems are too complex and too large to be solved by mathematical program¬ 
ming and standard probability analysis. In such cases, real-life experimentation, 
even if feasible, is usually too costly a way to analyze the alternatives. These 
observations establish the need for other problem-solving approaches, but do 
not by themselves justify computer simulation. Here we discuss why computer 
simulation is a useful technique, as well as what its limitations are. 

Unlike the situation with mathematical programming, there are as yet no 
underlying principles guiding the formulation of simulation models. Each 
application is ad hoc to a large extent. Computer simulation languages come the 
closest to providing any general guidelines. [SIMSCRIPT and the General 
Purpose Systems Simulator (GPSS) are the two best-known languages; we say 
more about these programs in Sec. 21.8.] 

The absence of a unifying theory of digital simulation is both a boon and a 
bane. On the positive side, you can build a simulation model containing arbi¬ 
trarily high-order complexities and a huge number of dynamic interdependencies, 
as well as nonstationarities and correlated random phenomena. On the negative 
side, the more complicated the model, the more you will have to rely on 
embryonically developed statistical theory to perform the data analyses. As mentioned 
above, the very intricacy of the model can make it difficult to assess the model’s 
validity. If the model is very complicated, you may have to expend a great deal of 
computer time on replication to obtain trustworthy answers and nearly optimal 
policies. Given the considerable research interest in simulation techniques, 
however, many of the current deficiencies in the theory and design of simulation 
experiments are bound to be eliminated in the years ahead. 

Objectives. You would construct a simulation model to assist in analyzing 
managerial decision problems with one or more of the following purposes in 
mind: 

(i) To Describe a Current System. Consider a manufacturing firm that recently 
has witnessed an increase in its customer orders, and has noticed a consequent 
marked deterioration in meeting due-dates promised to its customers. This 
company may want to build a simulation model to study how its current procedures 
for estimating due-dates, scheduling production, and ordering raw material are 
giving rise to the observed delays. 

(ii) To Explore a Hypothetical System. Consider a hospital that is contemplating 
the installation of a new computerized inventory replenishment system for its 
medical supplies. It may want to build a simulation model using historical data 
to test what the average level of inventory investment would be, and how often 
there would be shortages of various supplies under the proposed plan. 

(iii) To Design an Improved System. Consider a job shop in which machine 
capacities are allocated by priorities assigned to each job. The company may 
want to build a simulation model in order to find an effective way to assign such 
priorities so that the jobs are completed without long delays and, at the same time, 
so that equipment utilization is acceptably high. 

We turn next to the steps in constructing and applying a simulation model. 

So you want to build a simulation. The outline to follow describes 
the way you would go about constructing a simulation: 

Step 1. Formulate the Model. This step is much the same as that for other 
operations research models. There is an ever-present danger, however, of including too 
much detail in a simulation model and, as a result, consuming excessive amounts 
of computer time to perform the experiments. The best guard against this 
tendency is to keep your specific purpose constantly in mind. For example, if a 
model is to aid in the choice between two different locations for a new warehouse, 
it is probably not necessary to simulate activities on an hour-to-hour, or even 
day-to-day, basis; weekly aggregates ought to suffice. If, on the other hand, a model is 
to aid in the choice between one or two loading docks at a new warehouse, then 
it may be necessary to simulate activities occurring in intervals as small as 5 to 15 
minutes. 

Step 2. Design the Experiment. You will reduce the chance of making mistakes and 
wasting time if you work out the details of the experimental procedures before 
running the model. This means that you need to think out carefully what operating 
characteristics of the simulated system you plan to measure. Further, you must 
consider the statistical tools you intend to apply to take account of the 
experimental fluctuations in the measurements. 

Step 3. Develop the Computer Program. The simulation experiments will be per¬ 
formed entirely by a high-speed electronic calculator. That is, each historical 
evolution of the model, including the generation of random events, will take place 
within the computer. If the simulated model has a very simple structure, you may 
find it easiest to use a standard programming language, such as FORTRAN, PL/1, 
or ALGOL, to develop the computerized version. More likely, you will find it 
preferable to employ one of the several simulation languages, such as SIMSCRIPT 
or GPSS, that are available on many large-scale electronic computers. 

When you undertake an actual application, you will find that the above steps 
are not completely separate and sequential. For example, if you have already 
become familiar with, say, the GPSS simulation language, then you may want to 
formulate the model, initially, in terms of this language. We give more detail on 
each of these steps in the sections below. 

21.3 STOCK MARKET SIMULATION EXAMPLE 

An investor, Wynn Doe, wants to evaluate a particular strategy for buying and 
selling common stocks. To keep the exposition straightforward, suppose he does 
all of his trading in a single stock. At present, he holds 100 shares of the stock, 
which currently has a price of $10 a share. Again for the sake of simplicity, 
assume that the stock price can change each day by only $1, so that some of the 
possible stock prices are $8, $9, $10, $11, $12, . . . . The investor makes, at most, 
one transaction each day, and pays a commission of 2% of the transaction value 
whenever he buys or sells; of course, he need not make a transaction every day. 

Wynn Doe wants to test the profitability of the following rule for buying and 
selling that has been suggested by his broker Benton Cherning: 

(i) If you own the stock, then sell it whenever the price falls. 

(ii) If you do not own the stock, then buy it whenever the price rises. 

According to this rule, if Wynn Doe owns the stock he will hold on to it while 
the price stays the same or rises; if he does not own the stock, he will refrain from 
buying it as long as the price stays the same or falls. 

In order to evaluate this strategy, Wynn Doe must also postulate how he believes 
the stock price will fluctuate from day to day. After analyzing historical data, 
he formulates the price-movement model shown in Fig. 21.1. To illustrate, if the 
share prices on Monday and Tuesday are both $10, then he believes that the price 
on Wednesday will be $11 with probability 1/4, $10 with probability 1/2, and $9 
with probability 1/4, as can be seen in the second row of Fig. 21.1. If, instead, 
Tuesday’s price is $9, then he believes that the share price on Wednesday will 
be $10 with probability 1/4, $9 with probability 1/4, and $8 with probability 1/2, as 
can be seen in the third row of Fig. 21.1. Notice that as the stock price increases, 
the investor thinks there is probability 1/2 that it will increase again, and analogous 
statements hold if the share price remains the same or decreases. 

Figure 21.1. Stock Price Movement Probabilities. 

To begin testing Cherning’s rule by manual simulation, generate a 
specific history of price movements according to the probabilities given in Fig. 
21.1. A simple mechanism for doing this is to toss a pair of unbiased coins, using 
the correspondences shown in Fig. 21.2. Verify that in Fig. 21.2 the assignments 
of the outcomes of a toss of two unbiased coins yield the postulated probabilities 
in Fig. 21.1. 

Figure 21.2. Price Movements Generated by Tosses of a Pair of Unbiased Coins. 
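
The verification requested above can also be done mechanically. The following is a minimal sketch, not the book's own procedure: the table `COIN_RULE` is an assumed reading of Fig. 21.2, pieced together from the worked Days 1 through 3 and the stated probabilities. In particular, which of "two heads" and "two tails" is assigned to "same" versus "down" in the first row is a guess, and the names `COIN_RULE`, `outcome`, and `movement_probabilities` are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Assumed reading of Fig. 21.2 (an illustration, not the original figure).
# Key: yesterday's price movement. Value: coin outcome -> today's movement.
# "HT" means one head and one tail, in either order.
COIN_RULE = {
    "up":   {"HT": "up", "HH": "same", "TT": "down"},  # HH/TT split is a guess
    "same": {"HH": "up", "HT": "same", "TT": "down"},
    "down": {"HH": "up", "TT": "same", "HT": "down"},
}


def outcome(first, second):
    """Classify an ordered pair of coin faces as 'HH', 'TT', or 'HT'."""
    if first == second:
        return first + second
    return "HT"


def movement_probabilities(previous):
    """Enumerate the four equally likely ordered tosses of two fair coins."""
    probs = {"up": Fraction(0), "same": Fraction(0), "down": Fraction(0)}
    for first, second in product("HT", repeat=2):
        probs[COIN_RULE[previous][outcome(first, second)]] += Fraction(1, 4)
    return probs


for previous in ("up", "same", "down"):
    print(previous, movement_probabilities(previous))
```

Because a head and a tail can occur in two orders, that outcome carries probability 1/2, while two heads and two tails carry 1/4 each; this is why the movement judged most likely in each row is the one assigned to "a head and a tail."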


Suppose you simulate 20 days of activity, starting on Day 1 and ending on 
Day 20. Then you must toss the two coins 20 times; a particular sequence of 
tosses is recorded in Fig. 21.3. To determine the associated sequence of stock prices, 
you have to specify the initial conditions, namely, the stock price on Day 0 and 
whether it represents a fluctuation from the preceding day. In Fig. 21.3, the price 
on Day 0 is $10, which represents no change from the preceding day. Given these 
initial conditions and a toss having a head and a tail on Day 1, the stock price 
for Day 1 is $10, according to the second row of Fig. 21.2. Then on Day 2, since 
yesterday’s price remained the same, the toss of two tails implies that the share 
price falls to $9, again according to the second row of Fig. 21.2. Proceeding to 
Day 3, since yesterday’s price decreased, the toss of two heads causes the share 
price to be $10, according to the third row of Fig. 21.2. Check the entries in Fig. 
21.3 for Days 5, 10, 15, and 20. 

You can now determine how well Cherning’s suggested rule for buying and 
selling has performed on this particular simulated 20-day history of price 
movements. The details are shown in Fig. 21.4; notice that the history of prices from 
Fig. 21.3 has been copied for easy reference. The entries in the column labeled 
“Decision” are a direct consequence of the price history and the suggested rule. 
The entries in the last three columns are determined after some auxiliary calculations. 

To illustrate, on Day 2, the investor sells his 100 shares at a price of $9; but he must 
pay a 2% commission, which amounts to $18 (= .02 × $9 × 100); thus he receives 
only $882 (= $900 − $18) from the sale. On Day 3, he repurchases the stock. Once 
again he must pay a 2% commission, so effectively the stock price is $10.20 a share. 
Since he has $882 cash, he can purchase only 86 shares, leaving him $4.80 
(= $882 − 86 × $10.20) cash. Notice that at the end of the 20th day, the investor’s cash 
position, $931.90, is worse following the rule than it would have been if he had sold 
his 100 shares on Day 0 and thereby received $980 cash, after paying the commission. 
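
The auxiliary calculations for Days 1 through 3 can be sketched in a few lines of code. This is an illustrative sketch under stated assumptions: the function name `apply_rule` and its parameters are hypothetical, money is kept in whole cents so the arithmetic is exact, and the investor is assumed to sell his entire holding or to buy as many whole shares as his cash allows, as in the text. Only the first three days are reproduced, because those are the only prices quoted in the discussion.

```python
def apply_rule(prices, shares=100, cash_cents=0, commission_pct=2):
    """Apply the broker's rule to a price history.

    prices[0] is the Day 0 price in whole dollars; prices[d] is the Day d price.
    Returns one (day, price, decision, shares, cash_cents) tuple per simulated day.
    """
    history = []
    for day in range(1, len(prices)):
        price, previous = prices[day], prices[day - 1]
        decision = "none"
        if shares > 0 and price < previous:
            # Rule (i): own the stock and the price fell, so sell everything.
            # Proceeds in cents = shares * price * 100 * (1 - commission).
            cash_cents += shares * price * (100 - commission_pct)
            shares = 0
            decision = "sell"
        elif shares == 0 and price > previous:
            # Rule (ii): own no stock and the price rose, so buy whole shares.
            cost_per_share = price * (100 + commission_pct)  # cents, with commission
            shares = cash_cents // cost_per_share
            cash_cents -= shares * cost_per_share
            decision = "buy"
        history.append((day, price, decision, shares, cash_cents))
    return history


# Day 0 price $10, then the prices for Days 1, 2, and 3 given in the text.
for row in apply_rule([10, 10, 9, 10]):
    print(row)
```

On Day 2 the sketch yields 88,200 cents ($882) in cash, and on Day 3 it yields 86 shares with 480 cents ($4.80) left over, in agreement with the figures above.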

Given all the model’s assumptions, is Cherning’s rule profitable? Probably your 
immediate reaction is, “No.” But wait a minute. Suppose instead of arbitrarily 
selecting 20 days as the length of the simulation, you had picked either 6 or 16 days. 
What would your answer have been then? Or suppose you rerun the 
simulation with a new history of 20 tosses. Will the rule still look poor at 
termination? The issue of whether the rule is any good really depends in part on the 
statistical variability in the result obtained on Day 20, and on the significance of 
looking at a horizon of 20 versus 200, versus 2000, versus any other number 
of simulated days. 
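
One way to get a feeling for this variability is to replicate the experiment many times and for several horizons. The sketch below is only an illustration and rests on several assumptions that are not part of the model as stated: the probabilities in `MOVE_PROBS` are an assumed reading of Fig. 21.1, terminal wealth is taken as cash plus the market value of any shares held (ignoring the commission on a final sale), and the price is not allowed to fall below $1. All names are hypothetical.

```python
import random
import statistics

# Probabilities of (up, same, down) given yesterday's movement;
# an assumed reading of Fig. 21.1.
MOVE_PROBS = {
    "up":   (0.50, 0.25, 0.25),
    "same": (0.25, 0.50, 0.25),
    "down": (0.25, 0.25, 0.50),
}
STEP = {"up": 1, "same": 0, "down": -1}


def final_wealth_cents(days, rng, price=10, last_move="same",
                       shares=100, cash=0, pct=2):
    """Simulate one history and return terminal wealth in cents."""
    for _ in range(days):
        move = rng.choices(("up", "same", "down"),
                           weights=MOVE_PROBS[last_move])[0]
        if price + STEP[move] < 1:   # assumption: price never drops below $1
            move = "same"
        price += STEP[move]
        if shares and move == "down":        # rule (i): sell on a fall
            cash += shares * price * (100 - pct)
            shares = 0
        elif not shares and move == "up":    # rule (ii): buy on a rise
            cost = price * (100 + pct)
            shares = cash // cost
            cash -= shares * cost
        last_move = move
    return cash + shares * price * 100


def replicate(days, runs, seed=0):
    """Return (mean, standard error) of terminal wealth in dollars."""
    rng = random.Random(seed)
    sample = [final_wealth_cents(days, rng) / 100 for _ in range(runs)]
    mean = statistics.mean(sample)
    std_err = statistics.stdev(sample) / runs ** 0.5
    return mean, std_err


for horizon in (20, 200):
    mean, std_err = replicate(horizon, runs=500)
    print(f"{horizon:4d} days: mean wealth ${mean:.2f} +/- {std_err:.2f}")
```

The standard error printed beside each mean indicates how much the average would be expected to fluctuate from one batch of replications to another; a single 20-day history, such as the one worked by hand, is one draw from this distribution.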

As you think further about the model, you will realize that the evaluation issue 
is complicated by the fact that as the horizon lengthens, there is an increase in the 
possible range of variability in the investor’s wealth position at the end of the 
horizon. Further, even if the rule implies an upward drift in the expected wealth 
position as the horizon lengthens, there is at least an initial increase in the proba¬ 
bility that the investor may go broke along the way. 

So as you can see, even this simple-minded simulation gives rise to some 
difficult questions concerning what to measure and how to design a scientific 
experiment to test the effectiveness of the rule. What is more, if you take the trouble to 
run the model by hand for another 20 periods, you will quickly appreciate the 
desirability of letting an electronic computer do all the coin tossing and arithmetic. 

21.4 BUILDING A SIMULATION MODEL 

We now return to a more general discussion of the steps involved in using 
computer simulation. In this section we examine three aspects of model building: 
specifying the model’s components; testing its validity and reliability; determin¬ 
ing its parameters and measuring its performance. 

Model components. The structure of most simulation models is con¬ 
veniently described in terms of its dynamic phenomena and its entities. 
The dynamic phenomena in the stock market simulation of the preceding section 
include the investor’s activity of buying or selling the stock, according to the stated 
decision rule, and the factors governing the movement of stock prices. The 
entities on any day include the amount of stock the investor holds, his cash posi¬ 
tion, and wealth. Typically, the entities in a model have attributes. To il¬ 
lustrate, the amount of stock the investor holds has a monetary value, given the 
associated price of the stock. Further, there are membership relationships 
providing connections between the entities. For example, the investor’s wealth on 
any day includes both his cash and stock positions. 

At any instant of a simulation, the model is in a particular state. The descrip¬ 
tion of the state not only embodies the current status of the entities but frequently 
includes some historical information. For example, the state of the system at the 
beginning of a day in the stock market simulation is described by yesterday’s 
price, how yesterday’s price differed from the price on the day before, the number 
of shares held, and the cash position. 

A model also can encompass exogenous events, that is, changes that 
are not brought about by the previous history of the simulation. To illustrate, the 
investor in the stock market simulation may have decided to add $1000 more 
cash from his savings on Day 21, regardless of how well he has done using the 
tested strategy. 

Knowing the state of the system and the dynamic phenomena, you can then 
go on to determine the subsequent activities and states. Frequently, simulation 
models having this evolutionary structure are called recursive or causal. 

Note that in building a causal model, you must resolve the way activities occur 
within a period. For example, on each day of the stock market simulation, first 
the price is determined, then the decision to buy or sell is exercised. Actually, the 
price of a stock may change several times during a day, so the model we con¬ 
structed is only a rough approximation to reality. The model also assumes that if 
the investor sells the stock, he receives the cash at the end of the day; and anal¬ 
ogously, if he purchases the stock, he pays the cash at the end of the day. Such 
financial transactions do not always occur so rapidly in practice. 
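
The recursive structure just described, a state together with a rule that carries it forward one period, might be sketched as follows. This is an illustration only: the class `State` and the function `next_state` are hypothetical names, cash is held in whole cents, and the within-day ordering (the price moves first, then the buy-or-sell decision is exercised and settled the same day) follows the assumptions stated above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    price: int       # most recent price, in dollars
    change: int      # +1, 0, or -1: how that price differed from the day before
    shares: int      # number of shares held
    cash_cents: int  # cash position, in cents


def next_state(state, move, commission_pct=2):
    """One day's transition: the price moves first, then the rule is applied."""
    price = state.price + move
    shares, cash = state.shares, state.cash_cents
    if shares > 0 and move < 0:        # own the stock and the price fell: sell
        cash += shares * price * (100 - commission_pct)
        shares = 0
    elif shares == 0 and move > 0:     # hold no stock and the price rose: buy
        cost = price * (100 + commission_pct)
        shares = cash // cost
        cash -= shares * cost
    return State(price, move, shares, cash)


state = State(price=10, change=0, shares=100, cash_cents=0)
for move in (0, -1, +1):               # price movements on Days 1, 2, and 3
    state = next_state(state, move)
print(state)
```

Each new state depends only on the preceding state and the current day's price movement, which is the sense in which such a model is called recursive or causal. An exogenous event, such as a deposit of additional cash, would enter as a further input to `next_state`.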

Model validity and reliability. After building a simulation model, 
you are bound to be asked, “How realistic is it?” The more pertinent question is, 
“Does the model yield valid insights and reliable conclusions?” After all, since the 
model can only approximate reality, it must be evaluated by its power to analyze 
the particular managerial decisions you are studying. 

Once the purpose of the simulation experiment is defined, you construct each 
piece of the model with a commensurate amount of detail and accuracy. A caveat 
is in order here. As simulation experts can attest, it is easy for a novice to build 
a model that, component by component, resembles reality; yet when the pieces 
are hooked together, the model may not behave like reality. So beware not to 
assume blindly that the entire simulated system is sufficiently accurate, merely 
because each of the component parts seems adequate when considered in isolation. 
This warning is especially important, because usually the objective of a simulation 
model is to fathom the behavior of a total system, and not that of the separate 
parts. 


Model parameters and performance measures. It is one thing 
to describe the pieces of a simulation model abstractly, and it is another to collect
sufficient data for a trustworthy representation of these pieces. Limited availability
of data may very well influence the way you build a simulation.

You must be particularly cautious when you are dealing with extrapolated 
data and nonstationary performance measures. (Remember the story of the 
cracker barrel manufacturer who, not so very long ago, forecasted that he would 
be selling millions of barrels today. He assumed, unquestioningly, that his sales 
trend would continue as it had in the past.) 

You also must watch out for cyclical or periodic phenomena. When these are 
present, you must be judicious in selecting the variables to measure in the experiments.
If you look only at “ending values,” for example, then your conclusions
may be very sensitive to the exact length of the horizon that you simulated. 

21.5 GENERATING RANDOM PHENOMENA 

Most applications of simulation models encompass random phenomena. For 
example, in simulated waiting line models, the random variables include arrival 
and service times; in inventory models, the variables include customer demand and 
delivery times; and in research and development models, the variables include 
events of new product discoveries. Frequently, such simulations require thousands, 
and sometimes hundreds of thousands, of draws from the probability distributions 
contained in the model. How an electronic computer makes these draws is the 
subject of this section. 

Uniform random numbers. As you will see, the basic building block 
for simulating complex random phenomena is the generation of random digits. 
The following experimental situation is an illuminating description of what we 
mean by generating a sequence of uniform random numbers. 

Suppose you take ten squares of paper, number them 0, 1, 2, . . . , 9, and place
them in a hat. Shake the hat and thoroughly mix the slips of paper. Without
looking, select a slip; then record the number that is on it. Replace the square
and, over and over, repeat this procedure. The resultant record of digits is a
particular realized sequence of uniform random numbers. Assuming the squares
of paper do not become creased or frayed, and that you thoroughly mix the slips
before every draw, the nth digit of the sequence has an equal, or uniform, chance
of being any of the digits 0, 1, 2, . . . , 9, irrespective of all the preceding digits
in the recorded sequence.

In a simulation, you typically use random numbers that are pure decimals. 
So, for example, if you need such numbers with four decimal places, then you 
can take four at a time from the recorded sequence of random digits, and place 
a decimal point in front of each group of four. To illustrate, if the sequence of 
digits is 3, 5, 8, 0, 8, 3, 4, 2, 9, 2, 6, 1, . . ., then the four-decimal-place random 
numbers are .3580, .8342, .9261,.... 
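
The grouping of recorded digits into pure decimals can be sketched in a few lines of Python. This is only an illustration; the function name `digits_to_decimals` and its `places` parameter are hypothetical and do not come from the text.

```python
def digits_to_decimals(digits, places=4):
    """Group a stream of random digits into decimals with `places` digits each."""
    usable = len(digits) - len(digits) % places   # drop an incomplete final group
    decimals = []
    for start in range(0, usable, places):
        group = digits[start:start + places]
        as_integer = int("".join(str(d) for d in group))
        decimals.append(as_integer / 10 ** places)  # place the decimal point in front
    return decimals


# The digit sequence used in the text.
recorded = [3, 5, 8, 0, 8, 3, 4, 2, 9, 2, 6, 1]
numbers = digits_to_decimals(recorded)
print(numbers)
```

Any digits left over after the last complete group are simply discarded in this sketch.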

Suppose you have to devise a way for making available inside a computer a 
sequence of several hundred thousand random numbers. You would probably 
first suggest this idea: perform something like the “slips-in-a-hat experiment”
described above, and then store the recorded sequence in the computer’s memory.
This is a good suggestion, and it is sometimes employed. The RAND Corporation, 
using specially designed electronic equipment to perform the experiment, actually 
did generate a table of a million random digits. The table can be obtained on 
magnetic tape, so that blocks of the numbers can be read into the high-speed 
memory of a computer as they are needed. Several years ago, this tabular approach 
looked disadvantageous, because considerable computer time was expended in the 
delays of reading numbers into memory from a tape drive. But with recent 
advances in computer technology and programming skill, these delays have been 
virtually eliminated. 

Experts in computer science have devised mathematical processes for generating 
digits that yield sequences satisfying many of the statistical properties of a truly 
random process. To illustrate, if you examine a long sequence of digits produced 
by these deterministic formulas, each digit will occur with nearly the same 
frequency, odd numbers will be followed by even numbers about as often as by 
odd numbers, different pairs of numbers occur with nearly the same frequency, 
etc. Since such a process is not really random, it is dubbed a pseudo-random 
number generator. 

Computer simulation languages, like those discussed in Sec. 21.8, invariably 
have a built-in pseudo-random number generator. Hence, you will rarely, if 
ever, need to know specific formulas for these generators. But if you want to 
strengthen your confidence in the process of obtaining the numbers, then you 
can study the example of a pseudo-random number generator given below. If 
not, go on to the discussion of how to generate random variables. 

*Congruential method. To begin, we need to review the idea of
modulus arithmetic. We say that two numbers x and y are congruent,
modulo m, if the quantity $(x - y)$ is an integral multiple of m. For example,
letting m = 10, we can write

$$\begin{aligned}
4 &\equiv 4 \pmod{10}, & 3 &\equiv 3 \pmod{10}, \\
84 &\equiv 4 \pmod{10}, & 13 &\equiv 3 \pmod{10}, \\
124 &\equiv 4 \pmod{10}, & 513 &\equiv 3 \pmod{10}, \\
1{,}000{,}004 &\equiv 4 \pmod{10}, & 48{,}653 &\equiv 3 \pmod{10}.
\end{aligned}$$

To find the value of, say, 857 (modulo 10), you calculate the integer remainder
of 857 divided by 10, which is 7.

One popular approach for generating pseudo-random numbers is the so-called
Multiplicative Congruential Method. The general formula for
producing the random numbers is

$$r_n = a\, r_{n-1} \pmod{m}, \tag{2}$$

where the parameters a and m, and the seed $r_0$ are specified to give desirable
statistical properties of the resultant sequence. Note that because of the modulus
arithmetic, each $r_n$ must be one of the numbers 0, 1, 2, 3, . . . , m - 1.

Clearly, you must be careful about the choice of a and $r_0$. For example, if a = 1,
then $r_n = r_0$, for all n. Or if $r_0 = 0$, then $r_n = 0$, for all n. The values of a and $r_0$
should be chosen to yield the largest cycle or period, that is, to give the
largest value for n at which $r_n = r_0$ for the first time.

To illustrate the technique, suppose you want to generate ten-decimal-place
numbers $u_1, u_2, u_3, \ldots$. It can be shown that if you use $u_n = r_n \times 10^{-10}$, where

$$r_n = 100{,}003\, r_{n-1} \pmod{10^{10}}, \qquad r_0 = \text{any odd number not divisible by 5}, \tag{3}$$

then the period of the sequence will be $5 \times 10^8$; that is, $r_n = r_0$ for the first
time at $n = 5 \times 10^8$, and the cycle subsequently repeats itself. Given that you want
ten-decimal-place numbers, this is the maximum possible length of period using
(2). (There are other values for a that also give this maximum period.) Verify
that the selection of $r_0$ in (3) eliminates the possibility that $r_n = 0$; so $u_n$ satisfies

$$0 < u_n < 1.$$

Let us look at an example of (3). Suppose $r_0 = 123{,}456{,}789$. Then

$$r_1 = (100{,}003)(123{,}456{,}789) = 12{,}346{,}049{,}270{,}367 \equiv 6{,}049{,}270{,}367 \pmod{10^{10}}, \tag{4}$$

so that $u_1 = .6049270367$, and

$$r_2 = (100{,}003)(6{,}049{,}270{,}367) = 604{,}945{,}184{,}511{,}101 \equiv 5{,}184{,}511{,}101 \pmod{10^{10}}, \tag{5}$$

so that $u_2 = .5184511101$. The decimals $u_n$, for n = 1, 2, . . . , 20, are shown
in Fig. 21.5. Notice that the rightmost digits in this sequence form a short cycle
7, 1, 3, 9, 7, 1, 3, 9, . . . . Thus the statistical properties of the digits near the
end of the number are far from random.

While (2) works reasonably well for some types of simulation models, it has
poor serial correlation properties that make it dangerous to use for dynamic
systems. A simple device for rectifying this deficiency is to intermix several
sequences, each being generated with a different value for the seed $r_0$, and
possibly a different value for a. For example, you can sequentially rotate
among, say, 10 of these generators.

The advantage of using a pseudo-random number generator in lieu of a recorded
table of random numbers is that only a few simple computer instructions are
required to generate the sequence. Therefore, the approach uses only a small
amount of memory space and does not require reading magnetic tape.

  n    u_n
  1   .60492 70367
  2   .51845 11101
  3   .66636 33303
  4   .33211 99909
  5   .99544 99727
  6   .98361 99181
  7   .94266 97543
  8   .80343 92629
  9   .33660 77887
 10   .78869 33661
 11   .70269 00983
 12   .11790 02949
 13   .38319 08847
 14   .23804 26541
 15   .97953 79623
 16   .73484 38869
 17   .59322 16607
 18   .94573 49821
 19   .33541 49463
 20   .50087 48389

Figure 21.5. The Multiplicative Congruential Method: a = 100,003; $r_0$ = 123,456,789.
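
As a check on the arithmetic, the recursion in (2) with the parameters of (3) can be sketched in Python. The function name `multiplicative_congruential` and the seed test inside it are illustrative assumptions, not part of the text; Python integers are exact, so no special handling of the large products is needed.

```python
from itertools import islice


def multiplicative_congruential(seed, a=100_003, m=10**10):
    """Yield r_1, r_2, ... from r_n = a * r_(n-1) (modulo m), as in (2) and (3)."""
    # Assumption for this sketch: enforce the seed condition stated in (3),
    # which applies to the modulus m = 10**10.
    if seed % 2 == 0 or seed % 5 == 0:
        raise ValueError("seed should be odd and not divisible by 5")
    r = seed
    while True:
        r = (a * r) % m
        yield r


m = 10**10
r_values = list(islice(multiplicative_congruential(123_456_789), 20))
u_values = [r / m for r in r_values]          # the decimals u_n = r_n * 10**-10
rightmost_digits = [r % 10 for r in r_values]  # exposes the short cycle 7, 1, 3, 9

for n, r in enumerate(r_values, start=1):
    print(f"{n:2d}  .{r:010d}")
```

Printing the integers with ten zero-padded digits avoids any floating-point rounding in the displayed decimals.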


Generating random variables. We turn next to an explanation of how to employ
a sequence of uniform random numbers to generate complex probabilistic
phenomena. The treatment below suggests several techniques that can
be used; but it is by no means exhaustive. Further, the examples that illustrate
the techniques are chosen more for expository ease than computational efficiency.
In an actual situation, you should seek the advice of a computer science specialist
to determine the appropriate technique for your model.

Inverse Transform Method. The following is the simplest and most
fundamental technique for simulating random draws from an arbitrary single-variable
probability distribution. Let the distribution function for the random
variable be denoted by

F(x) = probability that the random variable has a value less than or equal to x.

For example, suppose the random phenomenon has an exponential density
function

$$f(t) = \lambda e^{-\lambda t}, \qquad t \ge 0, \tag{6}$$

then

$$F(x) = \int_0^x \lambda e^{-\lambda t}\, dt = 1 - e^{-\lambda x}. \tag{7}$$

Now $0 \le F(x) \le 1$, and suppose F(x) is continuous and strictly increasing.
Then given a value u, where $0 < u < 1$, there is a unique value for x such that
F(x) = u. Symbolically, this value of x is denoted by the inverse function $F^{-1}(u)$.
The technique is to generate a sequence of uniform random decimal numbers
$u_n$, n = 1, 2, . . . , and from these, determine the associated values as $x_n = F^{-1}(u_n)$.

The correctness of this approach can be seen as follows. Consider any two
numbers $u_a$ and $u_b$, where $0 \le u_a \le u_b \le 1$. Then the probability that a
uniform random decimal number u lies in the interval $u_a \le u \le u_b$ is $u_b - u_a$.
Since F(x) is continuous and strictly increasing, there is a number $x_a$ such that
$F(x_a) = u_a$, and a number $x_b$ such that $F(x_b) = u_b$, where $x_a \le x_b$. The Inverse
Transform Method is valid provided that the true probability of the random
variable having a value between $x_a$ and $x_b$ equals the generated probability
$u_b - u_a$. This true probability is $F(x_b) - F(x_a) = u_b - u_a$ by construction, so that
the method is indeed valid.

To see how this method works, return to the exponential distribution (6) and
(7). Let $v_n$ denote a uniform random decimal number. Set

$$v_n = 1 - e^{-\lambda x_n}, \tag{8}$$

so that

$$x_n = \frac{-\log_e (1 - v_n)}{\lambda} = \frac{-\log_e u_n}{\lambda}, \tag{9}$$

where $u_n = 1 - v_n$, and hence is itself a uniform random decimal number. Thus,
you generate a sequence of uniform random decimal numbers $u_1, u_2, u_3, \ldots$, and
by (9) compute $x_1, x_2, x_3, \ldots$, to obtain a random exponentially distributed
variable. A diagrammatic representation of the technique is shown in Fig. 21.6.

Figure 21.6. Inverse Transform Method for a Continuous and Strictly
Increasing F(x). Example: $F(x) = 1 - e^{-\lambda x}$; $x = -\log_e (1 - v)/\lambda$ for
$\lambda = 1$.
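
The exponential case can be tried out with a short Python sketch. The function name `exponential_draw`, the seed, and the choice of rate 2.0 are illustrative assumptions; Python's `random.random()` returns values in [0, 1), so the sketch uses `1 - random()` to keep the argument of the logarithm strictly positive.

```python
import math
import random


def exponential_draw(u, lam):
    """Apply (9): x = -log_e(u) / lam, for a uniform decimal 0 < u <= 1."""
    return -math.log(u) / lam


rng = random.Random(12345)   # fixed seed so the run is repeatable
lam = 2.0                    # assumed rate; the exponential mean is 1 / lam
draws = [exponential_draw(1.0 - rng.random(), lam) for _ in range(100_000)]
sample_mean = sum(draws) / len(draws)
print(f"sample mean = {sample_mean:.4f}, theoretical mean = {1 / lam:.4f}")
```

With a run this long the sample mean should fall close to the theoretical mean, which gives a quick sanity check on the transformation.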

The idea can also be applied to a probability mass function p(j). Suppose
j = 0, 1, 2, 3, . . . , so that

$$F(j) = \sum_{i=0}^{j} p(i). \tag{10}$$

Then the inverse function can be written as

$$x_n = j \quad \text{for } F(j-1) \le u_n < F(j), \tag{11}$$

where we let F(-1) = 0. For example, suppose the probability mass function is
the binomial distribution:

$$p(j) = \binom{k}{j} p^j (1-p)^{k-j} \quad \text{for } j = 0, 1, \ldots, k, \tag{12}$$

where 0 < p < 1, and k is a positive integer. In particular, assume k = 2 and
p = .5; then p(0) = 1/4, p(1) = 1/2, and p(2) = 1/4, so that by (11) you have

$$x_n = \begin{cases} 0 & \text{for } 0 \le u_n < .25 \\ 1 & \text{for } .25 \le u_n < .75 \\ 2 & \text{for } .75 \le u_n < 1. \end{cases} \tag{13}$$

Since $u_n$ is a uniform random decimal number, there is a 1/4 probability that $u_n$
lies between 0 and .25, a 1/2 probability that it lies between .25 and .75, and a 1/4
probability that it lies between .75 and 1. A diagrammatic representation of the
technique is shown in Fig. 21.7.
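
A minimal Python sketch of the search implied by (11) follows. It assumes the convention that a value of u falling exactly on a boundary belongs to the higher outcome (that is, F(j - 1) <= u < F(j)); the function names are hypothetical.

```python
from math import comb


def binomial_pmf(k, p):
    """Probabilities p(0), ..., p(k) of the binomial distribution in (12)."""
    return [comb(k, j) * p**j * (1 - p) ** (k - j) for j in range(k + 1)]


def inverse_transform_discrete(u, pmf):
    """Return the smallest j with u < F(j), accumulating F(j) as in (10)."""
    cumulative = 0.0
    for j, probability in enumerate(pmf):
        cumulative += probability
        if u < cumulative:
            return j
    return len(pmf) - 1   # guard against round-off when u is very close to 1


pmf = binomial_pmf(2, 0.5)   # [0.25, 0.5, 0.25]
examples = [inverse_transform_discrete(u, pmf) for u in (0.10, 0.25, 0.60, 0.75, 0.99)]
print(examples)
```

The loop is a linear search, so its running time grows with the number of possible values of j.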

Figure 21.7. Inverse Transform Method for a Probability Mass Function.
Example: p(0) = 1/4, p(1) = 1/2, p(2) = 1/4.

Of course, many continuous distribution functions F(x) do not have analytic
inverse functions as does the exponential distribution. The Inverse Transform
Method can still be applied in these instances by employing a discrete approximation
to the continuous function, that is, by storing the values of F(x) for only a
finite set of x. The accuracy of the approximation can be improved by interpolating
between the stored values. In several computer simulation languages (such as
GPSS), you need only specify this discrete approximation, and the corresponding
random phenomenon will be automatically generated.
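
One way to picture the discrete approximation with interpolation is the following Python sketch. It stores F(x) on an evenly spaced grid and interpolates linearly between stored values; the grid spacing, the function names, and the use of the exponential distribution as a test case (chosen only because its exact inverse is known) are all assumptions made for illustration. It does not reproduce how GPSS or any other language implements the feature.

```python
import bisect
import math


def make_table(cdf, xs):
    """Store F(x) for a finite, increasing set of x values."""
    return list(xs), [cdf(x) for x in xs]


def inverse_by_interpolation(u, xs, fs):
    """Approximate F^-1(u) by linear interpolation between stored values."""
    if u <= fs[0]:
        return xs[0]
    if u >= fs[-1]:
        return xs[-1]
    i = bisect.bisect_right(fs, u)            # now fs[i - 1] <= u < fs[i]
    fraction = (u - fs[i - 1]) / (fs[i] - fs[i - 1])
    return xs[i - 1] + fraction * (xs[i] - xs[i - 1])


grid = [0.1 * i for i in range(101)]          # x = 0.0, 0.1, ..., 10.0
xs, fs = make_table(lambda x: 1.0 - math.exp(-x), grid)
approx = inverse_by_interpolation(0.5, xs, fs)
exact = -math.log(1.0 - 0.5)
print(f"interpolated {approx:.5f}, exact {exact:.5f}")
```

A finer grid, or a grid that is denser where F(x) curves sharply, reduces the interpolation error at the cost of more stored values.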

*Tabular Method. The rule in (11) is easily implemented for an electronic
computer by means of a few standard programming instructions. But if the range
of possible values for j is large, then an excessive amount of time may be consumed
in searching for the j that satisfies the inequalities in (11). A faster version of the
Inverse Transform Method can be employed at the expense of using part of the
computer’s internal memory for storing a long table. We illustrate the idea with
the binomial example (13).

You can store the inverse function in computer memory in the form

$$G(s) = \begin{cases} 0 & \text{for } s = 1, 2, \ldots, 25, \\ 1 & \text{for } s = 26, 27, \ldots, 75, \\ 2 & \text{for } s = 76, 77, \ldots, 100. \end{cases} \tag{14}$$

Given a value of $u_n$, let $d_n$ be the number formed from the first two digits of $u_n$.
Set $s_n = d_n + 1$, and let $x_n = G(s_n)$. For example, if $u_n = .52896\ldots$, then
$s_n = 52 + 1 = 53$, and so $x_n = G(53) = 1$ in (14). Once the function G(s) has
been stored in memory, only a few calculations must be performed by the computer
to produce $x_n$.
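
The table lookup can be sketched in Python as below. The list `G` and the function `tabular_draw` are hypothetical names; because Python lists are indexed from 0, the stored value G(s) sits at position s - 1.

```python
# G[s - 1] holds G(s) for s = 1, ..., 100, following the binomial example:
# 25 entries of 0, then 50 entries of 1, then 25 entries of 2.
G = [0] * 25 + [1] * 50 + [2] * 25


def tabular_draw(u):
    """Look up the outcome from the first two decimal digits of u (0 <= u < 1)."""
    first_two_digits = int(u * 100)   # e.g. .52896 -> 52
    s = first_two_digits + 1
    return G[s - 1]


print(tabular_draw(0.52896))
```

Floating-point multiplication can occasionally shift a value such as .29 to just below 29, so a production version might instead take the leading digits directly from the integer produced by the generator.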

*Method of Convolutions. Sometimes you can view a random variable
as the sum of other independently distributed random variables. When this is so,
the probability distribution of the random variable is a convolution of probability
distributions, which may be easy to generate. (Occasionally, you can obtain a
workable approximation to a complex probability distribution by using a weighted
sum of independently distributed random variables. For this reason, the approach
also has been called the Method of Composition.)

To illustrate, consider a random variable having a gamma density function

$$g(y) = \frac{\lambda (\lambda y)^{k-1} e^{-\lambda y}}{(k-1)!}, \qquad y \ge 0,\ k \text{ a positive integer}. \tag{15}$$

Such a variable can be considered as the sum of k independent random variables,
each drawn from the same exponential density specified in (6). Consequently,
adding k independent values of $x_n$, as given by (9), yields a random variable with
the distribution in (15).
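
A short Python sketch of this construction is given below, with an assumed shape k = 3 and rate 2.0; the function name `gamma_draw` is hypothetical. The sum of k exponential draws should have mean k divided by the rate.

```python
import math
import random


def gamma_draw(k, lam, rng):
    """Sum k independent exponential draws, each obtained from (9)."""
    return sum(-math.log(1.0 - rng.random()) / lam for _ in range(k))


rng = random.Random(2024)    # fixed seed so the run is repeatable
k, lam = 3, 2.0              # assumed parameters for illustration
draws = [gamma_draw(k, lam, rng) for _ in range(50_000)]
sample_mean = sum(draws) / len(draws)
print(f"sample mean = {sample_mean:.4f}, theoretical mean = {k / lam:.4f}")
```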

Similarly, a binomial random variable, as specified in (12), can be viewed
as the sum of k draws of a variable described by

$$i = \begin{cases} 1 & \text{with probability } p \\ 0 & \text{with probability } 1 - p. \end{cases} \tag{16}$$

You can therefore obtain a binomially distributed variable by adding k values of
i. Each of these values for i is determined by the rule

$$i = \begin{cases} 1 & \text{for } 0 \le u < p \\ 0 & \text{for } p \le u < 1, \end{cases} \tag{17}$$

where u is a uniform random decimal number.
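
The same idea for the binomial case can be sketched as follows, with assumed values k = 10 and p = .3; the function names are hypothetical.

```python
import random


def indicator(u, p):
    """Rule (17): 1 when the uniform decimal u falls below p, otherwise 0."""
    return 1 if u < p else 0


def binomial_draw(k, p, rng):
    """Add k independent indicator values to obtain a binomial variable."""
    return sum(indicator(rng.random(), p) for _ in range(k))


rng = random.Random(7)       # fixed seed so the run is repeatable
k, p = 10, 0.3               # assumed parameters for illustration
draws = [binomial_draw(k, p, rng) for _ in range(20_000)]
sample_mean = sum(draws) / len(draws)
print(f"sample mean = {sample_mean:.3f}, theoretical mean = {k * p:.3f}")
```

Each binomial value costs k uniform numbers, which is why this approach becomes expensive when k is large.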

*Method of Equivalent Transformations. Sometimes you can
generate a random variable by exploiting a correspondence between its probability
distribution and that of a related random variable.

For example, consider the Poisson distribution written in the form

$$p(j) = \frac{e^{-\lambda T} (\lambda T)^j}{j!} \quad \text{for } j = 0, 1, 2, \ldots, \tag{18}$$

which has mean $\lambda T$. In terms of the waiting line models in Chap. 20, you can
interpret j as the number of customers arriving during a period of length T, where
the interarrival times for the customers are independently and identically distributed
exponential random variables with the density function specified in (6).

Consequently, you can generate a Poisson distributed random variable by
making successive independent draws of an exponentially distributed variable,
using (9) to obtain such values. You stop making draws as soon as the sum of
j + 1 of these variables exceeds T. The distribution of the resultant j is (18).
Explain why.
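
The stopping rule can be made concrete with a Python sketch. The function name `poisson_draw` and the parameter values (rate 1.5, period length 2) are assumptions for illustration only.

```python
import math
import random


def poisson_draw(lam, T, rng):
    """Count exponential interarrival times that fit within a period of length T."""
    elapsed = 0.0
    j = 0
    while True:
        elapsed += -math.log(1.0 - rng.random()) / lam   # next interarrival, by (9)
        if elapsed > T:      # the sum of j + 1 draws now exceeds T
            return j
        j += 1


rng = random.Random(314)     # fixed seed so the run is repeatable
lam, T = 1.5, 2.0            # assumed parameters for illustration
draws = [poisson_draw(lam, T, rng) for _ in range(20_000)]
sample_mean = sum(draws) / len(draws)
print(f"sample mean = {sample_mean:.3f}, theoretical mean = {lam * T:.3f}")
```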

*Normally distributed random variables. Unfortunately, the
distribution function for the Normal density with mean 0 and variance 1,

$$F(x) = \int_{-\infty}^{x} \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}\, dt, \tag{19}$$

does not yield an analytic formula for the inverse function $F^{-1}(u)$. Of course the
Inverse Transform Method can be used by employing a discrete approximation,
and interpolating between values. But there are other methods for generating a
Normally distributed random variable. Only a few are presented here.

One technique requires generating a pair of independent uniform random
decimal numbers u and v, and in turn yields a pair of independently distributed
Normal random variables x and y having the distribution function in (19).
Specifically, compute

$$\begin{aligned} x &= (-2 \log_e u)^{1/2} \cos 2\pi v \\ y &= (-2 \log_e u)^{1/2} \sin 2\pi v. \end{aligned} \tag{20}$$
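
The pair of formulas in (20), commonly known as the Box-Muller transformation, can be sketched in Python as follows. The function name `normal_pair` and the seed are illustrative; `1 - random()` is used for u so that the logarithm never receives zero.

```python
import math
import random


def normal_pair(u, v):
    """Apply (20) to uniform decimals u (0 < u <= 1) and v to get two Normal values."""
    radius = math.sqrt(-2.0 * math.log(u))
    angle = 2.0 * math.pi * v
    return radius * math.cos(angle), radius * math.sin(angle)


rng = random.Random(99)      # fixed seed so the run is repeatable
sample = []
for _ in range(50_000):
    x, y = normal_pair(1.0 - rng.random(), rng.random())
    sample.extend((x, y))

mean = sum(sample) / len(sample)
variance = sum((value - mean) ** 2 for value in sample) / len(sample)
print(f"mean = {mean:.4f}, variance = {variance:.4f}")
```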


Alternatively, you can apply the Method of Convolutions and invoke the
Central Limit Theorem. This technique employs the sum of k independently and
identically distributed uniform random variables. Specifically, let $u_i$, for i = 1,
2, . . . , 12, be independent draws of a uniform random decimal number; then
compute

$$x = \sum_{i=1}^{12} u_i - 6. \tag{21}$$

The distribution of x will have mean 0 and variance 1, and will be approximately
Normal. The approximation is poor for values beyond three standard deviations
from the mean.
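
A quick Python sketch of (21) shows both the convenience and the limitation of the approach: the mean and variance come out as expected, but no generated value can ever lie more than 6 units from 0. The function name `approx_normal` is hypothetical.

```python
import random


def approx_normal(rng):
    """Formula (21): the sum of 12 uniform decimals, minus 6."""
    return sum(rng.random() for _ in range(12)) - 6.0


rng = random.Random(21)      # fixed seed so the run is repeatable
sample = [approx_normal(rng) for _ in range(50_000)]
mean = sum(sample) / len(sample)
variance = sum((value - mean) ** 2 for value in sample) / len(sample)
largest = max(abs(value) for value in sample)
print(f"mean = {mean:.4f}, variance = {variance:.4f}, largest |x| = {largest:.3f}")
```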

A third approach is to compute

$$x = \frac{\left[(1-u)^{-1/6.158} - 1\right]^{1/4.874} - \left[u^{-1/6.158} - 1\right]^{1/4.874}}{.323968},$$

where u is a uniform random decimal number.


*Correlated random variables. There are straightforward ways to
generate variables from a multivariate Normal distribution, and from other
joint probability distributions, as well as random variables having serial correlation.
The techniques go beyond the scope of this book, but can be found in most
texts on computer simulation.


21.6 HOW TO MARK TIME 

A dynamic systems simulation model can be structured in either of two ways. 
One approach, which is the more obvious, views simulated time as elapsing period 
by period. The computer routine performs all the transactions taking place in 
Period t, and then proceeds to Period t + 1. If the events in Period t imply that
certain other transactions are to occur in future periods, then the computer stores 
this information in memory, and recovers it when the future periods arrive. You 
already saw an illustration of this approach using fixed-time increments 
in the stock market simulation of Sec. 21.4. Another example is given below. 

In some simulations, the periods have to be relatively short. But there may be 
many of these periods in which no transactions occur. For such models, there is 
a second approach that lets the simulation advance by variable-time increments.
This idea is illustrated in the second example below.

Time-step incrementation—inventory model. Suppose you 
wish to evaluate the operating characteristics of a proposed inventory replenishment
rule. Assume that you can specify the probability distribution for each day’s
demand, and that daily demand is identically and independently distributed.
If demand exceeds the amount of inventory on hand, the excess represents lost
sales. Let us postulate that, during a daily time period of the simulation model,
the sequence of events is: first, any replenishment order due in arrives; then
demand occurs; and finally, the inventory position is reviewed, and a reorder is
placed if the replenishment rule indicates it should be. An order placed at the
end of Period t arrives at the start of Period t + L, where L is fixed and L ≥ 1.

To keep the exposition simple, assume that the replenishment rule is to order
Q units whenever the amount of inventory on hand plus inventory due in is less
than or equal to s, where Q > s. Verify that the inequality Q > s implies there is
never more than one replenishment order outstanding. (Since our focus here is
simulation and not inventory theory, we do not comment further on the reasonableness
of the replenishment rule; we do point out, however, that the model
is an approximation to that in Sec. 19.6.)

A simulation model of this inventory system is easily constructed by stepping 
time forward in the fixed increment of a day, beginning with Day 1 (t = 1).
To start the simulation, you must specify the initial conditions of the level of 
inventory on hand, the amount due in, and the associated time due in. You must 
also designate the number of periods that the simulation is to run; let the symbol 
“HORIZON” denote this value. 

A flow chart of the simulation is shown in Fig. 21.8. The initializing is done in
Block 1. For example, you can let the amount INVENTORY ON-HAND = Q,
the AMOUNT DUE-IN = 0, the TIME DUE-IN = 0, and t = 1. Verify that
when Block 2 is reached, the answer is “No,” and you proceed at Block 4 to
generate a value of demand q for Day 1. Here is where you use an approach from
the preceding section.

At the end of Day 1, INVENTORY ON-HAND is diminished by q unless q
exceeds the amount available, in which case the amount of INVENTORY ON-HAND
becomes 0. This calculation is performed at Block 5.

At Block 6, a test is made to determine whether a replenishment order is to be
placed. If so, the AMOUNT DUE-IN becomes Q, and the TIME DUE-IN
becomes 1 + L (since at the start t = 1), as indicated in Block 7. If a replenishment
order is not placed, you continue directly to Block 8, where the time step
is incremented by 1; that is, the simulation clock is advanced to Day 2.

If Day 2 goes beyond the HORIZON you specified, the simulation terminates.
Assuming that you set the HORIZON > 1, the simulation returns from Block 9
to Block 2.

At some day, TIME DUE-IN will equal t, and then the simulation branches
from Block 2 to Block 3, where the amount of INVENTORY ON-HAND is
augmented by Q, and the AMOUNT DUE-IN is reset to 0.

The flow chart does not indicate where you would collect statistical data on the
operating characteristics of the system. In programming the model, you would
keep a tally at Block 5 of the level of INVENTORY ON-HAND at the end of a
day, as well as of the amount of lost sales and the number of days when a stockout
occurs. You would tabulate at Block 6 the number of days an order was placed.
Then, before terminating the simulation at Block 10, you would summarize these
tallies into frequency distributions, along with their means, standard deviations,
and other statistical quantities of interest.

Figure 21.8. Inventory Model Simulation Flow Chart.
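
The time-step logic described above can be sketched in Python. Block numbers in the comments refer to the flow chart as described in the text; the function name, the returned dictionary, the sample demand distribution, and the decision to count a stockout day whenever demand exceeds the stock on hand are all assumptions made for this illustration.

```python
import random


def simulate_inventory(Q, s, L, horizon, draw_demand, rng):
    """Time-step simulation of the 'order Q when on hand + due in <= s' rule."""
    if Q <= s:
        raise ValueError("the rule assumes Q > s")
    on_hand, amount_due_in, time_due_in = Q, 0, 0      # Block 1: initial conditions
    lost_sales = stockout_days = orders_placed = 0
    ending_levels = []
    t = 1
    while t <= horizon:                                # Block 9: horizon test
        if amount_due_in > 0 and time_due_in == t:     # Block 2: order due in today?
            on_hand += amount_due_in                   # Block 3: receive the order
            amount_due_in = 0
        q = draw_demand(rng)                           # Block 4: generate demand
        if q > on_hand:                                # Block 5: fill demand, tally
            lost_sales += q - on_hand
            stockout_days += 1
            on_hand = 0
        else:
            on_hand -= q
        ending_levels.append(on_hand)
        if on_hand + amount_due_in <= s:               # Block 6: reorder test
            amount_due_in, time_due_in = Q, t + L      # Block 7: place the order
            orders_placed += 1
        t += 1                                         # Block 8: advance the clock
    return {                                           # Block 10: summarize tallies
        "ending_levels": ending_levels,
        "lost_sales": lost_sales,
        "stockout_days": stockout_days,
        "orders_placed": orders_placed,
    }


# Hypothetical daily demand: 0 with probability .6, 1 with .2, 2 with .2.
def sample_demand(rng):
    return rng.choice([0, 0, 0, 1, 2])


result = simulate_inventory(Q=5, s=2, L=2, horizon=365,
                            draw_demand=sample_demand, rng=random.Random(1))
print(result["orders_placed"], result["lost_sales"], result["stockout_days"])
```

Passing the demand generator as a function keeps the simulation loop independent of the particular probability distribution chosen.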

Suppose the item is a “slow mover,” that is, there is a high probability that
demand q = 0 on any day. Then the time-step method may be inefficient, because
there will be many consecutive days when the computations in Blocks 2, 5, and 6
will be identical. Such redundancies can be eliminated by using the technique
illustrated below.

*Event-step incrementation—waiting line model. Suppose
you want to examine the operating characteristics of the following queuing system,
which is simple to describe but proves difficult to analyze mathematically.
Customers arrive at the system according to a specified probability distribution
for interarrival times. The system has two clerks, A and B. When both servers
are busy, arriving customers wait in a single line and are processed by a first come,
first served discipline. The service times for each clerk can be viewed as independent
draws from a specified probability distribution; but each clerk has a different
service time distribution. Neither the interarrival nor the service time distributions
are exponential.

After thinking about the way this system evolves over time, you will discover 
that the dynamics can be characterized by three types of events: a customer’s 
arrival, a customer’s service begins, and a customer’s service ends. Each event 
gives rise to a subroutine in the computerized version of the system. 

A simulation model using variable-time increments also contains a master 
program having an event list, which is repeatedly updated as the master 
program switches from one event subroutine to another. At the start of 
a simulation run, the event list is usually empty; but at a later instant in the run, 
it indicates when some of the future events are to occur. The role this event list 
plays will be clearer as you examine the event flow charts in Fig. 21.9. 


Figure 21.9. Waiting Line Model Chart of Events.

Assume that you specify the initial conditions of the simulation as: a customer
arrives, say, at Time 0, there are no customers in line, and both the clerks are
free. The master program starts with the event subroutine CUSTOMER ARRIVES,
shown as Block 1 in Fig. 21.9. Using a technique from the preceding
section, the computer selects a random interarrival time at Block 2. The information
that this next arrival event occurs at the implied future time is entered into
the event list. A determination of whether both clerks are busy is made in Block
3. Since the answer is “No” at the start, the master program switches to the event
subroutine SERVICE BEGINS, as indicated in Block 4. To keep the flow diagram
uncluttered, we have suppressed the details that would specify that the computer
must keep track of which clerk serves the customer, an item of information that is
needed when the master program switches to the subroutine SERVICE BEGINS
in Block 7.

The first instruction in SERVICE BEGINS is at Block 8, which records that
the selected clerk is now busy. Then the service time of the customer is determined
in Block 9, using the appropriate service time probability distribution for the
selected clerk. The information that a service-ends event occurs at the implied
future time is entered into the event list. The subroutine then transfers back to the
master program with the instruction in Block 10 to find the next imminent
event in the event list. So far, this can be either the arrival of the next customer
or the completion of service of the first customer. Suppose it is the latter, so that
the master program switches to the subroutine SERVICE ENDS in Block 11.

The first instruction in SERVICE ENDS is at Block 12, which records that 
the server is now free again. Then a test is made at Block 13 to see whether the 
waiting line designated by the symbol “LINE” is empty. Verify that the answer 
is “Yes.” Also check that when control switches back in Block 14 to the master 
program, the next imminent event will be the arrival of the second customer. 

Later in the run, the LINE will contain customers, and then the answer is 
“No” at Block 13. As a result, the length of the LINE is decreased by 1 in Block 
15, and control transfers in Block 16 to the subroutine SERVICE BEGINS. 
Explain what can happen subsequently if Block 10 leads to the CUSTOMER 
ARRIVES subroutine. 

A simulation run progresses as each event subroutine either switches to another 
event subroutine or instructs the master program to increment time to the next 
imminent event. As you can imagine, considerable skill is required to write a 
simulation program that uses event-step incrementation. In particular, expertise 
is needed to program the updating of the event list efficiently as future events are 
generated by the subroutines. Many simulation languages of the type discussed 
in Sec. 21.8 already include a master program that maintains an updated list of 
events; to employ these languages, you only have to specify the separate event 
subroutines. 
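
To make the role of the event list concrete, here is a minimal Python sketch of the two-clerk system. It keeps the event list as a heap ordered by event time. Several details are assumptions made for illustration and are not specified in the text: clerk A is preferred when both clerks are free, the run stops at a time horizon, and the uniform interarrival and service distributions in the usage example are invented.

```python
import heapq
import random
from collections import deque


def simulate_two_clerks(draw_interarrival, draw_service, horizon, rng):
    """Event-step simulation: one first come, first served line, clerks "A" and "B"."""
    event_list = []                    # heap of (time, sequence number, kind, clerk)
    sequence = 0                       # tie-breaker: equal times pop in scheduling order
    clock = 0.0
    free = {"A": True, "B": True}
    line = deque()                     # arrival times of customers waiting in LINE
    waits = []                         # waiting time of each customer who began service
    line_area = 0.0                    # LINE length accumulated over simulated time

    def schedule(time, kind, clerk=None):
        nonlocal sequence
        heapq.heappush(event_list, (time, sequence, kind, clerk))
        sequence += 1

    def service_begins(arrival_time, clerk):
        free[clerk] = False
        waits.append(clock - arrival_time)
        schedule(clock + draw_service(clerk, rng), "service ends", clerk)

    schedule(0.0, "customer arrives")  # initial condition: an arrival at Time 0
    while event_list:
        time, _, kind, clerk = heapq.heappop(event_list)   # next imminent event
        if time > horizon:             # termination test (assumed: a time horizon)
            break
        line_area += len(line) * (time - clock)
        clock = time
        if kind == "customer arrives":
            schedule(clock + draw_interarrival(rng), "customer arrives")
            idle = [c for c in ("A", "B") if free[c]]
            if idle:
                service_begins(clock, idle[0])
            else:
                line.append(clock)
        else:                          # "service ends"
            free[clerk] = True
            if line:
                service_begins(line.popleft(), clerk)
    line_area += len(line) * (horizon - clock)
    return {
        "customers_started": len(waits),
        "mean_wait": sum(waits) / len(waits),
        "mean_line_length": line_area / horizon,
    }


# Hypothetical non-exponential distributions, one service distribution per clerk.
def sample_interarrival(rng):
    return rng.uniform(0.5, 1.5)


def sample_service(clerk, rng):
    return rng.uniform(1.0, 2.0) if clerk == "A" else rng.uniform(1.5, 3.0)


result = simulate_two_clerks(sample_interarrival, sample_service,
                             horizon=1000.0, rng=random.Random(21))
print(result)
```

The heap keeps the cost of finding the next imminent event small even when many future events are pending, which is the part of the program the text singles out as requiring care.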

We have glossed over a number of details in describing the queuing simulation 
model. We briefly mention a few of these before going on to the next section, 
which considers the design of simulation experiments. First, note that the charts
in Fig. 21.9 do not show a test for terminating a simulation run. Of course, you
must include such a calculation; you might state it by means of a time horizon
or limit on the number of customers arriving. Second, observe that tabulating
statistics on the operating characteristics is not an easy process because of the
variable-time increments between successive events. Care must be taken to measure,
for example, not only the frequency with which the waiting line has n
customers, but also the associated fraction of the simulated horizon. Finally,
recall that the initial conditions were chosen arbitrarily. If the queuing system in 
fact tends to be congested, then the effect of letting LINE = 0 at the start will 
take a while to wear off. Specifying appropriate initial conditions is part of the 
tactics of designing a simulation experiment. 
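
The distinction between counting events and weighting by elapsed time can be illustrated with a small Python sketch. The record format (a list of times at which the line length changes) and the function names are assumptions for this example, and the data are invented.

```python
from collections import defaultdict


def time_fractions(changes, horizon):
    """Fraction of the horizon spent at each line length.

    `changes` is a time-ordered list of (time, length) pairs; each length
    holds from its time until the next change, or until the horizon.
    """
    held = defaultdict(float)
    following = changes[1:] + [(horizon, None)]
    for (start, length), (end, _) in zip(changes, following):
        held[length] += end - start
    return {length: duration / horizon for length, duration in held.items()}


def event_frequencies(changes):
    """Fraction of recorded events at which each line length was observed."""
    counts = defaultdict(int)
    for _, length in changes:
        counts[length] += 1
    return {length: count / len(changes) for length, count in counts.items()}


# Hypothetical record of line-length changes over a horizon of 4.5 time units.
changes = [(0.0, 0), (2.0, 1), (2.5, 0), (3.0, 1), (3.5, 0), (4.0, 1)]
fractions = time_fractions(changes, horizon=4.5)
frequencies = event_frequencies(changes)
print(fractions, frequencies)
```

In this record the line holds one customer at half of the recorded events but for only a third of the simulated time, so the two measures can differ substantially.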

21.7 DESIGN OF SIMULATION EXPERIMENTS†

After constructing a simulation model, you face the difficult task of designing 
a set of runs of the model and analyzing data that emanate from these runs. For 
example, you must decide the 

• Starting conditions of the model. 

• Parameter settings to expose different system responses. 

• Length of each run (the number of simulated time periods and the amount 
of elapsed computer time). 

• Number of runs with the same parameter settings. 

• Variables to measure and how to measure them. 

If you are not careful, you can expend an enormous amount of computer time 
in validating the model to see whether it behaves like a real system, in estimating 
the system responses of the model to different parameter settings, and in discovering
the response relationships among these parameters. Even then, and even after
collecting a vast amount of data, you still may not have sufficiently accurate information
to guide a managerial decision.

Surprising as it may seem, there has been relatively little development of 
statistical techniques aimed at constructing efficient designs of simulation experiments.
By and large, professional management scientists have tried to “make do”
with standard statistical tools to analyze experimental data from simulations. 
These techniques at best are only moderately successful, because most of them 
are not constructed for the analysis of multidimensional time series data. In 
particular, many (but not all) of the commonly used statistical tools assume that 
separate observations of the variables being measured are uncorrelated and 
drawn from a Normal distribution with the same parameters. 

We cannot possibly summarize all of the standard statistical techniques that
can be applied to analyze simulation data. Instead, we discuss certain design
procedures that enable you to employ many techniques ordinarily found in a
modern text on experimental statistics. We also give a brief overview of some
statistical approaches that are particularly well suited to the design of computer
simulations.

†This section requires a knowledge of statistical methods.


In search of Normality. Suppose you have constructed a queuing
model to test two different service disciplines. For example, your application may 
be a model of a job-shop production system, and the two disciplines for processing 
orders are “first come, first served” and a particular priority scheme. Assume 
further that the difference in the two disciplines is to be measured solely in terms 
of the average waiting time (exclusive of service) for orders. How might you 
ascertain what this difference is? 

This question is more difficult than it may appear at first glance. Since your
measurements will be random variables, you must consider their statistical
variability and be on the watch for certain kinds of complications. In any single
simulation run, the waiting times of successive orders will be serially correlated
(sometimes called autocorrelated); that is, there is a greater likelihood
that the (n + 1)st order will be delayed if the nth order waits, than if the
nth order commences service immediately. The extent of variation in waiting
times may itself be affected by the two different disciplines. The model may be
unstable and the trend of waiting times may be ever upward. Even if the system
does approach equilibrium, which may require a considerably long run, waiting
times need not be Normally distributed. To ignore all these considerations and
simply compare the average waiting times from a simulated run of each discipline
is to court disaster.

Suppose you can demonstrate, on theoretical grounds, that the queuing model 
is stable, and that the effects of the starting conditions eventually fade away. Then 
it can be proved that even though the waiting times of successive orders are 
autocorrelated, the expected value of the sample average of these waiting times, 
taken over a sufficiently long run, is approximately that implied by the equilibrium 
distribution. 

More precisely, let $x_t$, for $t = 1, 2, \ldots, q$, represent $q$ successive data
observations of the random variable in a given simulation run, and define the
time-integrated average as

(1)    $\bar{x} = \frac{1}{q} \sum_{t=1}^{q} x_t$.

Let $\mu$ represent the so-called ensemble mean of this random variable, as
calculated from the equilibrium distribution. Then for $q$ sufficiently large, we have
the approximation

(2)    $E[\bar{x}] \approx \mu$.

Furthermore, it can be shown that the sampling distribution of $\bar{x}$ is approximately
Normal. You can calculate an estimate of the variance of this distribution
as follows. Assuming that the process is covariance-stationary (the
covariance between $x_t$ and $x_{t+k}$ depends only on $k$ and not on $t$), and that the
associated autocorrelations tend to 0 as $k$ grows large, you first estimate these
autocorrelations by:

(3)    $r_k = \frac{1}{q - k} \sum_{t=1}^{q-k} (x_t - \bar{x})(x_{t+k} - \bar{x})$    for $k = 0, 1, 2, \ldots, M$,

where M is chosen to be much smaller than q. (Unfortunately, a discussion of 
how much smaller M should be is too complicated to be given here, but can be 
found in the statistics literature under the subject title autocorrelation and 
spectral analysis.) The appropriate estimate of the variance of $\bar{x}$ is

(4)    $V_{\bar{x}} = \frac{1}{q} \left[ r_0 + 2 \sum_{k=1}^{M} \left( 1 - \frac{k}{M} \right) r_k \right]$.

Note that if, in fact, the time series is known to be free of autocorrelation, then
the terms $r_k$, for $k = 1, 2, \ldots, M$, would be eliminated from (4). The presence
of positive autocorrelation, however, implies greater statistical variability in $\bar{x}$
as compared with the case of uncorrelated observations.

We now can look at two commonly employed approaches to statistical analyses.
For the first method, consider making one very long run of each service discipline;
specifically, take $T$ consecutive observations in each run. Then you can apply
(1) through (4) with $q = T$. If $T$ is sufficiently large, the statistic
$(\bar{x} - \mu)/\sqrt{V_{\bar{x}}}$, where $V_{\bar{x}}$ is the variance estimate from (4), is
approximately Normally distributed with mean 0 and variance 1. This fact allows
you to perform standard statistical procedures for hypothesis testing and constructing
confidence intervals for $\mu$, as well as to use modern Bayesian analysis. To
compare the effect of the two service disciplines on average waiting time, you can
apply standard statistical theory for discerning the difference between the means
of two Normally distributed variables that have possibly unequal and estimated
variances.
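
The single-long-run calculation can be sketched in a few lines of Python. This is a minimal illustration rather than a definitive procedure: the function names are hypothetical, the weights (1 - k/M) in the variance estimate are an assumed form of (4), and the autocorrelated test series is a made-up stand-in for the output of a simulation run.

```python
import math
import random


def time_integrated_average(x):
    """Equation (1): the plain average of q successive observations."""
    return sum(x) / len(x)


def lagged_covariances(x, M):
    """Equation (3): the estimates r_0, r_1, ..., r_M from one run."""
    q = len(x)
    x_bar = time_integrated_average(x)
    return [
        sum((x[t] - x_bar) * (x[t + k] - x_bar) for t in range(q - k)) / (q - k)
        for k in range(M + 1)
    ]


def variance_of_average(x, M):
    """Assumed form of (4): (1/q) * [r_0 + 2 * sum_{k=1..M} (1 - k/M) * r_k]."""
    q = len(x)
    r = lagged_covariances(x, M)
    weighted = sum((1 - k / M) * r[k] for k in range(1, M + 1))
    return (r[0] + 2 * weighted) / q


def confidence_interval(x, M, z=1.96):
    """Approximate 95% interval for the ensemble mean, using Normal theory."""
    x_bar = time_integrated_average(x)
    # Guard against a slightly negative estimate caused by sampling noise.
    half_width = z * math.sqrt(max(variance_of_average(x, M), 0.0))
    return x_bar - half_width, x_bar + half_width


def autocorrelated_series(q, mean=5.0, carryover=0.8, seed=1):
    """Hypothetical stand-in for simulation output with positive autocorrelation."""
    rng = random.Random(seed)
    x, previous = [], mean
    for _ in range(q):
        previous = mean + carryover * (previous - mean) + rng.gauss(0.0, 1.0)
        x.append(previous)
    return x


if __name__ == "__main__":
    series = autocorrelated_series(5000)
    print(confidence_interval(series, M=50))
```

Dropping the weighted terms (that is, using only r_0 / q) would understate the variance whenever the observations are positively autocorrelated, which is the situation the text warns about.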

For the second method, consider making $n$ independent replications, that is,
$n$ different runs. Suppose you want to have $T$ observations in toto from the $n$
replications and that you take $T/n$ observations from each run (assume $T/n$ is an
integer). Then for each replication $p$, calculate a time-integrated average $\bar{x}_p$,
for $p = 1, 2, \ldots, n$, using (1) with $q = T/n$. Afterwards compute the grand
average

(5)    $\bar{\bar{x}} = \frac{1}{n} \sum_{p=1}^{n} \bar{x}_p$.

For any $T/n$, if $n$ is large enough, the sampling distribution of $\bar{\bar{x}}$ is approximately
Normal due to the Central Limit Theorem for the mean of independently and
identically distributed random variables (namely, the $\bar{x}_p$). If you let $T/n$ be large
enough, the approximation is improved because of the near-Normality of the
sampling distribution for each $\bar{x}_p$. What is more, it follows from (2) that when $T/n$
is sufficiently large,

(6)    $E[\bar{\bar{x}}] \approx \mu$.

To determine the accuracy of $\bar{\bar{x}}$, you can estimate the variance of the sampling
distribution of $\bar{\bar{x}}$ from the variation in $\bar{x}_p$, using

(7)    $V_{\bar{\bar{x}}} = \frac{1}{n} \cdot \frac{\sum_{p=1}^{n} (\bar{x}_p - \bar{\bar{x}})^2}{n - 1}$.

Once again, if $n$ and $T$ are large, the quantity $(\bar{\bar{x}} - \mu)/\sqrt{V_{\bar{\bar{x}}}}$ is approximately
Normally distributed with mean 0 and variance 1, and so the same sorts of statistical
analyses can be performed as in the one-long-replication procedure.
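
A minimal Python sketch of the replication calculations in (5) and (7) follows. The function names are hypothetical, and the list of run averages is assumed to come from your own independent replications.

```python
import math


def grand_average(run_averages):
    """Equation (5): average of the n time-integrated averages."""
    return sum(run_averages) / len(run_averages)


def variance_of_grand_average(run_averages):
    """Equation (7): sample variance of the run averages, divided by n."""
    n = len(run_averages)
    center = grand_average(run_averages)
    squared_deviations = sum((x_p - center) ** 2 for x_p in run_averages)
    return squared_deviations / (n - 1) / n


def normal_interval(run_averages, z=1.96):
    """Approximate 95% interval for the ensemble mean from n replications."""
    center = grand_average(run_averages)
    half_width = z * math.sqrt(variance_of_grand_average(run_averages))
    return center - half_width, center + half_width
```

Because the replications are independent, no autocorrelation terms appear here; the price is that each run must be long enough for the effect of its starting conditions to fade.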

Although the preceding discussion has related to a comparison of two different
service disciplines in a job-shop model, these statistical approaches are generally
applicable. In summary, assuming that the simulated system does approach an
equilibrium, then under widely applicable conditions, you can legitimately average
the successive observations of a simulated time series. As the number of observations
grows large, this time-integrated average, in a probabilistic sense, converges
to the desired ensemble mean implied by the equilibrium distribution. (You can
find the subject of probabilistic convergence treated in detail in texts on stochastic
processes under the heading of ergodic theorems.) And furthermore, under
widely applicable conditions, the time-integrated average is approximately
Normally distributed. (You can look up the topic of Normal approximations in
advanced statistics texts under the heading of the Central Limit Theorem for
correlated random variables.)

Therefore, in many situations you can apply Normal-distribution theory if you
either replicate simulation runs and then take a grand average of the individual
time-integrated averages, or if you take a single time-integrated average from a
very long run. A comparison of the relative merits of these two approaches as well
as of other methods goes beyond the scope of this text. (The issues involved concern
the amount of bias introduced by the starting conditions of the simulation
and the stability properties of the variance estimate $V_{\bar{x}}$.)


Sample size. Assume that you take a sufficient number of replications or
let the simulation run long enough to justify using the Normal distribution to
approximate the sampling distribution of the calculated averages. You still may
need even more replications or a longer run to obtain the accuracy you require
for decision analysis. The determination of an appropriate sample size for a simulation
is no different from sample-size determination in ordinary statistical problems.
Therefore, you can find the question discussed in detail in every text on statistical
analysis.

We do emphasize, however, the influence of the number of observations on the
accuracy of the statistical estimates. Whether you use the single-long-run approach,
depicted in (1) through (4), with $q = T$, or the $n$-replication approach,
depicted in (5) through (7), with $q = T/n$, the true variance of the sampling distribution
for the calculated mean equals the reciprocal of the total number of
observations $T$ multiplied by another factor that is independent of $T$. Therefore,
to reduce the standard deviation of the sampling distribution of either $\bar{x}$ or $\bar{\bar{x}}$ from
a value of $s$, say, to $(.1)s$, you must increase the total number of observations to
$100T$. More generally, to reduce the standard deviation by a factor of $1/f$, you
have to take $f^2$ times as many observations.

Usually, you cannot know how many observations to take at the start of a
simulation, because you do not know the factors that multiply $1/T$ in the expressions
for the true variances of the sampling distributions of $\bar{x}$ and $\bar{\bar{x}}$. For this reason,
a commonly used procedure is to sample in two stages. In the first stage, you take
a relatively small number of observations, and thereby calculate an estimate of
the factor that multiplies $1/T$. With this estimate, you determine the remaining
number of observations to take in the second stage to give the required accuracy.
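
The two-stage idea reduces to simple arithmetic, sketched below in Python. The function and argument names are hypothetical; the only assumption carried over from the text is that the variance of the calculated mean equals some factor divided by the total number of observations.

```python
import math


def required_observations(pilot_count, pilot_variance_of_mean, target_std_dev):
    """Total observations needed to reach the target standard deviation.

    The variance of the calculated mean is taken to be (factor / T), so the
    first-stage sample gives factor = pilot_count * pilot_variance_of_mean.
    """
    factor = pilot_count * pilot_variance_of_mean
    return math.ceil(factor / target_std_dev ** 2)


def second_stage_observations(pilot_count, pilot_variance_of_mean, target_std_dev):
    """Observations still to be taken after the first stage (never negative)."""
    total = required_observations(pilot_count, pilot_variance_of_mean, target_std_dev)
    return max(0, total - pilot_count)
```

For instance, a first stage of 64 observations whose estimated variance of the mean is 0.25 gives a factor of 16; a target standard deviation of 0.125 then calls for 1,024 observations in all, 960 of them in the second stage.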

In actual applications, you may be surprised to find how many observations are 
needed to yield reasonable accuracy in the estimates. As pointed out above, the 
root of the difficulty is often the presence of positive autocorrelation. We discuss 
below a few approaches for coping with inherently large variation in the statistical 
estimates. 

*Variance-reduction techniques. There are a number of ways to
improve the accuracy of the estimate of the ensemble average for a given number
of data observations. These techniques are explained in texts on simulation under
the heading of Monte Carlo or variance-reduction methods. Their
use in management-oriented simulations is not yet widespread. We give only
a couple of illustrations to suggest what is involved.

To assist in the exposition, we return to the example above of simulating a job- 
shop production system. Suppose, for the sake of definiteness, that you are simulating
under the “first come, first served” discipline, and that you want to estimate
the average waiting time of an order. 

The first device we examine is sometimes called the Method of Control
Variates, or alternatively, the Method of Concomitant Information.
We present a highly simplified example of the idea. By elementary considerations
you know that the interarrival times and the waiting times of each order
are negatively correlated; roughly put, the longer the time since the previous order
arrived, the shorter the waiting time of the latest order. State why. Therefore, suppose
in a particular simulation run that the observed average of interarrival times
is greater than the true average. Then you can use this information to add a positive
correction to the observed average value of the waiting times. Similarly,
suppose the observed average of interarrival times is smaller than the true average.
Then you can make a negative correction to the observed average value of the
waiting times. The technique explained below calculates either a positive or
negative correction, whichever is appropriate.

Specifically, from the input data for the simulation you have the value of the
true mean interarrival time, say, $1/\lambda$. Then let $x_t$ represent the waiting time of
Order $t$, and $y_t$ the interarrival time between Orders $t - 1$ and $t$. Consider the
measurement

(8)    $z_t = x_t + y_t - \frac{1}{\lambda}$

and its time-integrated average

(9)    $\bar{z} = \frac{1}{T} \sum_{t=1}^{T} z_t$.

Note that the expectation of $\bar{z}$ is the same as that of $\bar{x}$, since $\bar{y}$, the observed
average interarrival time, is an unbiased estimator of $1/\lambda$. So you can use $\bar{z}$ as a
consistent estimate of the average waiting time.
But if $x_t$ and $y_t$ are sufficiently negatively correlated, then the variance of $\bar{z}$ will
be less than the variance of $\bar{x}$. A sample estimate of the variance of $\bar{z}$ can be
calculated by assessing the variation in $\bar{z}$ from several replications, or by substituting
$z_t$ for $x_t$ in (1), (3), and (4) above.

A more sophisticated method than (8) is to calculate $z_t = x_t + a(y_t - 1/\lambda)$,
where now the value of $a$ is specifically chosen to make the variance of $\bar{z}$ small.
Under ideal conditions, $a$ can be set such that $\mathrm{Var}(\bar{z}) = \mathrm{Var}(\bar{x})(1 - \rho^2)$, where
$\rho$ is the correlation between $\bar{x}$ and $\bar{y}$.
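
The control variate adjustment can be illustrated with a small Python sketch. Everything specific here is an assumption made for the illustration: a single-server queue with exponential interarrival and service times stands in for the job shop, the function names are hypothetical, and the coefficient a is estimated from the replications themselves as minus the sample covariance of the two averages divided by the sample variance of the average interarrival time (estimating a from the same data introduces a small bias).

```python
import random
import statistics


def simulate_run(num_orders, arrival_rate, service_rate, rng):
    """One run of a single-server, first come, first served queue.

    Returns (average waiting time, average interarrival time).
    """
    wait = 0.0
    previous_service = 0.0
    total_wait = 0.0
    total_gap = 0.0
    for _ in range(num_orders):
        gap = rng.expovariate(arrival_rate)
        # An order waits for whatever work its predecessor left behind.
        wait = max(0.0, wait + previous_service - gap)
        previous_service = rng.expovariate(service_rate)
        total_wait += wait
        total_gap += gap
    return total_wait / num_orders, total_gap / num_orders


def control_variate_coefficient(x_bars, y_bars):
    """Sample-based choice of a: minus Cov(x, y) divided by Var(y)."""
    mean_x = statistics.mean(x_bars)
    mean_y = statistics.mean(y_bars)
    covariance = sum(
        (x - mean_x) * (y - mean_y) for x, y in zip(x_bars, y_bars)
    ) / (len(x_bars) - 1)
    return -covariance / statistics.variance(y_bars)


def adjusted_averages(x_bars, y_bars, a, true_mean_gap):
    """z = x + a * (y - 1/lambda), applied to each replication's averages."""
    return [x + a * (y - true_mean_gap) for x, y in zip(x_bars, y_bars)]


def demo(replications=200, num_orders=500, arrival_rate=1.0,
         service_rate=1.25, seed=7):
    """Return (a, variance of unadjusted averages, variance of adjusted ones)."""
    rng = random.Random(seed)
    runs = [
        simulate_run(num_orders, arrival_rate, service_rate, rng)
        for _ in range(replications)
    ]
    x_bars = [x for x, _ in runs]
    y_bars = [y for _, y in runs]
    a = control_variate_coefficient(x_bars, y_bars)
    z_bars = adjusted_averages(x_bars, y_bars, a, 1.0 / arrival_rate)
    return a, statistics.variance(x_bars), statistics.variance(z_bars)
```

Setting a = 1 reproduces the simpler measurement (8); comparing the two variances returned by `demo` indicates how much of the sampling variation the interarrival times absorb in this particular model.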

Before going on, we caution that the preceding example is meant only to be
illustrative of the control variate idea. If you actually apply the technique to a
queuing model like a job-shop production system, you should select a control
variate that would absorb more of the sampling variation than would be accounted
for by the interarrival times of orders. In fact, you probably should use several
control variates instead of only one.

The second variance-reducing device we examine is called the Method of
Antithetic Variates. The aim here is to introduce negative correlation 
between two separate replications of the simulation, so that the variance of the 
combined averages is less than if the replications were independent. (The idea 
can also be extended to more than two replications.) 

Suppose in the job-shop production simulation that the interarrival times are
determined by the Inverse Transform Method of Sec. 21.5. Let $u_t$, for
$t = 1, 2, \ldots, T/2$, be the corresponding uniform random decimal numbers for generating
the interarrival times in the first simulation run of the model. Then in the
second simulation, by using the values $1 - u_t$, which are also uniform random
decimal numbers, the two time-integrated sample averages will be negatively
correlated. State why.

Notice that the two simulations involve a total of $T$ observations. Whether the
mean of the two separate negatively correlated averages has less statistical variation
than does the average of $T$ autocorrelated observations from a single run
depends on the extent to which the antithetic variates $u_t$ and $1 - u_t$ induce negative
correlation. Thus, the answer depends on the particular model being simulated,
and the specific values of the model’s parameters.
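
A hedged Python sketch of a pair of antithetic runs follows. The assumptions are ours, not the text's: interarrival times are taken to be exponential so that the Inverse Transform Method has a closed form, the queue has a single server, service times are drawn independently in each run, and all function names are hypothetical.

```python
import math
import random


def draw_uniforms(count, rng):
    """Uniform random decimals strictly between 0 and 1."""
    values = []
    while len(values) < count:
        u = rng.random()
        if u > 0.0:
            values.append(u)
    return values


def exponential_gap(u, arrival_rate, antithetic):
    """Inverse transform of u, or of 1 - u when antithetic is True."""
    if antithetic:
        return -math.log(u) / arrival_rate  # equals -ln(1 - (1 - u)) / rate
    return -math.log1p(-u) / arrival_rate  # equals -ln(1 - u) / rate


def average_wait(gaps, services):
    """Time-integrated average waiting time for one single-server run."""
    wait = 0.0
    previous_service = 0.0
    total = 0.0
    for gap, service in zip(gaps, services):
        wait = max(0.0, wait + previous_service - gap)
        total += wait
        previous_service = service
    return total / len(gaps)


def antithetic_pair(num_orders, arrival_rate, service_rate, rng):
    """Two runs driven by u_t and 1 - u_t for the interarrival times."""
    uniforms = draw_uniforms(num_orders, rng)
    averages = []
    for antithetic in (False, True):
        gaps = [exponential_gap(u, arrival_rate, antithetic) for u in uniforms]
        services = [rng.expovariate(service_rate) for _ in range(num_orders)]
        averages.append(average_wait(gaps, services))
    return averages[0], averages[1]
```

Averaging the two values returned by `antithetic_pair` gives one observation of the combined estimate; repeating the pair many times and comparing variances is the way to check whether the induced negative correlation is strong enough to pay off in a given model.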

The crucial factor in deciding when to use variance-reduction techniques is
whether, in fact, a given approach diminishes the variance of the estimates, and
if so, whether the reduction is sufficient to warrant the extra computations required.


*Multivariate analysis. The discussion so far has been partly misleading
in that we have discussed examples involving the measurement of only a
single operating characteristic for a system, such as average waiting time, and the
comparison of only two alternatives, such as two different service disciplines. In
real applications of simulation models, there are usually several operating characteristics
of relevance and a multitude of alternatives to evaluate.

Multivariate analysis is by no means a new subject in statistics literature, but 
techniques for the analysis and design of experiments involving multivariate time 
series are just emerging. The reason for this relatively late development is that 
only recently has the availability of electronic computers made it practical to 
perform such data analyses. 

By employing the approaches previously described to yield measurements that 
are Normally distributed, you have at least partially opened the storehouse of 
standard multivariate statistics. But still it is no simple matter to design a simulation 
experiment that can legitimately apply, say, latin squares, factor analysis, or 
multivariate regression. 

Progress in devising helpful tools for multivariate analysis and complex experimentation
is being made on two fronts. One important development, known as
spectral analysis, aims at exploring the nature of serial correlation and periodicities
in time series. The other front seeks methods for finding optimal levels of the
decision variables; two such developments are response surface and stochastic approximation
techniques. You can find these developments explained in the technical
statistics literature.

21.8 COMPUTER LANGUAGES 

Unless you become both an operations research specialist and a computer 
programmer, you personally will not have to translate your simulation model 
into a workable computer program. You should, however, know the major steps 
involved in this translation. 

If your model is fairly simple and is a common application of simulation, then 
a so-called canned program may be available in which all you need do is 
specify a modest amount of input information. The best examples of this type of 
program are inventory control simulators. There are a number of canned programs 
that test the effectiveness of inventory replenishment rules. To employ these 
routines, you must supply the specific rules, such as “when down to 4, order 10 
more,” or a formula to calculate the rules, given demand data. You also supply 
as part of the input either actual historical data on customer demand or a probability
distribution for demand. The computer program then simulates the system
for whatever number of time periods you designate, and calculates statistics such
as the frequency of stockouts, the average inventory level, the number of orders
placed, etc.
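
To make the inputs and outputs of such an inventory simulator concrete, here is a minimal Python sketch. It is not any particular canned program: the function name, the lost-sales treatment of unmet demand, and the use of on-hand plus on-order stock to trigger replenishment are all assumptions made for the illustration.

```python
import random


def simulate_inventory(reorder_point, order_quantity, lead_time,
                       demand_values, demand_probabilities,
                       periods, starting_stock, seed=0):
    """Simulate a rule such as "when down to 4, order 10 more"."""
    if lead_time < 1:
        raise ValueError("this sketch assumes a lead time of at least 1 period")
    rng = random.Random(seed)
    on_hand = starting_stock
    pipeline = []  # outstanding orders as (arrival period, quantity)
    stockouts = 0
    orders_placed = 0
    total_on_hand = 0
    for period in range(periods):
        # Receive any order due at the start of this period.
        on_hand += sum(qty for due, qty in pipeline if due == period)
        pipeline = [(due, qty) for due, qty in pipeline if due != period]
        # Draw the period's demand from the supplied distribution.
        demand = rng.choices(demand_values, weights=demand_probabilities)[0]
        if demand > on_hand:
            stockouts += 1  # unmet demand is assumed lost
        on_hand = max(0, on_hand - demand)
        # Apply the replenishment rule to stock on hand plus on order.
        on_order = sum(qty for _, qty in pipeline)
        if on_hand + on_order <= reorder_point:
            pipeline.append((period + lead_time, order_quantity))
            orders_placed += 1
        total_on_hand += on_hand
    return {
        "stockout_frequency": stockouts / periods,
        "average_inventory": total_on_hand / periods,
        "orders_placed": orders_placed,
    }
```

A call such as `simulate_inventory(4, 10, 2, [0, 1, 2, 3], [0.1, 0.4, 0.3, 0.2], 250, 10)` would test the quoted rule against an assumed demand distribution over 250 periods.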

More typically, your model will require some special computer programming. 
If the simulation is only moderately complex, is to be used infrequently, and is to
be programmed by personnel inexperienced in simulation techniques, then using
a general purpose language, such as FORTRAN, PL/1, or ALGOL, is probably 
the easiest way to accomplish the task. This type of computer language is familiar 
to all programmers of scientific problems; a programmer requires only the details 
of your model to translate it into computer language. 

There is an important drawback to employing languages like FORTRAN,
PL/1, and ALGOL. The programmer has to write, from scratch, subroutines for
certain kinds of calculations that are included in almost all simulations. In the
vernacular, the programmer has to “reinvent the wheel.” For example, most
simulations require generating random variables, and so a subroutine is needed
for each such variable in the model. In addition, since you want to collect statistics
on the system’s operating characteristics, subroutines have to be written to calculate
these statistics, and a fair-sized associated programming effort must be accomplished
to format the output of the simulation runs. Even a moderately
complex model requires careful attention in organizing the data within the
computer memory, writing a master routine for sequencing events in their proper
order, and keeping track of simulated time within the computer.
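
The bookkeeping described above (a master routine, an ordered list of pending events, and a simulated clock) can be sketched in Python as follows. This is only an illustrative skeleton for a single-server queue with exponential interarrival and service times; the function name and all modeling details are assumptions, not part of any particular simulation language.

```python
import heapq
import random
from collections import deque


def run_single_server(num_orders, arrival_rate, service_rate, seed=0):
    """Event-step simulation: returns (waiting times, final clock value)."""
    rng = random.Random(seed)
    events = []  # event list ordered by (time, sequence number, kind)
    sequence = 0
    clock = 0.0
    heapq.heappush(events, (rng.expovariate(arrival_rate), sequence, "arrival"))
    waiting_line = deque()  # arrival times of orders waiting for service
    server_busy = False
    arrivals = 0
    waits = []
    while events:
        # Master routine: advance the clock to the next imminent event.
        clock, _, kind = heapq.heappop(events)
        if kind == "arrival":
            arrivals += 1
            if arrivals < num_orders:
                sequence += 1
                next_time = clock + rng.expovariate(arrival_rate)
                heapq.heappush(events, (next_time, sequence, "arrival"))
            if server_busy:
                waiting_line.append(clock)
            else:
                server_busy = True
                waits.append(0.0)
                sequence += 1
                finish = clock + rng.expovariate(service_rate)
                heapq.heappush(events, (finish, sequence, "departure"))
        else:  # departure
            if waiting_line:
                arrived_at = waiting_line.popleft()
                waits.append(clock - arrived_at)
                sequence += 1
                finish = clock + rng.expovariate(service_rate)
                heapq.heappush(events, (finish, sequence, "departure"))
            else:
                server_busy = False
    return waits, clock
```

Even this toy version needs its own random-variate calls, statistics collection, and event sequencing, which is the repeated effort that special-purpose simulation languages aim to remove.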

Several computer languages have been developed for the specific purpose of 
easing the programming task of building a simulation model. These programs 
require that you specify only the probability distribution functions, and they 
automatically generate random events according to the distributions you indicate.
Several of the languages collect statistics on whatever operating characteristics
you want to examine, and report the results on predesigned output forms. These 
languages also properly sequence events and keep track of time as it elapses in 
the model. 

With such advantages, you may wonder why all simulations are not programmed
in one of these languages. At present, there are several good reasons.
One is that the languages differ to some extent from FORTRAN, PL/1 or ALGOL, 
and hence require a programmer to become familiar with a new system. 

One of the most powerful simulation languages is SIMSCRIPT; it requires 
a knowledge of FORTRAN and is fairly complex because of its considerable 
flexibility. At the other extreme of complexity is the General Purpose Systems 
Simulator (GPSS). It is a self-contained language that is easy to learn by beginners, 
but, accordingly, is restricted in its scope.

A second reason for not employing a simulation language is that it may not be 
available on the computer you want to use. This is rarely the determining factor 
today because SIMSCRIPT and GPSS programs are available for many computers,
and there is widespread access to computer service bureaus that have
these programs. 

A third reason becomes important if the simulation is complex and is to be run 
frequently. A price you pay in using a simulation language is that it often runs 
slowly and consumes large amounts of a computer’s high-speed memory. As a 
result, you may find it costly to perform many experiments, and your model may 
literally not fit into the available memory capacity of the computer.

As further technical improvements in simulation languages continue, and as
management scientists gain more experience in employing computer simulation, 
it seems likely that such languages will be the common mode of solution. 

21.9 DEUS EX MACHINA 

So far we have discussed only simulation models that to some degree represent 
approximations to real situations. Their orientation has been to provide a simulated
environment in which to test the effects of different managerial policies. A
related class of simulation models tries to encompass goal-seeking or purposeful 
behavior. These models display what is termed artificial intelligence. 

Some of the popular examples of artificial intelligence programs include computer
routines for playing such games as chess and checkers. There also have
been a few applications to managerial problems. One group of applications 
focuses on the behavioral patterns of individual decision-makers. A measure of 
such a model’s success is how well it yields decisions agreeing with those of the 
individual whose behavior is allegedly represented. 

Another group of applications deals with complex combinatorial problems,
like those discussed in Chap. 13. They are sometimes referred to as heuristic
programming methods. For example, several of these models have been
designed to derive good schedules for intricate sequencing problems. The following
illustration suggests how they work.

Suppose the goal of the model is to schedule orders through a job-shop with
maximum equipment efficiency. The computer starts by tentatively scheduling
a few orders. It then selects another order to schedule, and examines various
feasibility restrictions, due dates, and equipment efficiency. As a consequence,
the computer may have to reschedule some of the previous orders. In brief, the
computer model uses a number of “look-back” and “look-ahead” rules, and proceeds
by educated trial-and-error toward a feasible schedule. If the rules are
sufficiently sophisticated, then usually the schedule is good. Frequently, the schedule
is nearly optimal according to the specified efficiency criterion, assuming the
heuristic rules are promulgated with reference to this criterion.

Management scientists have also employed computer models for operational
gaming. Some of the early applications, known as management games, involved
several teams of players, each representing a business firm. A team made decisions about
pricing, production quantities, advertising, etc. The computer served the two-fold
purpose of keeping the accounting records, and of calculating the net impact of the
decisions made by the several teams. More recently, such applications have been used to
train personnel in administrative procedures, and to explore the system dynamics of an
industry in which the competing firms are employing specified strategies.


REVIEW EXERCISES


1 Suggest two or three managerial decision problems that seem to go beyond the capabilities
of the mathematical models treated in earlier chapters of this text. Select situations
that involve the optimization of an objective function subject, perhaps, to
constraints or uncertainties. Explain what makes these problems so complex to solve.

2 Consider the stock market simulation example in Sec. 21.3. Repeat the experiment for 
another 20 days, starting with the same initial conditions as in Fig. 21.3. Array your 
calculations and results in tables like Figs. 21.3 and 21.4. Calculate Doe’s cash position 
at the end of Day 20; if he holds stock, compute his cash position by assuming he sells 
out at the end of Day 20. (Suggestion: if you are in a class that is assigned this problem,
collect the results from all the students and calculate the mean and standard deviation
of Doe’s cash position on Days 5, 10, 15, and 20.)

3 Consider the stock market simulation example in Sec. 21.3. Test the rule: (i) if you 
own the stock, then sell it whenever the price falls two days in a row, and (ii) if you do
not own the stock, then buy it whenever the price rises two days in a row.

(a) Use the same data as in Fig. 21.3, and construct a table like Fig. 21.4. Calculate
Doe’s cash position at the end of Day 20; if he holds stock, compute his cash
position by assuming he sells out at the end of Day 20.

(b) Extend the length of the simulation run another 20 days, and calculate Doe’s
cash position at the end of Day 40. (Suggestion: if you are in a class that is assigned
this problem, collect the results from all the students and calculate the mean and
standard deviation of Doe’s cash position on Days 30 and 40.)

4 Suggest one or two experimental setups (analogous to the “slips-in-a-hat” approach)
for generating uniform random digits. Discuss whether any of these approaches can 
be modified to generate other discrete probability distributions. 

*5 Consider the Multiplicative Congruential Method for generating random digits in 
Sec. 21.5. In each part below, assume modulo 10 arithmetic and determine the length 
of the cycle. 

(a) Let a = 2 and $r_0$ = 1, 3, and 5.

(b) Let a = 3, and $r_0$ = 1, 2, and 5.


6 In each part below, apply the Inverse Transform Method in Sec. 21.5, and devise
specific formulas analogous to (9) and (13) that yield the value of the variate given
a random decimal number $u$.

(a) $f(t) = \begin{cases} (b - a)^{-1} & \text{for } a \le t \le b \\ 0 & \text{otherwise,} \end{cases}$

where $a$ and $b$ are real numbers and $a < b$.

(b) $f(t) = \begin{cases} t & \text{for } 0 \le t \le 1 \\ 2 - t & \text{for } 1 \le t < 2 \\ 0 & \text{otherwise.} \end{cases}$

(c) $F(x) = \begin{cases} 0 & \text{for } x < 0 \\ .5x & \text{for } 0 \le x \le .4 \\ x - .2 & \text{for } .4 < x \le .8 \\ 2x - 1 & \text{for } .8 < x \le 1 \\ 1 & \text{for } x > 1. \end{cases}$

(d) Binomial distribution, given by (12), where $k = 2$ and $p = .1$.

(e) Binomial distribution, given by (12), where $k = 3$ and $p = .5$.


7 Explain how the description of the Inverse Transform Method in Sec. 21.5 needs to
be altered to treat the case where F(x) is continuous and nondecreasing (that is, not
necessarily strictly increasing).

*8 In each part below, explain how you would apply the Tabular Method in Sec. 21.5. 

(a) Binomial distribution, given by (12), where k = 2 and p — A. 

(b) Binomial distribution, given by (12), where k = 3 and p = .5.

(c) p(0) = .3, p(2) = .25, p(3) = .15, and p(60) = .3.

*9 (a) Explain how you can apply the Method of Convolutions in Sec. 21.5 to generate
the triangular distribution in exercise 6, part (b). 

(b) Consider the Method of Equivalent Transformations in Sec. 21.5, and explain 
why the suggested procedure for generating a Poisson random variable is valid. 

*10 Consider the approach (21) in Sec. 21.5 for generating Normally distributed random 
variables. Explain why (21), which involves a sum of 12 uniform random decimals, 
produces a random variable with a distribution having mean 0 and variance 1. 

11 (a) Suggest one or more methods for generating a geometric distribution
$p(j) = p(1 - p)^j$, for $j = 0, 1, 2, \ldots$, where $0 < p < 1$. [Recall: let $p$ represent the
probability of a head appearing on the toss of a possibly biased coin; then $p(j)$
represents the probability that a head appears for the first time on Toss j.]

(b) Suggest one or more methods for generating a negative binomial distribution
$p(j) = \binom{k + j - 1}{j} p^k (1 - p)^j$, for $j = 0, 1, 2, \ldots$, where $0 < p < 1$. [Recall: let $p$
represent the probability of a head appearing on the toss of a possibly biased coin;
then $p(j)$ represents the probability that the kth head appears on Toss k + j.]

12 Consider the inventory simulation in Sec. 21.6. Assume 0,= 2, j = 1, L = 2, and the 
demand distribution probabilities are /»(!)= p{^) — *5. Simulate the system for 
20 days, letting INVENTORY ON-HAND = 2, AMOUNT DUE-IN = 0, TIME 
DUE-IN = 0, and f = 1 in Block 1. (Suggestion: flip a coin to generate the daily demand
quantities.)

13 Consider the inventory model simulation in Sec. 21.6.

(a) Explain how you would modify Fig. 21.8 if lead time L were random. 

(b) Explain what complications arise if J. If demand in excess of INVENTORY 
ON-HAND is backlogged. 

*(c) Suppose the probability process for demands can be formulated in terms of a 
distribution generating interarrival times between successive customers, and a 
distribution generating the quantity demanded by each customer. Devise an 
event-step incrementation flow chart for simulating the system. 

14 Construct a time-step incrementation flow chart for the stock market example in 
Sec. 21.3. 

*15 Explain how to design an event-step incrementation simulation for the stock market 
example in Sec. 21.3. Try to construct an appropriate flow chart. 

*16 Consider the waiting line model in Sec. 21.6. Suppose the interarrival time is either 2
or 3, each possibility occurring with probability 1/2; that service time for Clerk A is
either 1 or 2, each occurring with probability 1/2; and for Clerk B is either 1 or 4,
each occurring with probability 1/2. At t = 0, assume that the first customer arrives and
that both clerks are free. Simulate an hour’s operation of the system. (Suggestion:
determine the random events by flipping a coin. If you are in a class that is assigned
this problem, collect the following statistics for all the students and calculate the
associated mean and standard deviation: the number of persons waiting in line at
t = 30, 45, and 60 minutes; the number of persons that were served during the hour
and their average time in the system, in line, and in service.)

17 Consider the OR Airline telephone line simulation, discussed in Sec. 1.6. In each part 
below, draw a flow chart indicating how to simulate a two-line system. Also explain 
what statistics you would tabulate. 

(a) Use a time-step incrementation formulation. 

*(b) Use an event-step incrementation formulation. 


18 Explain your understanding of the following terms:

digital computer simulation
manual simulation
time compression
dynamic phenomena
entities
decision rule
attributes
membership relationships
state of the simulated system
exogenous events
recursive (causal) structure
validity and reliability
performance measure

*Method of Equivalent Transformations
fixed-time increments
variable-time increments
time-step incrementation
*event-step incrementation
*master program
*event list
*event subroutine
*next imminent event
serial correlation (autocorrelation)
time-integrated average
ensemble mean
covariance-stationary

uniform random numbers
pseudo-random number generator
*Multiplicative Congruential Method
*modulus arithmetic
*congruent, modulo m
*seed
*cycle (period)
Inverse Transform Method
*Tabular Method
*Method of Convolutions (Method of Composition)

spectral analysis
ergodic convergence
*Monte Carlo (variance-reduction) methods
*Method of Control Variates (Method of Concomitant Information)
*Method of Antithetic Variates
canned program
artificial intelligence
heuristic programming
*operational gaming (management games)


FORMULATION EXERCISES 

19 In each part below, describe how you would build a simulation model to represent 
the system. State explicitly the random phenomena and how you would obtain the 
probability distributions required as input. Describe the performance measures you 
would collect, and explain their relevance to the associated managerial decision 
problem. Try to develop a flow chart describing the simulation of a “toy” version of
the system. 

(a) Checkout stands in a grocery store. (How would you test the effectiveness of an
express counter to serve customers with five or fewer items?)

(b) Tellers’ windows at a bank. (How would you test the effectiveness of having an
express window to serve customers requiring only a single transaction?) 

(c) Pumps at a gasoline station. 

(d) Runways at an airport. 

(e) Traffic lights in a configuration of eight city blocks. 

(f) Parking lot for a doctors’ office building.

(g) Telephone operators at a metropolitan police station. 

(h) Restaurant facilities at a World’s Fair. 

(i) Layout of tables at a posh restaurant. 

(j) Layout of equipment in a job-shop. 

(k) Piers at a harbor facility. 

(l) Allocation of substitute teachers in a school district. 

(m) Layout of rooms and facilities in a new school building. 

(n) Screening methods for evaluating proposed capital investments by a corporation. 

(o) Design of an electronic computer installation at a college or university. 

(p) A decision problem that you suggested in exercise 1.


CHAPTER 22


Implementation of
Operations Research


22.1 LIKE IT IS 

In the past few years, many significant technical breakthroughs have been 
made in operations research. Even greater progress occurred, however, in
implementing operations research in commercial and governmental enterprises.

In the early 1960’s, a practicing operations researcher had to be both a
scientific expert and a master of the “art of persuasion.” Ethical, but convincing,
salesmanship was needed then because relatively few companies firmly believed
that operations research was a profit-yielding activity. Most executives classified
the effort as blue-sky research and development, and, in fact, several major
corporations placed their operations research group in an R & D department.

Since then, the picture has changed dramatically. Only rarely now is an
operations researcher called upon to defend his raison d’être. Today, executives show
pride in employing computer models that have been designed to assist them in
analyzing complex decision problems. (Many managers guard their computer
models as a part of their territorial imperative.) In short, very few executives in
leading corporations still ask, “Why do we need operations research?” They know
why.†

The questions that managers do raise are, “What areas of application are the 
most profitable?” “Is our company spending too little or too much on operations 
research?” “How can I best use operations research?” In other words, the present 
interest of businessmen is learning how to get the maximum benefit from operations 
research.

The sections below explore the practical implications of this managerial
attitude and offer some insights into the implementation process. Although the
sections are entitled “How to . . . ,” the chapter is not really a comprehensive
nuts-and-bolts manual of procedures for guaranteeing profitable implementation.
Rather, the chapter provides a few guidelines for making operations research
work effectually. The orientation of the discussion is toward people, not
mathematical techniques.

As you will soon discover, the key to the successful conduct of operations research 
is the joint exercise of good judgment by executives and professional operations 
researchers. In particular, the managers and technicians must decide in concert 
what projects to pursue, what goals to sight, what level of effort to expend, and 
what timetable to follow. These are the subjects analyzed below. 

►You may be curious to know why implementation is so much easier now. The reasons 
may not be obvious, as many of the most useful mathematical techniques have existed 
for more than 15 years. Part of the answer has already been given in Sec. 1.7, which 
explained how executives have come to appreciate the powerful analytic assistance of 
operations research. But this recognition is by no means the only factor contributing to 
a widespread acceptance of the approach. 

An equally important reason is that computing equipment has become vastly more
accessible, and that the number of programming specialists has increased concomitantly.
The improvement in so-called software (computer programs that assist in solving large-scale
optimization and simulation problems) as well as the development of time-shared computing
systems have diminished the task of systems design and lessened the difficulties of
obtaining numerical solutions. Another reason is that the population of well-trained and
experienced operations researchers has expanded sufficiently to provide the required
professional manpower.

22.2 HOW TO PUT OPERATIONS RESEARCH TO 
MANAGERIAL USE 

What commonly distinguishes an executive familiar with employing operations
research from a first-time user is his recognition of the need to exercise
responsibility vis-a-vis the conduct of the project. For quite understandable reasons, tyro
managers are usually “stand-offish” in their involvement. Such a posture is ill 
advised and can be expensive to the company, even when the operations research 
application ultimately succeeds. 

In essence, line managers must take responsibility to see that the right problem 
is analyzed and that adequate controls are exercised to monitor the progress of 
the application. Experience has shown repeatedly that ignoring this responsibility 
is detrimental to all and may easily be the root cause of failure, despite the expertise 
and sincerity of the operations research technical staff. This section suggests some 
ways for an executive to ensure that an operations research effort is well directed 
and aimed at bettering the entire organization. 

What benefits should a manager expect? Operations research
can be employed to mount a massive analysis, when warranted, of an important
and intricate decision-making problem. As you will readily observe in applying
operations research, the approach inherently requires adhering to systematic
procedures and paying careful attention to details. (No other approach for solving
complex management problems even comes close in demanding so much discipline
in analysis.) The combined utilization of advanced mathematical techniques and
enormous computing power permits a thorough exploration of relevant
alternatives. A good operations research study will leave no doubt in an executive’s
mind that all reasonable courses of action have been investigated, and will make
crystal clear the relative merits of specific alternative actions and their possible
consequences.

A central ingredient to a sound operations research investigation is extensive 
sensitivity testing. Careful managerial scrutiny of comparative case studies provides 
the principal means by which an executive can confirm his understanding of the 
underlying model, its assumptions, and its data. Furthermore, the benefits a 
manager receives from a planning-oriented model stem largely from such insightful 
sensitivity testing. Rarely, if ever, does an executive seek “numbers” as answers;
rather, most decision-makers want a quantitative assessment of what risks are at
stake with different actions, what changes in direction are likely to yield profit
improvements, and what avenues are promising for further investigation (such as
the development of new products, expansion into new markets, location of new
plant sites, etc.). Often sensitivity testing reveals that the uncertainty of an
allegedly critical factor is actually not very important in making a good decision,
whereas another factor, previously thought insignificant, is truly pivotal.
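
To make the mechanics of sensitivity testing concrete, here is a minimal sketch in Python. The profit model, its parameter names, and every number are hypothetical and were invented only for illustration. The sketch varies one input at a time over a plausible range and ranks the inputs by how much the result swings, which is one simple way to see which factor is pivotal.

```python
def profit(params):
    """Toy annual profit model (hypothetical): margin on sales minus fixed cost."""
    units = params["market_size"] * params["market_share"]
    return units * (params["price"] - params["unit_cost"]) - params["fixed_cost"]


def sensitivity(model, base, ranges):
    """Vary one input at a time between its low and high value.

    Returns a list of (name, low_result, high_result, swing) sorted by
    decreasing swing, so the most pivotal factor comes first.
    """
    rows = []
    for name, (low, high) in ranges.items():
        low_result = model({**base, name: low})
        high_result = model({**base, name: high})
        rows.append((name, low_result, high_result, abs(high_result - low_result)))
    return sorted(rows, key=lambda row: row[3], reverse=True)


# Hypothetical base case and plausible ranges (illustrative numbers only).
BASE = {"market_size": 100_000, "market_share": 0.10,
        "price": 50.0, "unit_cost": 30.0, "fixed_cost": 150_000.0}
RANGES = {"market_size": (90_000, 110_000),
          "market_share": (0.08, 0.12),
          "unit_cost": (29.0, 31.0),
          "fixed_cost": (140_000.0, 160_000.0)}

if __name__ == "__main__":
    print(f"base profit: {profit(BASE):,.0f}")
    for name, low, high, swing in sensitivity(profit, BASE, RANGES):
        print(f"{name:13s} low={low:>10,.0f} high={high:>10,.0f} swing={swing:>9,.0f}")
```

With these made-up figures, the uncertainty in market share dominates the result, while the uncertainty in fixed cost matters far less. A real study would of course also examine inputs that vary together, which this one-at-a-time sketch does not attempt.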

To determine whether an operations research project is meeting acceptable 
standards of quality, some easy items to check are the ready availability of input 
data and model assumptions in a form understandable to nontechnicians, the 
summarization of results and the backup detail printed in the format of managerial 
reports, and reasonable turnaround times for running additional analyses having 
slightly modified input data or assumptions. The best way for a manager to make 
these checks is to ask questions and probe the answers. A competently designed 
model should provide a manager comprehensible answers to his spontaneous 
“why does . . .” and “what if . . .” questions without requiring a mammoth crash
effort. (We hasten to add, however, that it is unfair to expect such rapid service 
at the initial stages of a study. A line executive should continue to ask questions 
throughout the duration of the project, and monitor whether the effort required 
to answer these questions eventually becomes routine and commensurate with the 
value of the analytic assistance provided.) 

Another indicator of project quality is the extent to which the analysis results 
in a recommended strategy, as distinct from a suggested single decision. To illustrate, 
the output from a long-range capacity-expansion study should not be merely a 
string of recommended equipment purchases and forecasted production levels. 
Rather, the output should indicate the decisions to take immediately, should
include recommendations of when to make the next set of decisions, given the
present data, and should establish the circumstances for reviewing, and possibly
revising, these future decisions. Even the immediate decision recommendations
should be qualified to the point of ascertaining what other alternatives are
appropriate if the data are varied within a plausible range of values and any
restrictive assumptions are relaxed.

What limitations should a manager recognize? This question
was partially answered in Sec. 1.3, and you may want to review that
material. Three more cautions are added here.

First, when an operations research model is used to reduce costs, the percentage
savings may be relatively small. But if this percentage is applied to a large cost
base, the absolute savings can pay for the operations research study many times
over. Occasionally, a planning model will uncover a costly error in current
operating procedures; in such an instance, the savings may be large. Most often, profit
improvements stem from executives possessing a deeper understanding of the
problem area, and hence developing a keener sense for taking correct actions and
maintaining control in an uncertain and competitive environment. It is
impossible to assign a precise dollar improvement figure to this type of impact;
nevertheless, the benefits are real and are perceived and valued by company management.
In a preponderance of successful applications, the beneficial effects are truly
manifest in the altered decision behavior of executives and managers at several
levels of the corporation.

Second, although an operations research model often uses the mathematics of
optimization, the resultant solution should not be viewed as necessarily yielding
an optimal answer to the real problem. After all, as the text has stressed
throughout, a model is inherently an approximation to reality, and therefore an optimal
solution to this approximation need not be the “final” answer to the actual
decision problem. The important issue, however, is not whether a proposed solution
is optimal, but whether the solution yields a significant enough improvement over
the alternatives to make it worthy of acceptance.

Third, while providing a solution to one set of problems, the operations research 
model may create, in turn, another set of problems. For example, the analysis 
may demonstrate the need for an improved information gathering system or for 
a restructuring of operating policies. And, ensuring the continued maintenance 
of an up-to-date model does, itself, pose new managerial problems. 

When should a manager initiate an operations research 
project? It is helpful to distinguish between so-called one-shot or infrequent 
decision problems and recurring decision analyses (like devising an annual plan, 
scheduling men and equipment, and replenishing inventories). 

In special studies, a decision to apply operations research depends on the 
economic and strategic importance of the decision, the time span available for 
performing the analysis, and the relevance and availability of data. It is difficult 
and hazardous to apply operations research under “time pressure.” Consequently,
a manager should consider employing the approach when the stakes are sizeable,
the decision does not have to be made next Monday morning, data are available for
the analysis, and the choice is not so governed by political and personality
considerations within the company that economic analysis is of only minor import.

In planning situations, the decision to apply operations research also depends 
on the economic and strategic stakes of the problem and the available data. But 
planning applications differ from special studies most critically in the longer time 
horizon over which the model can be developed and tested. As we point out in 
later sections, controlling progress in the conduct of an operations research study 
is important; nevertheless, the corporation will not grind to a halt if a couple of
weeks’ delay postpones the completion of an operations research planning model.
(And there always is a couple of weeks’ delay!)

The decision to develop a computerized model for daily operations usually is
more involved. Numerous companies have successfully constructed such models
for applications as diverse as inventory control, tanker-fleet routing, and job-shop
scheduling. Often the economic benefits are small in percentage terms, the systems
design effort is staggering, and the implementation process is painful. Hence, this
type of application is usually justified in terms of producing economic benefits
that will extend over a relatively long term.

Sometimes executives misjudge whether the available data are sufficiently 
accurate as to warrant using an operations research approach. Applications of 
statistical techniques to the design of industrial research experiments, to the 
monitoring of continuous production processes and machinery, and to the auditing 
of voluminous accounting transactions demonstrate that mathematical techniques 
can be effective in analyzing sparse data that are subject to variability and 
measurement errors. Inaccurate or limited data do not per se negate the
application of a mathematical technique. Even if there are no historical data at all,
managers may be able to impart their experience-based knowledge by means of 
probabilistic statements. Hence, it is inappropriate for an executive to reject using 
operations research solely on the grounds of less than perfect factual information. 

Sometimes executives shy away from operations research because they feel 
that their company personnel are not sophisticated enough to use mathematical 
techniques. This fear may be well founded, but the apprehension also may be 
based on a limited or even erroneous understanding of the degree of sophistication 
that is actually required. And all too often, senior managers underestimate the 
capability of their experienced personnel to learn how to apply operations research. 

Many successful applications have been made by personnel who are trained
in accounting, engineering, economics, or business, and who have been away
from school for years. Their first-hand knowledge of the company more than
compensates for their initial unfamiliarity with technicalities of operations research.
In addition, the widespread availability of easy-to-use canned computer programs
has removed much of the burden in going from a model formulation and actual
data to a numerical solution and sensitivity analysis. And, finally, although the
mathematical methods employed to obtain a numerical solution may be advanced,
the solution itself may be easy to interpret and to implement. (A good example
of this type of application is inventory control. The computations of a reorder
point and replenishment quantity can sometimes be intricate; nevertheless, the
resultant ordering policy may simply be of the form “when down to 4, order
more” and thus may be easily understood.)
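
As a hedged illustration of the inventory example, the sketch below computes a replenishment quantity and a reorder point and then reduces them to a rule that a stock clerk can follow. It assumes the classical economic order quantity formula and a reorder point equal to expected lead-time demand plus a safety stock. These formulas, the function names, and all of the numbers are assumptions chosen for illustration; they are not necessarily the method a particular company would use.

```python
import math


def order_quantity(annual_demand, order_cost, holding_cost):
    """Classical economic order quantity: sqrt(2 * D * K / h)."""
    return math.sqrt(2.0 * annual_demand * order_cost / holding_cost)


def reorder_point(daily_demand, lead_time_days, safety_stock):
    """Expected demand during the replenishment lead time plus a safety stock."""
    return daily_demand * lead_time_days + safety_stock


def ordering_rule(on_hand, point, quantity):
    """The policy as the clerk sees it: when down to `point`, order `quantity`."""
    return quantity if on_hand <= point else 0


# Hypothetical item (all numbers are illustrative assumptions).
ANNUAL_DEMAND = 250      # units per year
ORDER_COST = 20.0        # dollars per order placed
HOLDING_COST = 4.0       # dollars per unit per year
DAILY_DEMAND = 1         # units per working day
LEAD_TIME_DAYS = 3       # working days to receive an order
SAFETY_STOCK = 1         # extra units held against demand variability

Q = round(order_quantity(ANNUAL_DEMAND, ORDER_COST, HOLDING_COST))
S = reorder_point(DAILY_DEMAND, LEAD_TIME_DAYS, SAFETY_STOCK)

if __name__ == "__main__":
    print(f"When down to {S}, order {Q} more.")
    for on_hand in (10, 5, 4):
        print(on_hand, "->", ordering_rule(on_hand, S, Q))
```

The analysis lives entirely in the first two functions; the person placing orders needs only the last one, which is the sense in which an intricate computation can yield an easily understood policy.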

How can a manager get what he pays for? Perhaps the most 
difficult responsibility that an executive faces in controlling the progress of an 
operations research application is to strike the right balance between conducting 
the effort as a “research project” and as a “task-force assignment.” 

Estimating how profitable or beneficial an application will be in a particular 
company is central to the research aspect. For example, many companies are able 
to reduce inventory investment by at least 25% by adopting scientific inventory 
control, but the level of reduction in a specific company can only be estimated 
after the operations research project is begun and some trial tests are completed. 
Similarly, most medium-sized oil refineries are able to cut costs by $1000 a day 
when using a linear programming model to make a weekly operating schedule, 
but an estimate of savings at any particular refinery can only be made after a 
preliminary model is built and run on a trial basis. Thus, an executive should 
view the initial phases of an operations research effort as exploratory. 

It is erroneous, however, for management to view the entire project as research. 
Companies with the best record of implementing operations research plan each 
project from the very beginning as an effort to improve current procedures. The 
line managers who are involved share a sense of urgency about completing the
effort and remain vigilant in keeping the study practical and pertinent to 
the actual decision problems. 

Standard control techniques for managing include formulating a statement of 
goals, assigning task responsibilities, developing and updating a time schedule for 
completing various tasks, and planning for managerial reviews. It is the nature of 
operations research studies to encounter delays and unforeseen difficulties. Hence,
expect that the unexpected will occur. The inevitability of these contingencies
is the very reason why an operations research project needs careful managerial
control.

Most operations research efforts require two to three man-years of effort and 
extend over a period of three to nine months. Naturally, if the project is important 
and complicated, these figures will be exceeded. The economic benefits of a
well-conceived and controlled application should far outweigh the expense of
developing and operating the system.

22.3 HOW TO SUCCESSFULLY CONDUCT AN
OPERATIONS RESEARCH PROJECT 

This section outlines the components of a successful operations research
application and expands on several of the factors already discussed above; the context
here is the conduct of a selected project.

Managerial guidance and participation. Both top management
and operating management must recognize their respective roles in the evolution 
of a project. Since an operations research application typically cuts across different 
departments, the effort must have the sincere sponsorship of top management and 
the needed entrees into line activities. Furthermore, top management must watch 
that the corporation’s best interests are held paramount and that the study is not 
diverted so as to serve the interests of individual groups at the expense of the 
company. 

Operating management must actively participate in the project’s goal
formulation, administration, and evaluation. It is both difficult and foolish to impose an
operations research system on an operating management that has not been a 
party to the system’s design. Anyone with only a modicum of experience knows 
that the best of plans can be so cleverly sabotaged by a group of unwilling personnel 
that the promulgator looks like a fool. But more is at issue than just personality 
conflicts. When operating management has not been actively engaged in the 
study, there is substantial likelihood that the proposed methods of the system 
will not be sufficiently comprehensive and flexible to handle the inevitable 
exigencies. Thus, if operating management has not participated in the evaluation 
(and, as a result, has little confidence in the worth of the endeavor), trouble 
looms ahead, even with the most insistent encouragement of top management. 

Project planning and control. The need for monitoring the progress 
of a project has been underscored. Now we highlight several factors in this process 
that are critical to success. 

• The project team should realize at the outset where managerial judgment 
will be required. Specific plans should be made to obtain this counsel, and 
these provisions may in turn require some preparatory educational effort. 
People, not computers, make managerial decisions. 

• The technical phase should be executed carefully, because if it is poorly done, 
the outcome can be disastrous. The team should recognize, however, that 
the mathematical side of the study will represent probably only a minor part 
of the total effort of developing and implementing the application. 

• The data requirements should be ascertained early, and the information
collection initiated soon enough to avoid long delays in the project. Often,
this phase is poorly executed in an operations research study, even when the
project is led by an experienced practitioner.

• Managers and operating personnel should be alerted to any transitional 
difficulties that may arise in testing and installing a new system. For example, 
when scientific inventory replenishment rules are implemented, total inventory 
investment usually rises for the first few months. (Can you explain why?) 
Top management is likely to express consternation unless properly
forewarned.

• The team should be careful to document the model’s components and
assumptions, and to record the input data and sources. In a large-scale effort,
assumptions made several months earlier are easily forgotten. Furthermore, 
as test results and new data are examined, the model is inevitably altered. So 
it is essential that the team systematically catalogues each revision. 

Credibility. Just like pregnancy, there is no such thing as a little credibility. 
Either an executive believes that the operations research representation of his 
problem is valid or he dismisses the results as worthless. The following paragraphs 
discuss how to develop a model that legitimately earns the trust of managers. 

The project team should realize from the very beginning that the economic 
benefits of an operations research application never prove themselves and are 
never self-evident. And to make matters worse, a reliable “before and after”
comparison is always extremely difficult to perform. There are two reasons why.

First, sufficient data about past operations may not be available, at least not 
in a form convenient for tabulation and analysis with acceptable accuracy. 
Hence, in its enthusiasm to design and implement a new approach, the operations
research team should not slight the job of installing a data-gathering system to
reflect the true economic impact of a change. And when past data are insufficient,
the team should start collecting current data long before it institutes new
procedures. The team also should recognize the need to design a controlled
experiment that focuses on the effects to be evaluated. Unless the team heeds these cautions,
it will, itself, be unable to prove factually that an improvement has occurred.

Second, only in exceptional circumstances can a team make a completely
parallel comparison between two systems operating under different sets of
procedures. There is no guarantee that an approach that looks attractive in terms of
last year’s operations will be just as attractive during this year’s activities (or vice
versa). Further, because managerial decisions at one point in time may have a
specific effect on business conditions later, it may be futile to attempt to show with
great precision how anything but an actually operating system behaves over an
extensive period of time.

Thus, it is hard to prove precisely how well an operations research approach 
would have performed historically, or how much better an implemented
operations research approach is faring as compared to what the previous system would
have done. Management and the professionals must realize at the outset that they 
are limited in providing irrefutable evidence that improvements actually result 
from an operations research approach. But it is important to remember that the 
same limitations exist in measuring the impact of any competing problem solution. 

The above observations mean that by and large credibility ought to be
established during the course of the project and not relegated to the end. Most executives
express the following doubts about an operations research model: “How do I 
know that it uses the right data? . . . makes realistic assumptions? . . . computes 
the economic consequences correctly? . . . and encompasses the enormous number 
of relevant detailed considerations?” If you pause to think, it does stagger the
imagination that the essence of a complex decision-making problem can be
transferred to the “brain” of an inanimate electronic device. The following analogy 
may help to explain the psychology of establishing credibility and suggests some 
ways of allaying those doubts expressed by managers. 

Suppose you are handed a telephone book for the first time and told that the 
volume contains the correct telephone numbers of everyone in the city. In a
moment, you surely would realize that the claim is an overstatement. After all,
telephones are installed and removed every day, so the telephone book is only an 
approximate representation of all the telephone numbers in the city. (In this 
sense, the volume of listings is a “model.”) What really concerns you is whether 
the approximation is worth using. How would you find out? 

Probably you would start by looking up a telephone number you already know 
(perhaps, your own). If you find that the listed number is correct, you then might 
select a person whose number you do not know, look up the number, and place 
a call to see if the book in fact gives the right listing. After several more tries of 
this kind, assuming you are successful each time, you would be willing to start 
using the book. And most likely you would continue to use it until you observed 
an increased frequency of wrong numbers. Then you would complain to the 
telephone company, or go back to relying upon the Information Operator. 

Now consider the telephone company’s objectives. It wants to provide a model 
or system that gives you the right numbers. There are many possible systems 
(or models, if you like) for providing this service. The telephone company has 
discovered that the most economical solution is to publish one book containing 
every listed number and to distribute the volume to you and all other subscribers. 
The company knows full well that you will use only a minuscule fraction of all
the numbers; even so, you will judge the system’s merit on the validity of this small 
fraction. 

The preceding analogy is relevant to the design of an operations research system 
in several ways. An executive first tests the validity of an operations research model 
by asking questions about data and conclusions; he knows the right answers to 
some of the questions and has some intuition about others. His confidence builds 
if the forthcoming answers are straightforward, comprehensible, and correct. 
He will start to rely on the model until his confidence is shaken by some “obvious” 
mistakes. 

The operations research project team should try to anticipate what questions 
managers may ask and what data yield answers. This task is helped by discussing 
the detailed designs of the data reports and numerical summaries with the
executives involved. The computer analysis should include not only summary reports
similar to standard management information reports but also detailed backup 
analyses that clearly show the “how” and “why” of the summary figures. Much 
of the output may be examined infrequently; but it is there “just in case.” 

The telephone book analogy should not be pushed too far, because it is
impractical and impossible to provide every number that an executive might possibly
request. But novice operations researchers invariably make the mistake of
providing far too little backup information, documentation, and analysis. As a
consequence, they are frequently put in a position that is embarrassing to them and
annoying to a manager, namely, having to go “back to the drawing board” to
obtain the information that executives want in order to understand the model’s
results.

The above discussion stresses the output requirements of a well-conducted 
operations research analysis. Of course, the team also must employ other means 
of effective communication. These are familiar to professional task-force leaders 
and amount to maintaining an open dialogue between the managers, who 
ultimately have to judge the merit of the results, and the team members. To 
repeat, managerial guidance and participation is a sine qua non for establishing
credibility. 

Responsive and responsible implementation. Truly effecting 
change within a corporate organization, whether the change be installing a new 
computer system or reassigning managerial responsibilities, is usually a difficult 
job. Aside from any special aversion that personnel may have to computer-based 
systems analysis, there are few, if any, problems of implementing change that 
are peculiar to an operations research project. As is true for effecting most
significant changes within a company, the support of top management is vital, adequate
educational training of operating personnel is necessary, a carefully worked out 
plan for introducing the changes is essential, and the implementation process must 
be controlled and monitored to sense and then correct difficulties that may arise. 
Unfortunately, there is no substitute for experience in knowing how to
implement change skillfully.

One problem does deserve additional discussion. The difficulty is reminiscent of
that encountered in factory mechanization many years ago. Certain operations 
research applications, especially those involving daily operations, may drastically 
change the character of the decision-maker’s job. For example, developing a 
computer scheduling model for the processing of orders in a factory, or the routing 
of ships between ports, or the purchasing of materials from vendors, may transform 
a job requiring long familiarity with the decision problem into one of routinely 
supplying raw data to a computer. An operations research approach may remove 
the fun, challenge, exercise of judgment, sense of contribution, and mystique in 
a job. Rarely is top management willing to forego the resultant economic benefits 
for these reasons. But the project team must face up to the likely reaction of 
individuals who will be so affected. The team should recognize that the imple¬ 
mentation process will arouse hostility; accordingly, they should provide post- 
implementation procedures to control a situation that might easily deteriorate 
because of a hostile environment. 

Systems design. If the application is to be used again after the initial 
testing and analysis, then the ultimate success of the project depends upon the 
model’s long-term viability. In the early years of commercial applications of 
operations research, many companies achieved noteworthy success for a while; 
later they discovered that their efforts had dissipated with the changing of busi¬ 
ness conditions and the promotion or resignation of operations research personnel. 
Now experienced firms realize the necessity for building systems support to main¬ 
tain and update a continuing operations research application. 

This point would not merit special mention except for a commonly observed 
phenomenon that most executives still find paradoxical. The typical operations 
researcher, although having expertise in model building and analyzing complex 
problems, is usually ill equipped and frequently disinterested in the above- 
mentioned systems requirements. Consequently, experienced companies include 
systems-oriented personnel in an operations research project team to devise 
procedures for maintaining the model in good working order. 

22.4 HOW TO MANAGE AN OPERATIONS 
RESEARCH STAFF 

In keeping with the tenor of the chapter, this section highlights only a few issues 
that pertain directly to the profit-making impact of a corporate operations research 
activity. 

Location and size. The proper placement of an operations research 
group within a large corporation is no longer a subject of much debate among 
professionals. No standard pattern has evolved, even within an industry, and 
these technical staffs have successfully operated under the guidance of controllers, 
chief planners, vice-presidents of manufacturing, as well as chiefs of research and 
development departments. Today, pragmatic considerations dominate the loca¬ 
tion decision. And divisionalized companies operating under a policy of decen¬ 
tralized management frequently have operations research activities at both the 
corporate and division levels. 

The size of an operations research staff is an unreliable indicator of the group’s 
productivity; a small staff of six talented professionals may have a much greater 
profit impact on a corporation than a group of 20 that contains only two or 
three top-notch scientists. In operations research, quantity is a very poor substi¬ 
tute for quality. 

Corporate responsibilities. Top management expects the operations 
research staff to exercise a high degree of intellectual integrity. This means not only 
that the group must meet demanding professional standards, but also that the 
staff must seek truthful conclusions and refrain from organizational partisanship. 

The operations research group manager must be careful not to overcommit his 
staff. In an effort to please, many groups undertake more projects than they can 
accomplish in a reasonable period of time. As a result, all the users become 
dissatisfied. An operations research group should have a systematic way to decide 
what projects to accept and how to allocate its own scarce professional resources 
to best serve the needs of the entire company. 

Cooperation with users. The preceding sections emphasized the 
importance of working with line managers in the conduct of operations research 
projects. Here this subject is treated from the technical staff’s point of view. 

The group should always keep in sight the way an operations research model 
typically assists managers. In most applications, the model-building effort provides 
insights into the quantitative implications of specified data and assumptions. 
Ultimately, it is the managers who make the decisions and are held responsible 
for the outcomes. Hence these executives must assess the relative likelihood of 
various assumptions and weigh the risks associated with different courses of action. 
An operations researcher should avoid the trap of believing that his model is true 
reality. 

When a corporate operations research staff is first establishing its reputation, 
the group will work under less than ideal cooperative arrangements with its 
users. The requesting organization may be pleased to see the project completed 
successfully, and even may pay for the project. Nevertheless, the user organiza¬ 
tion may not readily provide other necessary kinds of help, which often include 
the collection of data and careful managerial review of intermediate results; 
consequently, the effort may get bogged down waiting for essential assistance in 
the line organization. But when the operations research staff has progressed to 
where it can pick and choose from among several worthwhile projects, then a 
major selection criterion should be the willingness of the user organization to 
commit its personnel time to the project team. A good index of user interest and 
involvement is the extent to which it will allocate the time of its people to assist 
in the application. 

22.5 LIKE IT WILL BE 

Operations research has advanced so rapidly that speculating about even the 
near future is risky. But several developments are now underway that certainly 
will have a major impact over the years immediately ahead. We somewhat 
arbitrarily separate these into technical and technological advances. 

Technical progress. Two avenues of research are particularly important 
for new applications of operations research to executive decision problems. The 
first is the development of efficient techniques for analyzing very large-scale 
problems, such as linear programming models containing thousands of constraints. 
The second is the development of practical methods for solving realistic com¬ 
binatorial problems (as discussed in Chap. 13). 

Contributions will continue to be made, of course, in the theory of operations 
research, including the development of improved nonlinear programming 
algorithms, multi-item inventory replenishment models, and statistical procedures 
for simulation experiments. Business applications including decision models in 
areas such as marketing, advertising, pricing, purchasing, personnel development, 
and finance also will receive more attention from management scientists. In 
addition, tremendous progress will be made in the application of operations 
research in other than profit-making enterprises. 

Technological progress. Although the technical breakthroughs often 
occur independently of advances in computer hardware, the applications of the 
techniques to actual problems do depend significantly on the state of computer 
technology. (Linear programming models, for example, did not have widespread 
acceptance until comprehensive and easy-to-use computer programs were gener¬ 
ally available. The same is true for simulation models.) The development of 
time-shared computing systems seems to be the most significant technological 
advance with regard to future operations research applications. 

Time-sharing has already reduced the amount of effort required to build and 
test a model; and this mode of computing has markedly cut the turnaround time 
between successive trial runs. But the more important impact is the stimulus to 
building new types of models and attacking different types of managerial prob¬ 
lems. Corporations in the avant-garde are constructing very detailed financial 
planning models and are continually running economic analyses of new action 
alternatives as they arise in managerial deliberations. Time-sharing now allows 
the decision-maker to perform sensitivity analysis on-line. Being able to execute 
hundreds of thousands of calculations in a matter of seconds and getting the 
results immediately is an impressive and important step forward in the science of 
executive decision making. 

An inevitable future impact of time-sharing will be the development of operat¬ 
ing systems for decision problems that relate to the scheduling and daily allocating 
of scarce resources, and to the supervision of production processes. The rate of 
progress and acceptance in these areas depends on the growth in the general 
availability of time-shared computing equipment and software systems that are 
appropriate to the needs of commercial installations. 

Administrative science. The time has passed when an operations researcher 
can build a mathematical model and remain impervious to the behavioral 
characteristics of the individuals affected and the organizational milieu. 
Visionaries among operations research professionals are fully aware that new 
developments such as those described above exert tremendous strains on the 
managerial fabric of a corporate organization. To enhance the adoption of 
these technical and technological advances by industry and government, manage¬ 
ment and behavioral scientists together will have to find ways by which executives 
can deal effectively with computerized systems as beneficial change agents. 


Advanced Topics in 
Network Algorithms 


I.1 MAXIMUM FLOW THROUGH 
A CAPACITATED NETWORK 

In the development of sophisticated techniques to solve difficult network models, 
analysis of the following problem is of central importance: given a network with 
arc capacities, where Node 0 is the source of all flow and Node p + 1 is the sink, 
what is the maximum amount of flow that can be routed from source to sink? 
Formally, the model is described as 

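(1)  maximize $F$

subject to 

(2)  $\sum_{(0,j)\ \text{in network}} x_{0j} = F$  for $k = 0$

(3)  $\sum_{(k,j)\ \text{in network}} x_{kj} - \sum_{(i,k)\ \text{in network}} x_{ik} = 0$  for $k = 1, 2, \ldots, p$

(4)  $-\sum_{(i,p+1)\ \text{in network}} x_{i,p+1} = -F$  for $k = p + 1$

(5)  $0 \le x_{ij} \le U_{ij}$  for all $(i, j)$ in network, 

where $U_{ij}$ are nonnegative integers. 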

A simple method can be used to solve the problem. To make the basic idea of 
the algorithm transparent, assume 

(6)  $U_{ij} = 1$  for all $(i, j)$ in network. 

Once you see how to find a solution with (6), you will have no difficulty in 
understanding the minor modification required to solve the general case. 

Begin with any feasible flow. The steps in the technique either determine that 
the flow is maximal, or discover another solution with increased flow: 

Step 1. Starting at Node 0, put a (+) on each arc (0, j) without flow and 
label Node j with a check mark (✓). Put the crossed check (✗) on Node 0. 

Step 2. Consider any Node j that is labeled (✓). Put a (+) on every 
flowless outward arc (j, k) if Node k is not labeled, and label Node k with (✓). 
Then put a (−) on every inward arc (k, j) with flow if Node k is not labeled, 
and label Node k with (✓). Finally, cross the check (✗) on Node j to indicate 
that the node also has been spanned. 

Step 3. Continue with the operation in Step 2 until Node p + 1 is labeled or 
all labeled nodes have been spanned. A breakthrough occurs as soon as Node 
p + 1 is labeled, because a flow-augmenting path has been discovered 
from Node 0 to Node p + 1. The path can be found by tracing back from Node 
p + 1 the arcs that have been marked with a (+) or a (−). Add a unit of flow on 
each arc with a (+) and remove the flow from each arc with a (−). Return to 
Step 1. If, however, Node p + 1 remains unlabeled at the termination of Step 2, 
then the current solution is maximal. 
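
Steps 1 through 3 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the text's own tabular procedure: the function name `augment_once`, the dictionary-based labels, and the first-labeled, first-spanned scanning order are choices made here. The eleven-arc network and its initial two-unit routing are hypothetical; they were picked to be consistent with the scanning sequence of the worked example for Fig. I.1, whose diagram is not reproduced in this copy.

```python
from collections import deque

def augment_once(arcs, flow, source, sink):
    """Apply Steps 1-3 once; return True if a unit of flow was added."""
    label = {source: None}      # labeled node -> (node it was labeled from, arc, sign)
    to_span = deque([source])   # labeled nodes that have not yet been spanned
    while to_span and sink not in label:
        j = to_span.popleft()   # span Node j
        for (a, b) in arcs:
            if a == j and flow[(a, b)] == 0 and b not in label:
                label[b] = (j, (a, b), +1)   # (+) on a flowless outward arc
                to_span.append(b)
            elif b == j and flow[(a, b)] == 1 and a not in label:
                label[a] = (j, (a, b), -1)   # (-) on an inward arc with flow
                to_span.append(a)
    if sink not in label:
        return False            # every labeled node is spanned: flow is maximal
    node = sink                 # breakthrough: trace the path back to the source
    while label[node] is not None:
        previous, arc, sign = label[node]
        flow[arc] += sign       # add a unit on (+) arcs, remove it on (-) arcs
        node = previous
    return True

# Hypothetical unit-capacity network (Node 0 = source, Node 6 = sink).
arcs = [(0, 1), (0, 3), (0, 4), (0, 5), (1, 2), (1, 3),
        (2, 6), (3, 5), (3, 6), (4, 5), (5, 6)]
flow = {arc: 0 for arc in arcs}
# Assumed initial routing of two units: 0-1-3-5-6 and 0-3-6.
for arc in [(0, 1), (1, 3), (3, 5), (5, 6), (0, 3), (3, 6)]:
    flow[arc] = 1

while augment_once(arcs, flow, source=0, sink=6):
    pass
total_flow = sum(f for (a, _), f in flow.items() if a == 0)
print(total_flow)  # prints 3
```

On this assumed network the single augmentation adds flow on arcs (0, 5), (1, 2), and (2, 6) and removes it from arcs (3, 5) and (1, 3), after which no further breakthrough is possible.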

The method is carried out for the example in Fig. I.1. The initial routing has 
two units of flow. 

There are several ways to proceed. Follow the one below by making light 
pencil marks on Fig. I.1. 

(i) Scan Node 0: put + on arcs (0, 4) and (0, 5) and label Nodes 4 and 5 
with (✓). Mark Node 0 with (✗). 

(ii) Scan Node 5: put − on arc (3, 5) and label Node 3 with (✓). Mark 
Node 5 with (✗). 

(iii) Scan Node 3: put − on arc (1, 3) and label Node 1 with (✓). Mark 
Node 3 with (✗). 

(iv) Scan Node 1: put + on arc (1, 2) and label Node 2 with (✓). Mark 
Node 1 with (✗). 

(v) Scan Node 2: put + on arc (2, 6) and label Node 6 with (✓). A flow-
augmenting path has been found. 



Figure I.2. Scanning Process. (Flow = 2, routed as two one-unit paths.) 

Check your work with the result in Fig. I.2. The solution therefore improves if 
you 

(i) Add a unit of flow on the arcs (2, 6) and (1, 2). 

(ii) Remove a unit of flow from arcs (1, 3) and (3, 5). 

(iii) Add a unit of flow on arc (0, 5). 

The revised solution is given in Fig. I.3. 

Verify that the flow is now maximal by repeating the steps of the algorithm. 
The sequence of nodes scanned will be Nodes 0, 4, and 5. It will not be possible 
to label any other node. 

To remove the arc capacity restriction (6), the algorithm is modified in two 
respects. Put a (+) on every outward arc with less than capacity flow in the 
scanning process. Then, when a flow-augmenting path has been found, route 
as much flow as possible on the path, taking into account the amount of unused 
capacity on each (+) arc and the current level of flow on each (—) arc. Thus 
for the maximum flow problem (1) through (5) the algorithm now terminates 
according to the following stopping rule. 

Figure I.3. Maximum Flow Solution. (Flow = 3, routed as three one-unit paths.) 

Stopping Rule. Every arc from a spanned node to an unlabeled node is at 
its full capacity and every arc from an unlabeled node to a spanned node is 
at zero flow. 

At each iteration, Step 3 results either in one or more units of increased flow or 
in termination; therefore the algorithm is finite, since the maximum possible 
flow is bounded. It remains only to show that when the algorithm terminates, the 
solution in fact is optimal. To do this, partition all the nodes into two classes, 
say, $C_0$ and $C_{p+1}$. Put Node 0 in $C_0$ and Node p + 1 in $C_{p+1}$. Such a separation is 
called a cut, and we define the cut capacity as the sum of all the $U_{ij}$ such 
that Node i is contained in $C_0$ and Node j in $C_{p+1}$. 

The cut capacity for any partition provides a limit on the maximal value of 
flow possible. (If the cut capacity equals 0, then Node 0 literally is cut off from 
Node p + 1 and no flow is possible between source and sink.) Consequently, if 
a feasible flow equals any cut capacity, the flow must be optimal. Furthermore, 
because of the conservation of flow restrictions (2), (3), and (4), the value of F 
in any feasible routing must equal the sum of the flows along all arcs (i, j) minus 
the sum of the flows along all arcs (j, i), where Node i is contained in $C_0$ and 
Node j in $C_{p+1}$. 
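
Written out, the two observations of this paragraph combine into a single chain of relations. The display below is a sketch in the notation of (1) through (5); the symbol $K$ for the cut capacity is introduced here only for convenience.

$$
F \;=\; \sum_{i \in C_0,\; j \in C_{p+1}} x_{ij} \;-\; \sum_{i \in C_0,\; j \in C_{p+1}} x_{ji}
\;\le\; \sum_{i \in C_0,\; j \in C_{p+1}} x_{ij}
\;\le\; \sum_{i \in C_0,\; j \in C_{p+1}} U_{ij} \;=\; K .
$$

The first inequality holds because every $x_{ji} \ge 0$, and the second because of (5). Equality holds throughout exactly when every arc from $C_0$ to $C_{p+1}$ is at capacity and every arc from $C_{p+1}$ to $C_0$ is flowless.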

With the above observations, the optimality proof is at hand. Consider the 
feasible solution when the algorithm terminates. Define $C_0$ to be all the spanned 
nodes, and $C_{p+1}$ the remaining nodes. According to the Stopping Rule, there is 
no flow from any node in $C_{p+1}$ to any node in $C_0$, so the total flow equals the sum 
of the flows on all arcs from nodes in $C_0$ to nodes in $C_{p+1}$. All of these arcs contain 
flows at capacity levels; therefore the total flow in the solution equals the cut 
capacity, and no further improvement is possible. 

The preceding argument is summarized by the fundamental result below. 

Max Flow/Min Cut Theorem: The maximum flow F in the network structure 
(2) through (5) is equal to the minimal cut capacity relative to the source 
and sink. 

A corollary result is that the algorithm results in integral values for all $x_{ij}$. 
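
The general-capacity version of the algorithm, together with the cut used in the optimality proof, can be sketched as follows. This is an illustrative sketch, not the text's procedure in full detail: the function name `max_flow_with_cut`, the five-arc network, and its capacities are all hypothetical. At termination the labeled nodes play the role of $C_0$, so the returned flow value and cut capacity should agree, and every arc flow stays integral because only integers are added or subtracted.

```python
from collections import deque

def max_flow_with_cut(capacity, source, sink):
    """Labeling method for integer capacities U_ij given as {(i, j): U_ij}.

    Returns (flow value, cut capacity, spanned nodes, arc flows).
    """
    flow = {arc: 0 for arc in capacity}
    while True:
        # label[node] = (labeled from, arc used, sign, units that can reach node)
        label = {source: (None, None, 0, float("inf"))}
        to_span = deque([source])
        while to_span and sink not in label:
            j = to_span.popleft()
            reach = label[j][3]
            for (a, b), u in capacity.items():
                if a == j and b not in label and flow[(a, b)] < u:
                    # (+) arc: limited by its unused capacity
                    label[b] = (j, (a, b), +1, min(reach, u - flow[(a, b)]))
                    to_span.append(b)
                elif b == j and a not in label and flow[(a, b)] > 0:
                    # (-) arc: limited by its current flow
                    label[a] = (j, (a, b), -1, min(reach, flow[(a, b)]))
                    to_span.append(a)
        if sink not in label:
            break                      # the Stopping Rule is satisfied
        amount = label[sink][3]        # route as much flow as the path allows
        node = sink
        while node != source:
            previous, arc, sign, _ = label[node]
            flow[arc] += sign * amount
            node = previous
    spanned = set(label)               # all labeled nodes are spanned at termination
    value = (sum(f for (a, b), f in flow.items() if a == source)
             - sum(f for (a, b), f in flow.items() if b == source))
    cut = sum(u for (a, b), u in capacity.items()
              if a in spanned and b not in spanned)
    return value, cut, spanned, flow

# Hypothetical capacitated network.
capacity = {("s", "a"): 4, ("s", "b"): 3, ("a", "b"): 2,
            ("a", "t"): 2, ("b", "t"): 5}
value, cut, spanned, flow = max_flow_with_cut(capacity, "s", "t")
print(value, cut)  # prints 7 7
```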

I.2 SOLUTION OF THE ASSIGNMENT MODEL 

Recall the assignment model, discussed in detail in Sec. 6.4: 

(1)  minimize $\sum_{i=1}^{n} \sum_{j=1}^{n} c_{ij} x_{ij}$

subject to 

(2)  $\sum_{j=1}^{n} x_{ij} = 1$  for $i = 1, 2, \ldots, n$

(3)  $\sum_{i=1}^{n} x_{ij} = 1$  for $j = 1, 2, \ldots, n$

(4)  $x_{ij} = 0$ or $1$  for all $i$ and $j$. 


By the nature of the problem, a feasible solution contains only $n$ variables equal 
to 1, whereas a basic solution includes $n + n - 1$ variables. Consequently, when 
the simplex algorithm for networks is applied to the assignment model (1) through 
(4), each basis contains $n - 1$ routes at zero level. This observation suggests that 
the special structure of the assignment model is not fully exploited by the simplex 
algorithm. In this section, three other approaches will be explained. 

The first makes use of the maximum flow problem of the preceding section. The 
second combines the principles of the maximum flow and shortest-route algorithms. 
The third demonstrates a further connection between the assignment problem 
and the shortest-route problem. The methods will be illustrated by an example, 
which is first solved by the simplex algorithm to provide a basis for comparison. 
As you will learn in Sec. I.3, the three approaches are of additional significance 
because they generalize to other network optimization models. 

Simplex algorithm. Consider the assignment problem shown in Fig. I.4. 
If you apply the procedure explained in the advanced material of Sec. 7.4 for 
calculating relative costs to assist in picking an initial basic solution, you obtain 
the amounts in Fig. I.5. A starting basis is exhibited in Fig. I.4. Notice that 
three routes (in the second row) are at zero level, indicating degeneracy. You 
can follow the iterations by examining Figs. I.6 through I.12, which contain the 
succession of improvement potentials and trial solutions. Observe that the value 
of the objective function does not change until the final solution. 

Figure I.6. Assignment Model Improvement Potentials for Initial Solution. 

Figure I.7. Assignment Model Second Basic Solution. 

Figure I.8. Assignment Model Improvement Potentials for Second Solution. 

Figure I.10. Assignment Model Improvement Potentials for Third Solution. 

Figure I.11. Assignment Model Optimal Basic Solution. (Total Cost = 54 − 2(1) = 52.) 


Maximum flow approach. An alternative method is based on the maximum 
flow algorithm in the preceding section. It is also motivated by a consideration 
close in spirit to the suggestion in the advanced material of Sec. 7.4 (p. 226) for 
finding a good initial solution. The technique, in bare outline, consists of two 
steps: 

Step 1. Given row constants $v_i$, for $i = 1, 2, \ldots, n$, and column constants $w_j$, 
for $j = 1, 2, \ldots, n$, yielding nonnegative relative costs $(c_{ij} - v_i - w_j) \ge 0$, 
determine whether a feasible solution exists using only routes with relative costs 
equal to 0. If so, stop, since the solution is optimal; otherwise go to Step 2. 

Step 2. Revise $v_i$ and $w_j$ such that at least one new route has relative cost equal 
to 0. Return to Step 1. 

The details of each step are explained with reference to the previous example. 
You can always begin Step 1 with the constants used by the method for selecting 
an initial solution suggested in Sec. 7.4. (In the illustration, the relative costs 
are the entries in Fig. I.5.) However, in order to compare this approach with 
the simplex algorithm, it is convenient to start with the values indicated in Fig. 
I.13. 

The maximum flow algorithm is employed in Step 1 to find whether there exists 
a feasible solution using only routes with 0 entries in Fig. I.13. Tabular short-cuts 
are available to carry out the procedure; but since the main purpose of this 
discussion is to make clear how the maximum flow model is of value, the exposition 
will not streamline the format to aid the calculations. 

Figure I.13. Assignment Model Relative Costs: $c_{ij} - v_i - w_j$. 

Figure I.14. Maximum Flow Algorithm Revised Relative Costs. 

A network flow diagram comprised only of the routes with relative cost equal 
to 0 in Fig. I.13 is constructed in Fig. I.15. The node designation $r_i$ corresponds 
to Row $i$ in Fig. I.13, and similarly $k_j$ to Column $j$. All arcs out of the source 
node and into the sink node have capacity 1, corresponding to the row and 
column constraints of the problem. 

If the maximum flow for the network were four, the corresponding routing would 
be an optimal assignment, since a unit of flow on arc $(r_i, k_j)$ would imply $x_{ij} = 1$. 
A trial solution with three units of flow is exhibited in Fig. I.15. The maximum 
flow algorithm is carried out in Fig. I.16 to demonstrate that flow cannot be 
augmented in this network. Therefore you must go to Step 2. 



Figure I.16. Scanning for Augmented Flow. 


The following rationale makes plausible a way to revise the $v_i$ and $w_j$. Since 
the flow in Fig. I.16 is maximal but less than four units, it is necessary to introduce 
at least one new arc $(r_i, k_j)$. Given the nature of the flow algorithm, it is 
reasonable to restrict attention to those arcs such that Node $r_i$ is labeled but 
Node $k_j$ is not. If such an arc is added, then the steps of the flow algorithm will 
continue and permit at least one more Node $k_j$ to be spanned. 

In altering the $v_i$ and $w_j$, however, you should be careful not to destroy the 
equality $c_{ij} - v_i - w_j = 0$ for routes now having flow, as well as for those marked 
with (+) in the spanning process. Otherwise, you may not be able to continue 
the scanning where you left off in Step 1 (Fig. I.16). Also, the relation 
$c_{ij} - v_i - w_j \ge 0$ must be preserved for all $i$ and $j$. A rule that achieves all these 
conditions is 

(5)  (i) Add $c$ to $v_i$ if Node $r_i$ is labeled, 
     (ii) Subtract $c$ from $w_j$ if Node $k_j$ is labeled, 

where 

(6)  $c$ = smallest relative cost for arcs between every 
     labeled Node $r_i$ and unlabeled Node $k_j$. 

In Fig. I.16, Nodes $r_3$ and $r_4$ are labeled, whereas Nodes $k_1$, $k_2$, and $k_3$ are not; 
therefore in Fig. I.13 examine the entries at the intersection of Rows 3 and 4 and 
Columns 1, 2, and 3 to obtain 

(7)  $c = \min(3, 4, 4, 2, 7, 7) = 2$, the relative cost of route (4, 1). 

The revised $v_i$ and $w_j$ are shown in Fig. I.14 along with the new relative costs. 
Observe that arc $(r_4, k_1)$ now has relative cost equal to 0, but arc $(r_1, k_4)$ has a 
positive relative cost. The associated network appears in Fig. I.17. The maximum 
flow algorithm continues from Fig. I.16 and, in the process, labels Nodes 
$k_1$, $r_1$, $k_2$, $r_2$, $k_3$, and finally the sink. The (+) and (−) signs on the arcs indicate 
the flow-augmenting path that produces the optimal solution in Fig. I.18. 
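
The revision rule (5) and (6) is easy to check numerically. The sketch below is a hedged illustration: the function name `revise_constants` is made up here, rows and columns are indexed from 0, and the relative-cost table `fig_13` is an assumption. Its last two rows are the six entries quoted in (7) plus the zero-cost routes into Column 4; its first two rows are inferred from the zero-cost routes described in the text and are not copied from the figure itself.

```python
def revise_constants(relative_cost, labeled_rows, labeled_cols):
    """Apply rule (5)-(6) to a table of relative costs c_ij - v_i - w_j.

    Returns (c, revised relative costs).
    """
    n = len(relative_cost)
    # (6): smallest relative cost between a labeled row and an unlabeled column
    c = min(relative_cost[i][j]
            for i in labeled_rows
            for j in range(n) if j not in labeled_cols)
    revised = [[relative_cost[i][j]
                - (c if i in labeled_rows else 0)   # (5)(i): c was added to v_i
                + (c if j in labeled_cols else 0)   # (5)(ii): c was subtracted from w_j
                for j in range(n)]
               for i in range(n)]
    return c, revised

# Assumed stand-in for Fig. I.13 (rows 1-2 inferred, rows 3-4 from (7)).
fig_13 = [[0, 0, 3, 0],
          [0, 0, 0, 1],
          [3, 4, 4, 0],
          [2, 7, 7, 0]]
# Nodes r3 and r4 are labeled, and of the columns only k4 is labeled.
c, revised = revise_constants(fig_13, labeled_rows={2, 3}, labeled_cols={3})
print(c)        # prints 2
print(revised)  # prints [[0, 0, 3, 2], [0, 0, 0, 3], [1, 2, 2, 0], [0, 5, 5, 0]]
```

In the revised table the route from Row 4 to Column 1 has dropped to 0 and the route from Row 1 to Column 4 has become positive, while every entry stays nonnegative.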



We now recapitulate the algorithm. Starting with trial values for $v_i$ and $w_j$ in 
Step 1, apply the maximum flow algorithm to the associated network. If the 
resultant total flow equals $n$, the solution is optimal and you stop. Otherwise, 
proceed to Step 2, where you revise the $v_i$ and $w_j$ according to the rule in (5) and 
(6). Return to Step 1 with the resultant, somewhat altered, network: at least one 
new arc will have been added (where it is assumed the corresponding relative cost 
equals the value $c$), and some unused arcs may have been dropped. Restart the maximum 
flow algorithm with the previous flow pattern. Again check for a total flow of 
$n$, and so forth. 
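
The recapitulated procedure can be sketched compactly in Python. This is an illustration under several assumptions, not the text's tabular layout: the function name `assign_by_max_flow` is invented here, labeling starts from every row that still lacks flow, the starting constants are the row minima followed by the column minima of what remains, and the costs are taken to be integers so that the test for a zero relative cost is exact. The 4 × 4 cost matrix is also an assumption: the original table is not legible in this copy, and the values below were reconstructed so as to agree with the relative costs of Fig. I.23 and with the totals 28 and 52 quoted in the text.

```python
def assign_by_max_flow(cost):
    """Steps 1 and 2 for an n x n integer cost matrix.

    Returns (col_of, total), where col_of[i] is the column assigned to row i.
    """
    n = len(cost)
    v = [min(row) for row in cost]                                    # row constants
    w = [min(cost[i][j] - v[i] for i in range(n)) for j in range(n)]  # column constants
    col_of = [None] * n      # col_of[i]: column carrying row i's unit of flow
    row_of = [None] * n      # row_of[j]: row routed into column j
    while None in col_of:
        # Step 1: label outward from every row that still lacks flow.
        labeled_rows = {i for i in range(n) if col_of[i] is None}
        labeled_from = {}    # labeled column -> row it was labeled from
        to_span = list(labeled_rows)
        breakthrough = None
        while breakthrough is None:
            while to_span and breakthrough is None:
                i = to_span.pop()
                for j in range(n):
                    if j in labeled_from or cost[i][j] - v[i] - w[j] != 0:
                        continue
                    labeled_from[j] = i            # (+) on a zero-cost arc (r_i, k_j)
                    if row_of[j] is None:
                        breakthrough = j           # column j can reach the sink
                        break
                    labeled_rows.add(row_of[j])    # (-) on the arc with flow into k_j
                    to_span.append(row_of[j])
            if breakthrough is None:
                # Step 2: rule (5)-(6), then resume scanning the labeled rows.
                c = min(cost[i][j] - v[i] - w[j]
                        for i in labeled_rows
                        for j in range(n) if j not in labeled_from)
                for i in labeled_rows:
                    v[i] += c
                for j in labeled_from:
                    w[j] -= c
                to_span = list(labeled_rows)
        # Breakthrough: reroute flow along the augmenting path.
        j = breakthrough
        while j is not None:
            i = labeled_from[j]
            previous_col = col_of[i]
            row_of[j], col_of[i] = i, j
            j = previous_col
    return col_of, sum(cost[i][col_of[i]] for i in range(n))

# Assumed reconstruction of the example's costs (rows and columns indexed from 0).
cost = [[2, 10, 15, 0],
        [10, 18, 20, 9],
        [15, 24, 26, 10],
        [12, 25, 27, 8]]
col_of, total = assign_by_max_flow(cost)
print(col_of, total)  # prints [1, 2, 3, 0] 52
```

With these assumed costs the result matches the routing described in the text: Row 1 to Column 2, Row 2 to Column 3, Row 3 to Column 4, and Row 4 to Column 1, for a total cost of 52.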

Proof that the solution at termination is optimal is exactly the same as that 
given for the simplex method applied to the transportation model. Convergence 
in a finite number of iterations is established by noting that 

(i) Each time Step 2 occurs, at least one more node is spanned in the subsequent 
Step 1. Since the number of nodes is finite, Step 2 eventually results in a 
breakthrough at the succeeding Step 1. 

(ii) Only a finite number of breakthroughs can occur, because each results in 
increased flow, and total possible flow is bounded. 

An implication of the convergence proof is that no more than $.5(n^2 + 3n - 2)$ 
applications of Step 2 are required. This bound is typically far in excess of what 
actually occurs; however, you should note that the bound is considerably smaller 
than which is the simplex algorithm bound calculated in terms of the 

number of basic solutions possible. 

The network flow approach has a certain resemblance to the dual simplex 
method in that the dual constraints are satisfied at every iteration, but a feasible 
solution is not obtained until termination. In sharp contrast to both the standard 
and dual simplex methods, the network approach does not maintain a basic 
solution. 

Minimal cost/maximum flow approach. To initiate Step 1 of the 
previous approach, you had to find constants $v_i$ and $w_j$ such that all the relative 
costs $(c_{ij} - v_i - w_j)$ were nonnegative. You can assume without loss of generality 
that all $c_{ij} \ge 0$, since you can always add a positive constant $c^*$ to every cost 
element. Then you can begin Step 1 in the previous approach by letting 
$v_i = w_j = 0$. If you do so, the sequence of solutions turns out to be a minimal cost routing 
among all routings with the corresponding amount of flow. This point of view 
leads to another statement of the algorithm that provides a helpful insight in 
generalizing the method to more complex network problems. The idea is to 
increase total flow in such a way that the routing for each higher level of flow 
incurs minimal total cost. Thus, if F units of flow have been so routed, the method 
seeks a flow-augmenting path with least cost, and increases flow on this path. 
The technique for finding the path employs the shortest-route algorithm. 

The approach is summarized below: 

Step 1. Construct a new network based on the current solution as follows. 
Include each arc in the original network that currently is flowless, and let $c_{ij}$ be 
the arc’s path length. If flow occurs between Nodes $r_i$ and $k_j$, add arc $(k_j, r_i)$ and 
let its path length be $-c_{ij}$. It is possible to augment flow in the original network 
on any path between source and sink of the new network. 

Step 2. Find a shortest path from the source to the sink in the new network. 
In the usual manner, increase flow on this path in the original network. If all 
the assignment model’s constraints are satisfied, stop; otherwise, return to Step 1. 
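
The two steps can be sketched as follows. This is an illustration under stated assumptions rather than the text's own method in detail: the function name `assign_by_shortest_paths` and the node names are invented here, the arcs out of the source and into the sink are given length 0, and a simple Bellman-Ford relaxation stands in for the shortest-route algorithm of Sec. 7.6 because the new network contains negative path lengths. The cost matrix is the same assumed reconstruction of the example, chosen to agree with the totals 28, 24, and 52 quoted in the text.

```python
def assign_by_shortest_paths(cost):
    """Route one more unit at a time along a shortest path (Steps 1 and 2).

    Returns (col_of, lengths): the assignment and the length of each path used.
    """
    n = len(cost)
    col_of = [None] * n      # col_of[i]: column currently assigned to row i
    row_of = [None] * n      # row_of[j]: row currently assigned to column j
    lengths = []
    for _unit in range(n):
        # Step 1: build the new network from the current solution.
        arcs = []
        for i in range(n):
            if col_of[i] is None:
                arcs.append(("s", ("r", i), 0))                    # flowless source arc
            for j in range(n):
                if col_of[i] == j:
                    arcs.append((("k", j), ("r", i), -cost[i][j]))  # reversed arc
                else:
                    arcs.append((("r", i), ("k", j), cost[i][j]))   # flowless arc
        for j in range(n):
            if row_of[j] is None:
                arcs.append((("k", j), "t", 0))                    # flowless sink arc
        # Step 2: shortest path from source to sink (Bellman-Ford relaxation).
        dist, pred = {"s": 0}, {}
        for _pass in range(2 * n + 1):
            for a, b, length in arcs:
                if a in dist and dist[a] + length < dist.get(b, float("inf")):
                    dist[b], pred[b] = dist[a] + length, a
        lengths.append(dist["t"])
        # Increase flow on the path, traced back from the sink.
        k = pred["t"]
        while k != "s":
            r = pred[k]                          # arc (r, k) gains a unit of flow
            row_of[k[1]], col_of[r[1]] = r[1], k[1]
            k = pred[r]                          # "s", or a reversed arc that loses its unit
    return col_of, lengths

# Assumed reconstruction of the example's costs (rows and columns indexed from 0).
cost = [[2, 10, 15, 0],
        [10, 18, 20, 9],
        [15, 24, 26, 10],
        [12, 25, 27, 8]]
col_of, lengths = assign_by_shortest_paths(cost)
print(col_of, lengths[-1], sum(lengths))
```

Because each path is a least-cost augmentation, the sum of the path lengths equals the total cost of the final assignment, which is the accounting the text uses when it adds the last path length to the cost of the three-unit routing.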




Figure I.20. Shortest Route. 

The approach is illustrated for the example in Fig. I.4. Assume you have 
applied the algorithm to obtain the minimal cost routing for three units of flow. 
The solution is the same as that in Fig. I.15, and has a cost of 28 (= 2 + 18 + 8). 

The new network constructed according to Step 1 appears in Fig. I.19. Notice 
the source is connected only to Node $r_3$ and the sink to Node $k_3$, since the other 
such arcs are at capacity flow. The network contains arcs $(k_1, r_1)$, $(k_2, r_2)$, and 
$(k_4, r_4)$ because the current solution has flow in the opposite direction. The remaining 
arcs have zero flow in the current solution. 

The shortest-route algorithm of Sec. 7.6 is applied to the network and yields 
the result in Fig. I.20. Sending a unit of flow along the indicated path produces 
the optimal solution in agreement with Fig. I.18. Notice that a unit routed on 
arc $(k_4, r_4)$ implies that flow is to be removed from arc $(r_4, k_4)$ in the original 
network. Since the shortest path has length 24, the final solution has total cost 
of 52 (= 28 + 24). The length of the shortest routes from each Node $r_i$ is an 
optimal value for $-v_i$, and similarly, the length from each Node $k_j$ is an optimal 
value for $w_j$. 

Shortest-route approach. In Sec. 6.5, you discovered how to convert 
a shortest-route problem into the format of an assignment model. In this chapter, 
you learned an efficient method for solving the shortest-route problem without 
making such a conversion. Now you will see how to solve the assignment model by 
putting it into the form of a shortest-route problem and then applying the shortest- 
route algorithm in Sec. 7.6. 

The general idea is to solve a 1 × 1 assignment model, which is a trivial task. 
Then use the answer to solve a 2 × 2 model, and continue in the same fashion, 
adding one more row and column at each iteration until the n × n solution is 
obtained. Given the solution for a k × k problem, any remaining row and any 
remaining column can be chosen to form the (k + 1) × (k + 1) problem. 

As before, the technique is explained in terms of the example in Fig. I.4. 
Suppose you have solved the 3 × 3 problem consisting of Rows 1, 2, and 4 and 
Columns 1, 2, and 4 of Fig. I.4. For convenience, the array is displayed in Fig. 
I.21, and the circled $c_{ij}$ correspond to the optimal solution for that table. Notice 
that the solution is the same as that in Fig. I.15. 

Figure I.21. 3 × 3 Assignment Problem. 

Figure I.22. 4 × 4 Assignment Problem. 

To form the 4 × 4 problem, add the remaining third row and column of 
Fig. I.4, which gives Fig. I.22. Then calculate relative costs $(c_{ij} - v_i)$, where 
$v_i = 0$ for the new row and the other $v_i$ are the values of $c_{ij}$ for the optimal routes 
in the 3 × 3 problem. The result appears in Fig. I.23. Notice the table has the 
same appearance as that for a shortest-route problem, such as Fig. 6.9 on p. 179. 
The network is drawn in Fig. I.24, using the node designations appearing in Fig. 
I.23. 

                To Node 
From Node      1     2     3     4 
    0         15    24    10    26 
    1          0     8    -2    13 
    2         -8     0    -9     2 
    3          4    17     0    19 

Figure I.23. Relative Costs $c_{ij} - v_i$. 




Figure I.24. Shortest Route Network for 4 × 4 Assignment Problem. 

Apply the shortest-route algorithm to find a best path from Node 0 to Node 4 
in Fig. I.24. The solution, exhibited in Fig. I.25, corresponds to the optimal 
solution to the original assignment problem. 



The only subtlety in demonstrating the validity of the approach is to establish 
that when the shortest-route network is drawn, the sum of the path lengths around 
every loop is nonnegative. Recall that such a condition is required for the shortest-
route algorithm. Observe in Fig. I.24 that the only loops possible involve Nodes 
1, 2, and 3, which correspond to the rows and columns considered in the 3 × 3 
problem. As usual, the optimal solution to the 3 × 3 problem must remain optimal 
in terms of the relative costs in Fig. I.23. A loop among the Nodes 1, 2, and 3 is 
tantamount to a particular solution to the 3 × 3 problem; therefore no loop can 
have a negative cost, since the optimal 3 × 3 solution has zero cost in Fig. I.23. 

1.3 ALGORITHMS FOR OTHER NETWORK MODELS 

The techniques in the preceding section generalize readily to other network
optimization problems.

The only alteration needed to adapt the maximum flow approach to handle the
standard transportation problem in Sec. 6.2 is to let the arc capacity from the
source to Node $r_i$ be $S_i$ and similarly the arc capacity from Node $k_j$ to the sink
be $D_j$. For this problem, the number of times Step 2 is executed cannot exceed

(1) 2 ‘S', + (2 — l)-[minimum (m, n) + 1]. 

Analogously, the only essential change required in the minimal cost/maximum flow
approach to cover the general network optimization model (1) through (4) in Sec.
6.8 (pp. 192-193) is to revise Step 1 so as to include each arc in the original
network that at present has flow below capacity. (The statement concerning
Nodes $r_i$ and $k_j$ applies to any Nodes i and j in the network.) As the advanced
material at the end of Sec. 6.8 indicated, it may be necessary to transform the
model initially so that there is only a single source and sink.

The shortest-route approach also can be extended to treat the standard
transportation problem, which is not surprising since such a problem is
conceptually equivalent to an assignment model.
APPENDIX II


Probabilistic
Inventory Models with
Periodic Review


II.1 BACKGROUND SUMMARY

This chapter deals with dynamic single-item inventory models in which a 
reorder decision is allowed only at fixed intervals of time, such as every 10 days, 
or on each Monday. (Recall that the dynamic probabilistic model in Sec. 19.6 
permitted a replenishment action to occur at any instant.) There are two main 
reasons for studying periodic (or discrete) review models: they are much more 
amenable to rigorous analysis than are continuous review models; and the 
theoretical results, which often do not require stationarity assumptions, lead to 
computable formulas. Both advantages will be illustrated in the sections to follow. 
In an actual application, if stationarity is assumed and the horizon is unbounded, 
then the selection of a continuous or a discrete review model rests primarily on 
which better represents reality. (As you will find in Sec. II.6, the corresponding 
numerical calculations to find an approximately optimal policy for a periodic 
review model are as simple as those in Sec. 19.6.) 

You have already seen several examples of periodic review inventory models.
Section 8.3 was devoted to a situation with deterministic demand and periodic
reviews occurring over a finite horizon; the analysis was extended to the case of
an unbounded horizon in Sec. 12.5, and to probabilistic demand in Secs. 17.4,
18.3, and 18.4. Chapter 9 demonstrated that if you are willing to make special
cost assumptions, then you can derive the form of an optimal policy for
deterministic demand, and thereby simplify the numerical optimization process. Now
you will consider the analogous simplifications for the case of probabilistic
demand.
II.2 TIMING AND UNFILLED DEMAND

Let the planning horizon comprise a finite number of periods. Assume that the
integer $\lambda$ (lambda) represents the delivery lag, so that an order placed in
Period t arrives in Period $t + \lambda$. Further suppose the sequence of events within
any Period t is

(i) ordering decision,

(ii) delivery of any order due in from Period $t - \lambda$,

(iii) customer demand,

and let

$p_t(q)$ = probability that demand is q in Period t (where $q \ge 0$).

Assume that demand in Period t is integer valued and independent of all previous
demands.

It is meaningful to permit the value $\lambda = 0$, which is the case of immediate
delivery. Observe the following difference between the continuous and discrete
review models. When the value of the lag L = 0 in the continuous review model
of Sec. 19.6, the analysis simplifies to the economic lot-size calculation in Sec. 19.5,
since there is no need to protect against uncertain demand occurring over an
interval of time. But in the periodic review model, even if $\lambda = 0$, one period's
random demand occurs prior to the next opportunity to reorder. In general, a
value for $\lambda$ in the periodic review model corresponds to a value ($L = \lambda + 1$)
in the continuous review model.

Let

$i_t$ = level of inventory on hand at the start of Period t
$j_t$ = level of inventory on hand plus on order at the start of Period t
prior to the ordering decision
$x_t$ = amount ordered in Period t

$k_t = i_t + x_{t-\lambda}$ = level of inventory on hand after any delivery
but before demand

$y_t = j_t + x_t$ = level of inventory on hand plus on order in Period t
after the ordering decision but before demand.

Then

(1) $j_t = i_t$ for $\lambda = 0$,

    $j_t = i_t + x_{t-\lambda} + \cdots + x_{t-1}$ for $\lambda > 0$.

Figure II.1 displays the timing relationships among these quantities.

If $\lambda = 0$, then $j_t = i_t$ and hence $k_t = y_t$ is the total inventory available to meet
demand in Period t. Consequently, the decision $x_t$ has an immediate effect on the
value of $k_t$.

If $\lambda > 0$, the situation is different, for then $x_t$ will not affect the inventory
position until Period $t + \lambda$, when the order arrives. By definition,
$k_{t+\lambda} = i_{t+\lambda} + x_t$, but the value of $i_{t+\lambda}$ depends in part on the next $\lambda$ demands
$q_t, q_{t+1}, \ldots, q_{t+\lambda-1}$. Let us see how.

Figure II.1. Time Sequence and Inventory Quantities. (Within Period t the
events are order, delivery, and demand; the figure marks inventory on hand and
inventory on hand plus on order, $y_t$.)



Backlog case. If all unfilled demand is backlogged and eventually satisfied
by future deliveries, then

(2) $k_{t+\lambda} = (i_t + x_{t-\lambda} + \cdots + x_{t-1} + x_t) - (q_t + \cdots + q_{t+\lambda-1})$

    $= y_t - (q_t + \cdots + q_{t+\lambda-1})$ (complete backlog: $\lambda > 0$).

Explain why, using the definitions of $i_{t+\lambda}$, $j_t$, and $y_t$. When $k_{t+\lambda}$ is negative, there
is a corresponding backlog of demand at Period $t + \lambda$, prior to demand occurring
in Period $t + \lambda$. Be sure you see that (2) implies $k_{t+\lambda}$ is completely determined by
the ordering decision in Period t and the cumulative demand over $\lambda$ successive
periods, starting with Period t.

Given the probability distributions $p_t(q)$, you can easily obtain the probability
distribution for cumulative demand over $\lambda$ successive periods. Let

(3) $p_{t,t+\lambda-1}(Q)$ = probability that the cumulative demand
    $(q_t + \cdots + q_{t+\lambda-1})$ is Q.

Then, because of (2), $p_{t,t+\lambda-1}(Q)$ also represents the probability that $k_{t+\lambda}$ has the
value $(y_t - Q)$. You can compute $p_{t,t+\lambda-1}(Q)$ recursively by a convolution
calculation,

(4) $p_{t,u+1}(Q) = \sum_{q=0}^{Q} p_{u+1}(q)\,p_{t,u}(Q - q)$ for $Q = 0, 1, \ldots$,

where $p_{t,t} = p_t$, and (4) is applied successively for $u = t, \ldots, t + \lambda - 1$.

For many standard probability distributions, you can derive a formula to
obtain $p_{t,t+\lambda-1}(Q)$ without having to perform all the recursive numerical
calculations in (4). For example, if $p_t(q)$ is a Poisson distribution with mean
$E[q_t] = m_t$:

(5) $p_t(q) = \dfrac{e^{-m_t}\, m_t^{\,q}}{q!}$ for $q = 0, 1, 2, \ldots$,

then the distribution for cumulative demand is also Poisson with mean
($M = m_t + \cdots + m_{t+\lambda-1}$):

(6) $p_{t,t+\lambda-1}(Q) = \dfrac{e^{-M}\, M^{Q}}{Q!}$ for $Q = 0, 1, 2, \ldots$.

An important special case of (3) and (4) arises if $p_t(q)$ is the same distribution
for all t,

(7) $p_t(q) = p(q)$;

then

(8) $p^{\lambda}(Q) \equiv p_{t,t+\lambda-1}(Q) = \sum_{q=0}^{Q} p(q)\,p^{\lambda-1}(Q - q)$,

where $p^{\lambda}(Q)$ is called the $\lambda$-fold convolution of $p(q)$. To illustrate, suppose $p(q)$ is
a negative binomial distribution:

(9) $p(q) = \binom{r+q-1}{q}(1 - \rho)^{r}\rho^{q}$ for $q = 0, 1, 2, \ldots$,

where $E[q] = r\rho/(1 - \rho)$, $0 < \rho < 1$, and r is a positive integer. Then it can be
shown that the $\lambda$-fold convolution is also a negative binomial distribution

(10) $p^{\lambda}(Q) = \binom{\lambda r+Q-1}{Q}(1 - \rho)^{\lambda r}\rho^{Q}$ for $Q = 0, 1, 2, \ldots$,

where $E[Q] = \lambda r\rho/(1 - \rho)$.

As another example of $\lambda$-fold convolutions, consider the probability distribution

(11) $p(2) = .2$   $p(3) = .8$.

Then the formula for 2-fold convolution (8) is

(12) $p^{2}(Q) = \sum_{q=0}^{Q} p(q)\,p(Q - q)$,

so that

     $p^{2}(4) = p(2)p(2) = .04$
(13) $p^{2}(5) = p(2)p(3) + p(3)p(2) = .32$
     $p^{2}(6) = p(3)p(3) = .64$.

And continuing to $\lambda = 3$, you have

(14) $p^{3}(Q) = \sum_{q=0}^{Q} p(q)\,p^{2}(Q - q)$,

so that

     $p^{3}(6) = p(2)p^{2}(4) = .008$
(15) $p^{3}(7) = p(2)p^{2}(5) + p(3)p^{2}(4) = .096$
     $p^{3}(8) = p(2)p^{2}(6) + p(3)p^{2}(5) = .384$
     $p^{3}(9) = p(3)p^{2}(6) = .512$.
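
The convolution recursion is easy to mechanize. The following Python sketch
recomputes the 2-fold and 3-fold convolutions of the distribution with
p(2) = .2 and p(3) = .8; the function names `convolve` and `fold` are
illustrative choices, not notation from the text.

```python
from collections import defaultdict


def convolve(p, r):
    """Distribution of the sum of two independent demands.

    p and r are dicts mapping a demand value to its probability.
    """
    out = defaultdict(float)
    for q1, a in p.items():
        for q2, b in r.items():
            out[q1 + q2] += a * b
    return dict(out)


def fold(p, lam):
    """lam-fold convolution of a stationary distribution p (lam >= 1)."""
    result = dict(p)
    for _ in range(lam - 1):
        result = convolve(p, result)
    return result


p = {2: 0.2, 3: 0.8}
p2 = fold(p, 2)
p3 = fold(p, 3)
for Q in sorted(p3):
    print(Q, round(p3[Q], 3))
```

The printed values can be compared directly with the hand calculation for
cumulative demand over three periods.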

Lost demand case. Now suppose that $\lambda > 0$, but all unfilled demand is
permanently lost; then the formula for $k_{t+\lambda}$ becomes complex. To see why, consider
the case $\lambda = 2$. Specifically, verify that

     $k_{t+2} = \max(k_{t+1} - q_{t+1},\, 0) + x_t$
(16)         $= \max[\max(k_t - q_t,\, 0) + x_{t-1} - q_{t+1},\, 0] + x_t$
             $= \max[\max(i_t + x_{t-2} - q_t,\, 0) + x_{t-1} - q_{t+1},\, 0] + x_t$,

which is not always the same as the value in (2). In particular, note that $k_{t+2}$
does not depend simply on $y_t$ but on the separate components $(i_t + x_{t-2})$, $x_{t-1}$,
and $x_t$, and similarly does not depend on the cumulative demand, but on the individual
demands $(q_t, q_{t+1})$. As a consequence, the probability distribution for $k_{t+\lambda}$ given
the value for $y_t$ is not directly related to $p_{t,t+\lambda-1}(Q)$.
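
A small numerical check makes the distinction concrete. The Python sketch
below evaluates the backlog formula and the lost-demand formula for a lag of
two periods; the function names and the sample quantities are hypothetical and
chosen only for illustration.

```python
def k_backlog(i_t, x_tm2, x_tm1, x_t, q_t, q_tp1):
    """Stock available two periods ahead when unfilled demand is backlogged."""
    y_t = i_t + x_tm2 + x_tm1 + x_t
    return y_t - (q_t + q_tp1)


def k_lost(i_t, x_tm2, x_tm1, x_t, q_t, q_tp1):
    """Stock available two periods ahead when unfilled demand is lost."""
    i_tp1 = max(i_t + x_tm2 - q_t, 0)       # on hand at start of Period t + 1
    i_tp2 = max(i_tp1 + x_tm1 - q_tp1, 0)   # on hand at start of Period t + 2
    return i_tp2 + x_t


# Same y_t = 7 and same cumulative demand of 4, split two different ways.
stock = dict(i_t=1, x_tm2=0, x_tm1=4, x_t=2)
for demands in [(3, 1), (1, 3)]:
    print(demands,
          k_backlog(**stock, q_t=demands[0], q_tp1=demands[1]),
          k_lost(**stock, q_t=demands[0], q_tp1=demands[1]))
```

With backlogging both demand patterns give the same level, whereas with lost
demand the level depends on how the cumulative demand is split between the two
periods.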

To avoid the intricacies associated with this more complex situation, we
postulate from here on that all unfilled demand is completely backlogged. If the penalty cost
is relatively large, the resultant policy will be approximately optimal when unfilled
demand is in fact lost. (When $\lambda = 0$, the lost sales case is nearly as easy to solve
as the backlogged demand case, and the results will be indicated in the special
material.)


II.3 FINITE HORIZON MODEL 


Assume that the objective function is the discounted sum of each period's
expected purchasing, inventory holding, and shortage costs. Specifically, let

(1) $c_t(x)$ = cost of ordering the amount x in Period t,

and suppose that this cost is actually incurred when the order is delivered in
Period $t + \lambda$. Also, let

(2) $H_t(k, q)$ = inventory holding and shortage penalty cost incurred in
Period t when k units are available for filling the demand q
in Period t.

For example, $H_t(k, q)$ may be of the form

(3) $H_t(k, q) = h_t(k - q)$ for $k - q \ge 0$,

    $H_t(k, q) = \pi_t(q - k)$ for $k - q < 0$.

An important special case of (3) is when the two functions on the right of (3) are
linear. Note that when k is negative, indicating a backlog condition before the
demand q occurs, then the amount $(q - k)$ on which the penalty cost is assessed
represents the sum of demand and the backlog. Hence the penalty cost is charged
in each period that a unit of demand is backlogged, and not in just the period
when the backlog first occurs. Recall when $\lambda = 0$, $k_t = y_t$ so that

(4) [expected holding and penalty cost in Period t given $y_t = y$]

    $\equiv L_t(y) = \sum_{q=0}^{\infty} H_t(y, q)\,p_t(q)$   ($\lambda = 0$).


When $\lambda > 0$, recall that

(5) $k_{t+\lambda} = y_t - Q$, where $Q = q_t + \cdots + q_{t+\lambda-1}$,

so that the level of $y_t$ directly influences holding and penalty costs in Period $t + \lambda$.
Consequently,

(6) [expected holding and penalty cost in Period $t + \lambda$ given $y_t = y$]

    $\equiv L_t(y) = \sum_{Q=0}^{\infty} \sum_{q=0}^{\infty} H_{t+\lambda}(y - Q, q)\,p_{t+\lambda}(q)\,p_{t,t+\lambda-1}(Q)$   ($\lambda > 0$).

According to (6), $L_t(y)$ is found by taking for each $(y - Q)$ the conditional
expected value of $H_{t+\lambda}$ with respect to q and then taking the unconditional
expected value with respect to Q. If $H_{t+\lambda}$ has the form in (3), then (6)
reduces to

(7) $L_t(y) = \sum_{Q=0}^{y} h_{t+\lambda}(y - Q)\,p_{t,t+\lambda}(Q) + \sum_{Q>y} \pi_{t+\lambda}(Q - y)\,p_{t,t+\lambda}(Q)$ for $y \ge 0$,

    $L_t(y) = \sum_{Q=0}^{\infty} \pi_{t+\lambda}(Q - y)\,p_{t,t+\lambda}(Q)$ for $y < 0$.

Note that here $Q = q_t + \cdots + q_{t+\lambda-1} + q_{t+\lambda}$; that is, the random variable q
in (6) representing demand in Period $t + \lambda$ has been absorbed in the symbol Q
in (7), and the second subscript on the convolution probability distribution has
been increased by 1, accordingly. If all the $p_t(q)$ are identical, then $p_{t,t+\lambda}(Q)$ in
(7) is the corresponding $(\lambda + 1)$-fold convolution, which would be denoted by
$p^{\lambda+1}(Q)$.

Finally, assume inventory decisions are to be made in Periods 1, 2, . . ., T,
which implies that the inventory system, itself, is to operate until the end of
Period $T + \lambda$. [Note that $L_T(y)$ should be defined so as to indicate what happens
at the end of Period $T + \lambda$ if any items remain in inventory or if there is a backlog.]
Because the optimization occurs for decisions made in Periods 1, 2, . . ., T, it is
convenient to let T denote the length of the planning horizon (rather than $T + \lambda$),
and this convention is used hereafter.

The minimization problem involves finding an optimal order quantity for
each period and every possible $j_t$ or, equivalently, an optimal value for $y_t(j_t) \ge j_t$.
Leaving implicit in the notation below the dependence of $y_t$ on $j_t$, and $j_{t+1}$ on $y_t$,
the expected value for a specified set of policies $y_t$, for t = 1, 2, . . ., T, and
initial level $j_1$ is

(8) $E\left\{ \sum_{t=1}^{T} \alpha^{t+\lambda-1}\,[c_t(y_t - j_t) + L_t(y_t)] \right\}$,

where the expectation is taken over all possible sequences of demand, and $\alpha$ is
the usual one-period discount factor, where $0 \le \alpha \le 1$. Observe that the
expression in (8) really ignores the costs in Periods 1, 2, ..., $\lambda - 1$; this is legitimate,
since these costs are beyond control. Also notice that the constant $\alpha^{\lambda}$ multiplies the
whole expression, and can and will be dropped henceforth for simplicity. The
dynamic programming recursion appropriate to finding optimal policies that
minimize the expression in (8) is

    $f_t(j) = \min_{y \ge j} \left[ c_t(y - j) + L_t(y) + \alpha \sum_{q=0}^{\infty} f_{t+1}(y - q)\,p_t(q) \right]$
(9)     for t = 1, 2, ..., T

    $f_{T+1}(j) \equiv 0$.
The functions $f_t(j)$ are computed in the usual way for each possible j, starting with
t = T, T − 1, . . ., 1. (In practice, the range for q is truncated at a value $q^* < \infty$
such that the cumulative demand function at $q^*$ is approximately 1. Similarly, the
values for j are restricted to a range of interest.) Such a recursion is easily calculated
on a high-speed electronic computer.
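
As a rough illustration of such a computation, the Python sketch below
evaluates the recursion by brute force, minimizing over every order-up-to
level for every starting level. Several things here are assumptions made for
the illustration: stages are indexed by the number of periods remaining n
rather than by t, the lag is zero, the costs are stationary with setup cost 24,
unit cost 1, holding cost 2, and penalty cost 10, demand is exactly 3 each
period, and the range of y is truncated at a hypothetical `Y_MAX`.

```python
from functools import lru_cache

K, c, h, pi = 24, 1, 2, 10       # assumed stationary cost data
DEMAND = {3: 1.0}                # assumed demand distribution: p(3) = 1
ALPHA = 1.0                      # discount factor
Y_MAX = 30                       # assumed truncation of the order-up-to level


def order_cost(x):
    """Setup cost plus proportional cost; zero if nothing is ordered."""
    return 0 if x == 0 else K + c * x


def L(y):
    """Expected one-period holding and penalty cost for zero lag."""
    return sum((h * (y - q) if y >= q else pi * (q - y)) * pr
               for q, pr in DEMAND.items())


@lru_cache(maxsize=None)
def f(n, j):
    """Minimum expected cost with n periods remaining and entering level j."""
    if n == 0:
        return 0.0
    return min(
        order_cost(y - j) + L(y)
        + ALPHA * sum(f(n - 1, y - q) * pr for q, pr in DEMAND.items())
        for y in range(j, Y_MAX + 1)
    )


print(f(1, 0), f(1, -3), f(2, 0), f(2, 6))
```

Each call performs a full minimization over y for its own j, which is the
amount of work that a known policy form can later reduce.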

► The implicit notation in (8) conceals the following details. Given any set of
replenishment rules and $j_1$, the random demands induce a probability distribution on the $j_t$, for
$1 < t \le T$, and they, in turn, induce a probability distribution on the $y_t$, for $1 \le t \le T$.
The expectation in (8) is over these induced probability distributions, so that the
required minimization is really over replenishment rules. ◄


*Lump-sum penalty cost example. The following example was
considered in Sec. 19.4 (pp. 792-803) in the context of a single-period decision model:

     $p(q) = \tfrac{1}{2}$ for q = 4, 8

(10) $c(x) = 0$ for x = 0,   $c(x) = K + cx$ for x > 0

     $L(y) = \pi \sum_{q>y} p(q) = \pi$ for $y \le 3$,   $\tfrac{1}{2}\pi$ for y = 4, 5, 6, 7,   0 for $y \ge 8$,

where $K > 0$, $c \ge 0$, and $\pi > 0$.

Assume $\lambda = 0$, and that the demand distributions and cost functions are
stationary, $\alpha = 1$, and T = 2. Apply (9) and check your results with those in
Fig. II.2. (Refer to the calculations in Sec. 19.4 to verify the Period 2 policies.)

Figure II.2. Lump-Sum Penalty Cost Example.

► The recursion in (9) can be applied with little or no modification to other inventory
situations having different assumptions. For example, you can let the value of $(1 - \alpha)$
represent a probability that the inventoried item becomes completely obsolete. If the
probability of survival changes over time, then you can use a value $\alpha_t$ instead of $\alpha$.
(The value of $\alpha_t$ can actually represent the combined effect of product survival and the
discount factor.)

As another example, you can constrain the values of $y_t$ and $x_t = (y_t - j_t)$ in the
minimization in (9). Finally, under several conditions, such as postulating that only
one order can be outstanding at any time, you can modify (9) so as to encompass a
random lead time, that is, to permit the value of $\lambda$ to vary probabilistically.

Note that the optimization in (9) has been expressed in terms of minimizing expected
costs. Suppose each item purchased by the company's customers yields revenue r per
unit, which is remitted to the company when the item is actually delivered to the
customer. If $\alpha = 1$, this revenue can be ignored in the optimization process, because a
policy that minimizes cost also maximizes profit, provided all demand is eventually
satisfied, including unfilled demand at the end of Period $T + \lambda$. But for $\alpha < 1$, unfilled
demand diminishes the present value of revenue receipts, since the payment of r is
postponed until the customer is supplied his item. To handle this consideration, let $\bar{L}_t(y)$
represent the expected holding and penalty costs exclusive of revenue, and then define

(i) $L_t(y) = \bar{L}_t(y) + (1 - \alpha)\,r \sum_{Q>y} (Q - y)\,p_{t,t+\lambda}(Q)$.

[Assume $L_T(y)$ includes the cost in Period $T + \lambda + 1$ of filling any backlog existing at
the end of Period $T + \lambda$.]

Suppose unfilled demand is lost. If $\lambda = 0$, so that $k_t = y_t$, the appropriate dynamic
programming recursion analogous to (9) is

(ii) $f_t(j) = \min_{y \ge j} \left[ c_t(y - j) + L_t(y) + \alpha \sum_{q=0}^{y} f_{t+1}(y - q)\,p_t(q) + \alpha f_{t+1}(0) \sum_{q>y} p_t(q) \right]$,

which is computationally no more complex than (9). The no-backlog case for $\lambda \ge 1$
can formally be treated by the recursion

$f_t(k, x_{t-\lambda+1}, \ldots, x_{t-1}) = \min_{x \ge 0} \Big[ \alpha^{\lambda} c_t(x) + \sum_{q=0}^{\infty} H_t(k, q)\,p_t(q)$

    $+\ \alpha \sum_{q=0}^{k} f_{t+1}(k - q + x_{t-\lambda+1},\, x_{t-\lambda+2}, \ldots, x)\,p_t(q)$

    $+\ \alpha \sum_{q>k} f_{t+1}(x_{t-\lambda+1},\, x_{t-\lambda+2}, \ldots, x)\,p_t(q) \Big]$.

Write the appropriate recursion for $\lambda = 1$ and compare it to (ii). ◄

II.4 FORM AND EXACT COMPUTATION OF 
OPTIMAL POLICIES* 


The model in the preceding section will be specialized by making these cost
assumptions:

(1) $c_t(x) = 0$ for x = 0,   $c_t(x) = K_t + c_t x$ for x > 0

and

(2) $L_t(y)$ is convex and $(c_t - \alpha c_{t+1})y + L_t(y) \to \infty$ as $|y| \to \infty$,

where $K_t \ge 0$, $c_t \ge 0$, and $c_{T+1} \equiv 0$. For example, suppose $L_t(y)$ is given by (7)
in the previous section, and $h_{t+\lambda}(y) + \pi_{t+\lambda}(-y)$ is a convex function for all y. Then
$L_t(y)$ is convex and satisfies the assumptions in (2) provided
$(c_t - \alpha c_{t+1})y + L_t(y) \to \infty$ as $|y| \to \infty$.

Define an inventory policy as being of the (s, S) type if for every Period t

(3) $y_t(j) = j$ and $x_t(j) = 0$ for $j \ge s_t$

    $y_t(j) = S_t$ and $x_t(j) = S_t - j$ for $j < s_t$.

Then we can demonstrate:

Optimality of (s, S) Policy. Given the model represented by the dynamic
programming recursion (9) in Sec. II.3, in which (1) and (2) above hold, there is an optimal
policy of the (s, S) type. Further, if all $K_t = 0$, then $s_t = S_t$.

An important implication of the theorem is that you can completely summarize
an optimal policy for each period by specifying only two parameters $(s_t, S_t)$ and
without stating the precise level of inventory at the start (t = 1). {The theorem
remains true when the condition in (2) that $L_t(y)$ is convex is weakened to
$[(c_t - \alpha c_{t+1})y + L_t(y)]$ is unimodal, which was defined in Sec. 14.3.}

Calculation of an optimal policy. Our knowing that the form of
an optimal policy is (s, S) greatly simplifies the optimization process in the
recursion (9) above. Assume here that $S_t \ge 0$ for all t and $K_t > 0$ for at least one t.
(The case of all $K_t = 0$ is treated later.)

To begin, suppose j is arbitrarily small so that the optimal $y_t(j) = S_t$. Then
you can see by inspecting the dynamic programming recursion (9) that $S_t$ is a
minimizing value of y in

(4) $\min_{y} \left[ c_t y + L_t(y) + \alpha \sum_{q=0}^{\infty} f_{t+1}(y - q)\,p_t(q) \right] \equiv \min_{y}\,[g_t(y)] = g_t(S_t)$.

Given the (s, S) form of the policy, you can state that

(5) $f_t(j) = g_t(S_t) + K_t - c_t j$ for $j < s_t$

    $f_t(j) = L_t(j) + \alpha \sum_{q=0}^{\infty} f_{t+1}(j - q)\,p_t(q)$ for $j \ge s_t$.

Therefore $s_t$ is the smallest value such that

(6) $L_t(s_t) + \alpha \sum_{q=0}^{\infty} f_{t+1}(s_t - q)\,p_t(q) \le g_t(S_t) + K_t - c_t s_t$,

or on rearranging terms, $s_t$ is the smallest value such that

(7) $g_t(s_t) \le g_t(S_t) + K_t$,

where, as in (4),

(8) $g_t(y) = c_t y + L_t(y) + \alpha \sum_{q=0}^{\infty} f_{t+1}(y - q)\,p_t(q)$.
In summary, the algorithm for Period t can be stated as:

Step 1. Calculate $g_t(y)$ in (8) for all relevant y and let $S_t \ge 0$ satisfy

    $\min_{y}\, g_t(y) = g_t(S_t)$.

Step 2. Let $s_t$ be the smallest value such that $g_t(s_t) \le g_t(S_t) + K_t$. (Note that
$s_t$ may be negative.)

Step 3. Calculate $f_t(j)$ according to (5).

Thus for each Period t the algorithm performs only a single minimization in
Step 1, and not one for each j as in the recursion (9) for the general model in the
preceding section. The algorithm begins with t = T, where $f_{T+1}(j) \equiv 0$, and
proceeds to t = T − 1, T − 2, . . ., 1. The algorithm is illustrated below.

► When all $K_t = 0$, you can omit Step 2 since $s_t = S_t$ by the theorem. Further, if all the
cost parameters as well as the demand distributions are stationary, then $S_t \ge S_{t+1}$. By
making one further assumption about a terminating condition, Step 1 can be considerably
simplified; this approach is given in Sec. II.5. ◄


Examples. The following examples serve two purposes: they demonstrate
how the algorithm operates, and they give you an idea of how optimal policies
behave when both $K_t > 0$ and the planning horizon is finite.

Assume that $K_t$ and $c_t$ are stationary, and that the holding and penalty cost
function is also stationary and of the form

(9) $H(k, q) = h \cdot (k - q)$ for $k - q \ge 0$

    $H(k, q) = \pi \cdot (q - k)$ for $k - q < 0$,

where $c + h > 0$ and $\pi > c$. Further let

(10) $\lambda = 0$ and $p_t(q) = p(q)$

so that the demand too is stationary, and

(11) $L_t(y) \equiv L(y) = \sum_{q=0}^{y} h \cdot (y - q)\,p(q) + \sum_{q>y} \pi \cdot (q - y)\,p(q)$ for $y \ge 0$

     $L_t(y) \equiv L(y) = \sum_{q=0}^{\infty} \pi \cdot (q - y)\,p(q)$ for $y < 0$.

Given all the stationarity assumptions, it is now more convenient to describe
the stage variable in the dynamic programming relation in terms of "number of
periods remaining until the end of the horizon," thereby yielding, analogously to
(9) in the preceding section, the recursion

(12) $F_n(j) = \min_{y \ge j} \left[ c(y - j) + L(y) + \alpha \sum_{q=0}^{\infty} F_{n-1}(y - q)\,p(q) \right]$

     $F_0(j) \equiv 0$,

where $c(y - j)$ denotes the ordering cost function in (1). To help you remember that
the recursion is now backward in time, the symbol $F_n$ is employed instead of $f_t$.
To start, suppose

(13) $K = 24$   $c = 1$   $h = 2$   $\pi = 10$   $\alpha = 1$   $\lambda = 0$

and that demand is deterministic:

(14) $p(3) = 1$.

Consequently,

(15) $L(y) = 2(y - 3)$ for $y \ge 3$

     $L(y) = 10(3 - y)$ for $y < 3$.

For any n, Step 1 of the algorithm indicates you must calculate the function,

(16) $G_n(y) = cy + L(y) + \alpha \sum_{q=0}^{\infty} F_{n-1}(y - q)\,p(q)$,

which, given (14), simplifies to

(17) $G_n(y) = cy + L(y) + F_{n-1}(y - 3)$.

Again, to help you remember that the recursion is now backward in time, the
symbol $G_n$ is employed instead of $g_t$.

To illustrate, let n = 1; then from (12), (13), (15), and (17),

     $G_1(6) = 6 + 2(6 - 3) + 0 = 12$
(18) $G_1(0) = 0 + 10(3) + 0 = 30$
     $G_1(-2) = -2 + 10[3 - (-2)] + 0 = 48$.

The entire function $G_1(y)$ is found in Fig. II.3. Accordingly, $S_1 = 3$ since $G_1(3) = 3$
is the minimum value for $G_1(y)$.

Going to Step 2, you can see that $s_1 = 1$ is the smallest value of $s_1$ such that

(19) $G_1(s_1) \le G_1(3) + K = 27$,

since $G_1(1) = 21$ whereas $G_1(0) = 30$.

For any n, Step 3 indicates you are to calculate

(20) $F_n(j) = G_n(S_n) + K - cj$ for $j < s_n$

     $F_n(j) = L(j) + \alpha \sum_{q=0}^{\infty} F_{n-1}(j - q)\,p(q)$ for $j \ge s_n$.

So for n = 1, verify that

(21) $F_1(j) = 27 - j$ for $j < 1$

     $F_1(j) = 10(3 - j) + 0$ for $1 \le j < 3$

     $F_1(j) = 2(j - 3) + 0$ for $3 \le j$.

The complete function $F_1(j)$ is also shown in Fig. II.3.

Returning to Step 1 for n = 2, you calculate $G_2(y)$ according to (17); for example,

     $G_2(6) = 12 + F_1(3) = 12 + 0 = 12$
(22) $G_2(0) = 30 + F_1(-3) = 30 + 30 = 60$
     $G_2(-2) = 48 + F_1(-5) = 48 + 32 = 80$.

Columns headed Gn give $G_n(y)$; columns headed Fn give $F_n(j)$.

 y, j   G1   F1   G2   F2   G3   F3   G4   F4   G5   F5   G6   F6   G7   F7
   -3   57   30   90   39   99   54  114   75  135   90  150  105  165  126
   -2   48   29   80   38   89   53  104   74  125   89  140  104  155  125
   -1   39   28   70   37   79   52   94   73  115   88  130  103  145  124
    0   30   27   60   36   69   51   84   72  105   87  120  102  135  123
    1   21   20   50   35   59   50   74   71   95   86  110  101  125  122
    2   12   10   40   34   49   47   64   62   85   83  100   98  115  113
    3    3    0   30   27   39   36   54   51   75   72   90   87  105  102
    4    6    2   26   22   41   37   56   52   77   73   92   88  107  103
    5    9    4   19   14   43   38   56   51   71   66   92   87  107  102
    6   12    6   12    6   39   33   48   42   63   57   84   78   99   93
    7   15    8   17   10   37   30   52   45   67   60   88   81  103   96
    8   18   10   22   14   32   24   56   48   69   61   84   76  105   97
    9   21   12   27   18   27   18   54   45   63   54   78   69   99   90
   10   24   14   32   22   34   24   54   44   69   59   84   74  105   95
   11   27   16   37   26   41   30   51   40   75   64   88   77  103   92
   12   30   18   42   30   48   36   48   36   75   63   84   72   99   87
   13   33   20   47   34   55   42   57   44   77   64   92   79  107   94
   14   36   22   52   38   62   48   66   52   76   62  100   86  113   99
   15   39   24   57   42   69   54   75   60   75   60  102   87  111   96

Optimal policies $(s_n, S_n)$:
    n = 1: (1,3)
    n = 2: (3,6)
    n = 3: (2,9)
    n = 4: (2,6), (2,12)
    n = 5: (2,9), (2,6)
    n = 6: (2,9)
    n = 7: (2,9), (2,6), (2,12)

Data: $K = 24$   $c = 1$   $h = 2$   $\pi = 10$   $\alpha = 1$   $p(3) = 1$   $\lambda = 0$

Figure II.3. Deterministic Example.

The complete function $G_2(y)$ is given in Fig. II.3, where you can see that $S_2 = 6$.
Continuing to Step 2, check that $s_2 = 3$ is the smallest value of $s_2$ such that

(23) $G_2(s_2) \le G_2(6) + K = 36$,

since $G_2(3) = 30$ whereas $G_2(2) = 40$. The resultant values for $F_2(j)$ are indicated
in Fig. II.3; to test your understanding, verify the values for $F_2(9)$, $F_2(6)$, $F_2(0)$,
and $F_2(-2)$.
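
The three steps can be traced mechanically for this deterministic example.
The Python sketch below is one possible arrangement, not the text's own
program: the function names, the memoization, and the search range `Y_RANGE`
are assumptions, and when several minimizing values exist it simply keeps the
smallest one.

```python
from functools import lru_cache

K, c, h, pi = 24, 1, 2, 10       # cost data of the example
DEMAND = 3                       # demand is exactly 3 in every period
Y_RANGE = range(-20, 41)         # assumed range searched in Steps 1 and 2


def L(y):
    """One-period holding or penalty cost when y is available before demand."""
    return h * (y - DEMAND) if y >= DEMAND else pi * (DEMAND - y)


@lru_cache(maxsize=None)
def G(n, y):
    """Step 1 function with n periods remaining."""
    return c * y + L(y) + F(n - 1, y - DEMAND)


@lru_cache(maxsize=None)
def policy(n):
    """Steps 1 and 2: return (s_n, S_n), keeping the smallest minimizing S_n."""
    values = {y: G(n, y) for y in Y_RANGE}
    S = min(values, key=values.get)
    s = next(y for y in Y_RANGE if values[y] <= values[S] + K)
    return s, S


@lru_cache(maxsize=None)
def F(n, j):
    """Step 3: optimal cost with n periods remaining and entering level j."""
    if n == 0:
        return 0
    s, S = policy(n)
    if j < s:
        return G(n, S) + K - c * j
    return G(n, j) - c * j


for n in (1, 2, 3):
    print(n, policy(n))
```

Only one minimization over y is carried out per stage; the values of $F_n(j)$
then follow from the two-case formula in Step 3.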

The values of $G_n(y)$ and $F_n(j)$ for n = 3, 4, . . ., 7, also appear in Fig. II.3.
Notice that these $G_n(y)$ and $F_n(j)$ are not convex. For example, there are local
minima of both $G_4(y)$ and $F_4(j)$ at y and j equal to 3, 6, and 12. Observe that
$F_n(j)$ decreases for $j < s_n$, and, as you would expect, increases for j sufficiently
large, for then you incur a large inventory holding cost. But note that $F_n(j)$ does
not always decrease for $s_n \le j \le S_n$. To illustrate, $F_4(3) = 51$ whereas $F_4(4) = 52$,
even though (2, 9) is optimal for n = 4. Consequently, although it is optimal to
order $(9 - j_4)$ if $j_4 < 2$, it is more costly to start with inventory $j_4 = 4$ than with
$j_4 = 3$. In such an instance, it would be better for the company to get rid of one
item, provided the associated cost of disposal were less than 1 (= 52 − 51).

The corresponding optimal values for $(s_n, S_n)$ are shown at the bottom of
Fig. II.3. Notice that in certain periods there is more than one optimal policy. In
fact, it can be shown that multiple optimal policies continue to be present for
larger n. Specifically, for k = 0, 1, 2, . . ., the optimal policies are

     (2, 9) and (2, 6) for n = 5 + 3k

(24) (2, 9) for n = 6 + 3k

     (2, 9), (2, 6), and (2, 12) for n = 7 + 3k,

where n is the number of periods remaining until the end of the horizon. Despite
this multiplicity of solutions for specific finite horizons, the unique optimal policy
is easily shown to be (2, 9) when the horizon is unbounded.

► If the disposal cost of x units is $d_t \cdot x$, then it is possible to modify an (s, S) policy as
follows. If $j_t < s_t$, order $(S_t - j_t)$, where $(s_t, S_t)$ are determined by (4) and (7).
Otherwise, let y be the minimizing value in

(i) $\min_{y \le j} \left[ d_t \cdot (j - y) + L_t(y) + \alpha \sum_{q=0}^{\infty} f_{t+1}(y - q)\,p_t(q) \right] \equiv f_t(j)$ for $j \ge s_t$.

For the example in Fig. II.3 with $d_t = 0$, the modified policy leads to the same set of
$(s_t, S_t)$. ◄

As a second illustration, suppose the economic data are the same as in (13),
except here consider $\alpha$ = 1, .9, and .8, and $\lambda$ = 0, 1, 2, and assume the stationary
demand distribution is now

(25) $p(2) = .2$   $p(3) = .8$.

[Recall $p^{\lambda+1}(Q)$ were calculated in (13) and (15) of Sec. II.2.] The resultant
optimal policies, which are unique, are shown in Fig. II.4. Notice that neither
$s_n$, $S_n$, nor $D_n \equiv S_n - s_n$ is monotonic. Observe that as $\alpha$ gets smaller, both $s_n$
and $S_n$ get smaller, so that the ordering decision tends to be postponed and the
order quantity diminished. In other words, the smaller the $\alpha$, the greater the
tendency to incur the replenishment expenses further into the future. Lastly,
note that as $\lambda$ gets larger, so do both $s_n$ and $S_n$.

         |            lambda = 0                |           alpha = 1
         | alpha = 1   alpha = .9   alpha = .8  | lambda = 1          lambda = 2
    n    |        s_n, S_n, D_n                 | s_(n-1), S_(n-1),   s_(n-2), S_(n-2),
         |                                      | D_(n-1)             D_(n-2)
    1    | 1, 3, 2     1, 3, 2      1, 3, 2     |    -                   -
    2    | 3, 6, 3     2, 6, 4      2, 6, 4     | 3, 6, 3                -
    3    | 2, 8, 6     2, 8, 6      2, 8, 6     | 5, 9, 4             6, 9, 3
    4    | 2, 11, 9    1, 6, 5      1, 6, 5     | 5, 11, 6            8, 11, 3
    5    | 2, 6, 4     2, 6, 4      2, 6, 4     | 4, 13, 9            8, 14, 6
    6    | 2, 9, 7     2, 8, 6      1, 6, 5     | 5, 9, 4             7, 16, 9
    7    | 2, 9, 7     2, 6, 4      2, 6, 4     | 5, 11, 6            7, 12, 5
    8    | 2, 9, 7     2, 8, 6      2, 6, 4     | 4, 12, 8            8, 14, 6
    9    | 2, 9, 7     2, 8, 6      2, 6, 4     | 5, 9, 4             7, 14, 7
   10    | 2, 9, 7     2, 8, 6      2, 6, 4     | 5, 11, 6            7, 14, 7

Note: n = number of periods remaining to end of horizon.

Data: $K = 24$   $c = 1$   $h = 2$   $\pi = 10$   $p(2) = .2$   $p(3) = .8$

Figure II.4. Probabilistic Examples.
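
For the probabilistic case with zero lag and no discounting, the same steps
can be carried out with exact rational arithmetic. The Python sketch below is
an illustration under assumptions: the names, the search range `Y_RANGE`, and
the use of `fractions.Fraction` are choices made here, and only the first two
stages are printed.

```python
from fractions import Fraction
from functools import lru_cache

K, c, h, pi = 24, 1, 2, 10
P = {2: Fraction(1, 5), 3: Fraction(4, 5)}   # p(2) = .2, p(3) = .8
Y_RANGE = range(-20, 41)                     # assumed search range


def L(y):
    """Expected holding and penalty cost when y is available before demand."""
    return sum((h * (y - q) if y >= q else pi * (q - y)) * pr
               for q, pr in P.items())


@lru_cache(maxsize=None)
def G(n, y):
    """Step 1 function: expected cost of starting at y with n periods left."""
    return c * y + L(y) + sum(F(n - 1, y - q) * pr for q, pr in P.items())


@lru_cache(maxsize=None)
def policy(n):
    """Return (s_n, S_n); the smallest minimizing S_n is kept."""
    values = {y: G(n, y) for y in Y_RANGE}
    S = min(values, key=values.get)
    s = next(y for y in Y_RANGE if values[y] <= values[S] + K)
    return s, S


@lru_cache(maxsize=None)
def F(n, j):
    """Optimal expected cost with n periods remaining and entering level j."""
    if n == 0:
        return Fraction(0)
    s, S = policy(n)
    return G(n, S) + K - c * j if j < s else G(n, j) - c * j


for n in (1, 2):
    s, S = policy(n)
    print(n, s, S, S - s)
```

Extending the sketch to a positive lag would require replacing the one-period
demand distribution inside `L` by the appropriate convolution; that extension
is not attempted here.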

For a final example, consider the same economic data in (13), but let h = 4,
and keep the same deterministic demand assumption in (14). As you can verify
by applying the optimizing algorithm, the optimal policy $(s_n, S_n)$ is

     (1, 3), (1, 6), (1, 9) for n = 1, 3, 5, 7, . . .
(26)
     (2, 6) for n = 2, 4, 6, 8, . . .,

where n is the number of periods remaining to the end of the horizon. Observe
not only the multiplicity of optimal policies when n is odd, but also the persistent
oscillation of $s_n$ between the values 1 and 2. Consequently, if n periods remain to
the end of the horizon and if at the current period $j_n = 1$, you do order if n is
even, but do not if n is odd. As shown in the special material below, (2, 6) becomes
the unique optimal stationary policy when the horizon is taken as unbounded.


► Suppose for every n you let $S_n = 6$ in (26). Consider the case of inventory being at
level 1 for the initial period. Then if you place an order immediately, so that in effect
s = 2, the resultant sequence of order quantities is (5, 0, 6, 0, 6, 0, 6, . . .) and the
sequence of ending inventory levels is (3, 0, 3, 0, . . .), regardless of whether s = 1 or 2 in
the remaining periods. Consequently, the associated sequence of replenishment costs is

(ii) $K + 5c + 3h,\ 0,\ K + 6c + 3h,\ 0,\ K + 6c + 3h,\ 0, \ldots$
     (s = 2 at initial period).

If instead you do not place an order at the initial period, so that in effect s = 1, then
the resultant sequence of order quantities is (0, 8, 0, 6, 0, 6, . . .) and sequence of
ending inventory levels is (−2, 3, 0, 3, 0, . . .), regardless of whether s = 1 or 2 in the
remaining periods. Therefore the associated sequence of replenishment costs is

(iii) $2\pi,\ K + 8c + 3h,\ 0,\ K + 6c + 3h,\ 0,\ K + 6c + 3h, \ldots$
      (s = 1 at initial period).

Consider the economic parameters of the example in (26), namely, K = 24, c = 1,
h = 4, and $\pi$ = 10. You can verify by accumulating the costs in (ii) and (iii) that placing
an order in the initial period (s = 2) is better than postponing the order for one period
(s = 1) if the horizon length T is even; it is worse if T is odd.
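
The accumulation is quick to check numerically. The Python sketch below
generates the two cost sequences as read here (order at once: K + 5c + 3h, then
0 and K + 6c + 3h alternating; postpone: twice the penalty cost, then
K + 8c + 3h, then 0 and K + 6c + 3h alternating) and totals the first T terms.
The generator names are illustrative, and end-of-horizon effects beyond these
sequences are ignored.

```python
from itertools import islice

K, c, h, pi = 24, 1, 4, 10


def costs_order_now():
    """Per-period costs when an order is placed in the initial period."""
    yield K + 5 * c + 3 * h
    while True:
        yield 0
        yield K + 6 * c + 3 * h


def costs_postpone():
    """Per-period costs when the first order is postponed one period."""
    yield 2 * pi
    yield K + 8 * c + 3 * h
    while True:
        yield 0
        yield K + 6 * c + 3 * h


def total(sequence, T):
    """Undiscounted sum of the first T period costs."""
    return sum(islice(sequence, T))


for T in range(1, 7):
    print(T, total(costs_order_now(), T), total(costs_postpone(), T))
```

For even T the first total is the smaller one, and for odd T the second is,
which is the oscillation described in the text.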

Now suppose the horizon is unbounded, and introduce the discount factor $\alpha$. Then
using (s, S) = (2, 6) as a stationary policy results in the present value of all costs

(iv) $\text{P.V.}[2] = K + 5c + 3h + \dfrac{\alpha^{2}(K + 6c + 3h)}{1 - \alpha^{2}}$,

and similarly using (s, S) = (1, 6) results in

(v) $\text{P.V.}[1] = 2\pi + \alpha(K + 8c + 3h) + \dfrac{\alpha^{3}(K + 6c + 3h)}{1 - \alpha^{2}}$.

Thus

(vi) $\text{P.V.}[1] - \text{P.V.}[2] = (2\pi - K - 5c - 3h) + \alpha(K + 8c + 3h) - \dfrac{\alpha^{2}(K + 6c + 3h)}{1 + \alpha}$

     $= 2\pi - \dfrac{K}{2} - \dfrac{3h}{2} > 0$ for $\alpha = 1$,

given the values of the cost parameters, so that (s, S) = (2, 6) is better. ◄

*Proof of "Optimality of (s, S) Policy". The proof that an (s, S)
policy is optimal under the hypotheses stated in (1) and (2) above is of two-fold
importance. Obviously, such a proof is required in order to establish that the
stated theorem is indeed correct; but this reason is the concern of technical
specialists, and therefore does not justify our including the proof in an
introductory text. The second reason is that the proof exemplifies what is called an
inductive line of reasoning. This approach applied to dynamic programming
recursions sometimes leads to insightful results, such as the form of an optimal
policy. That is why the argument appears here. The case of all $K_t = 0$ is fully
proved, and the argument for the more general case is only sketched.

To start the inductive proof, suppose for a given $t$ that $g_t(y)$ in (8) is 
convex: 

(27) $g_t(y + 1) - g_t(y) \ge g_t(y) - g_t(y - 1)$ for all integer $y$, 

and $g_t(y) \to \infty$ as $|y| \to \infty$. As in Step 1 of the algorithm, let $S_t$ be the value of $y$ 
that minimizes $g_t(y)$. These hypotheses on $g_t(y)$ and $K_t = 0$ establish the result in 
Step 2 that $s_t = S_t$, and that the form of the optimal policy is to order $(S_t - j)$ 
if $j < S_t$, and not to order otherwise. To summarize so far, the inductive hypothesis 
that $g_t(y)$ is convex and $K_t = 0$ leads to the conclusion that the optimal policy is 
$(s_t, S_t)$ with $s_t = S_t$. 

The next part of the proof is to establish that $g_{t-1}(y)$ is convex and $g_{t-1}(y) \to \infty$ 
as $|y| \to \infty$. 

Consider first $j \le s_t - 1$. Then from (5) and the fact that $s_t = S_t$, 

(28) $f_t(j) = g_t(s_t) - c_t j = c_t (s_t - j) + L_t(s_t) + \alpha \sum_{q=0}^{\infty} f_{t+1}(s_t - q)\, p_t(q)$, 

and from (5), the expression (28) also is valid for $j = s_t$. Therefore, according to 
(28), for $j \le s_t - 1$, 

(29) $f_t(j + 1) - f_t(j) = -c_t = f_t(j) - f_t(j - 1)$, 

so that $f_t(j)$ is convex for these integer values of $j$. 

Consider next $j \ge s_t$. Then from (5) and the hypothesis that $g_t(y)$ is convex, 

(30) $f_t(j + 1) - f_t(j) = g_t(j + 1) - g_t(j) - c_t \ge g_t(j) - g_t(j - 1) - c_t$. 

If $j > s_t$, then simplifying the expression on the right of (30) gives 

(31) $f_t(j + 1) - f_t(j) \ge f_t(j) - f_t(j - 1)$. 








If $j = s_t$, then since $y = s_t = S_t$ minimizes $g_t(y)$, 

(32) $g_t(s_t + 1) - g_t(s_t) \ge 0$, 

so that from (29), (30), (31), and (32), $f_t(j)$ is convex for all integer $j$. 

Knowing that $f_t(j)$ is convex, it follows that $\alpha f_t(y - q)\, p_{t-1}(q)$ is convex for 
any $q$. Since the sum of convex functions is convex, you can conclude that 

(33) $g_{t-1}(y) = c_{t-1} y + L_{t-1}(y) + \alpha \sum_{q=0}^{\infty} f_t(y - q)\, p_{t-1}(q)$ 

is convex. Further, since $(c_{t-1} - \alpha c_t) y + L_{t-1}(y) \to \infty$ as $|y| \to \infty$ by assumption 
(2), so does $g_{t-1}(y)$, as can be shown. The inductive proof that $g_t(y)$ is convex for 
all $t$ and $g_t(y) \to \infty$ for $|y| \to \infty$ is complete upon verifying that, for $t = T$ 
(the end of the horizon), 

(34) $g_T(y) = c_T y + L_T(y)$ 

is convex and $g_T(y) \to \infty$ as $|y| \to \infty$. These statements are indeed true by the 
assumptions in (2). 

The same inductive line of reasoning can be used in the case where all $K_t \ge 0$, 
except that the assumption of convexity in (27) must be generalized to the postulate 
that $g_t(y)$ in (7) satisfies, for integer $y$, 

(35) $g_t(y + a) - g_t(y) + K_t \ge \dfrac{a}{b}\,[g_t(y) - g_t(y - b)]$ ($K_t$-Convexity), 

where $a$ and $b$ are arbitrary positive integers. The property in (35) is referred to as 
$K_t$-convexity. The inductive hypothesis is that (35) holds for a given $t$, and 
you can show that (35) implies that the form of an optimal policy for Period $t$ 
is $(s_t, S_t)$ as constructed by Steps 1 and 2. For the remainder of the proof, you 
must establish that $g_{t-1}(y)$ possesses the property of $K_{t-1}$-convexity. 

II.5 CASE OF ZERO SETUP COSTS 

Suppose the model in the preceding section is further specialized by eliminating 
the setup costs $K_t$, so that 

(1) $c_t(x) = c_t \cdot x$ for $x \ge 0$, 
where $c_t > 0$. 

Further assume that if any items are left in inventory at the end of the horizon, 
they can be salvaged at the cost of $c_{T+1}$ per unit, and if any demands are still 
backlogged, they must be filled at the same cost of $c_{T+1}$ per unit. Note that here 
we let $c_{T+1} > 0$. Consequently, the general dynamic programming recursion (9) 
in Sec. II.3 is altered by setting 

(2) $f_{T+1}(j) = -c_{T+1} \cdot j$. 

The Optimality of $(s, S)$ Policy Theorem still remains valid with this new assumption. 
For many situations, when all $K_t = 0$, you can obtain the common value of 
$s_t = S_t$ by a procedure that is much simpler than that in the previous section. In 
fact, it amounts to solving a sequence of static or one-period models, which you 
studied in Sec. 19.4. 

Of course assumption (2) can lead to different $(s_t, S_t)$ values than would arise 
if you assume $f_{T+1}(j) \equiv 0$, especially for periods near the end of the horizon. 
Explain why. But if the horizon $T$ is fairly long, this ending assumption (2) 
usually has just a negligible effect on the ordering policy for the current period, 
especially when $\alpha < 1$, for then $f_{T+1}$ is weighted by only $\alpha^T$ in the $T$-period 
objective function. Indeed, if the horizon is unbounded and all the cost parameters 
as well as the demand probability distributions are stationary, then the 
determination of an optimal stationary policy is unaffected by assuming (2). 

The approach is to consider 

(3) $\min_{y}\ [(c_t - \alpha c_{t+1})\, y + L_t(y)]$ for $t = 1, 2, \ldots, T$ (General Case), 

and let 

$\bar{y}_t$ = value of $y$ found in (3). 

If 

(4) $\bar{y}_t \le \bar{y}_{t+1}$ for all $t = 1, 2, \ldots, T - 1$, 

then 

(5) $s_t = S_t = \bar{y}_t$, 

and $\bar{y}_t$ is often referred to as an optimal single critical number for 
Period $t$. 

The following observation argues for the plausibility of (5). Suppose the sequence 
of optimal single critical numbers ($s_t = S_t$) is such that $S_t \le S_{t+1}$ for all 
$t \le T - 1$. Then the incremental cost of purchasing $(s_t - j)$ items instead of 
$(s_t - 1 - j)$ items in Period $t$ is only $(c_t - \alpha c_{t+1})$, since this additional unit 
would otherwise be purchased in Period $t + 1$, regardless of demand in Period $t$, 
given that $S_{t+1} \ge S_t$. Consequently, if $S_t \le S_{t+1}$ for all $t \le T - 1$, the optimization 
can be performed for $t = 1, 2, \ldots, T - 1$ on a period-by-period basis, 
where you minimize the sum of incremental purchase cost and expected holding-and-penalty 
cost. (Note the preceding argument breaks down when $S_t > S_{t+1}$. 
Then an additional unit ordered in Period $t$ might not otherwise be purchased in 
Period $t + 1$; it would depend on the level of demand in Period $t$.) For $t = T$, 
the assumption (2) justifies employing $(c_T - \alpha c_{T+1})$ as the appropriate ordering 
cost. 

The preceding paragraph in essence demonstrated that if $S_t \le S_{t+1}$, for all 
$t \le T - 1$, is indeed optimal, then the minimization in (3) provides the value for 
$S_t$. But we really want to establish the more operational proposition that if the 
inequalities in (4) hold, then (5) is optimal. This result is argued in the advanced 
material at the end of this section. 

Suppose the holding and penalty cost functions are linear, so that 

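(6) $L_t(y) = \begin{cases} \sum_{Q=0}^{y} h_{t+\lambda}\,(y - Q)\, p_{t,t+\lambda}(Q) + \sum_{Q>y} \pi_{t+\lambda}\,(Q - y)\, p_{t,t+\lambda}(Q) & \text{for } y \ge 0 \\ \sum_{Q=0}^{\infty} \pi_{t+\lambda}\,(Q - y)\, p_{t,t+\lambda}(Q) & \text{for } y < 0. \end{cases}$ 

Then $\bar{y}_t$ is the smallest integer such that 

(7) $\sum_{Q=0}^{\bar{y}_t} p_{t,t+\lambda}(Q) \ge \dfrac{\pi_{t+\lambda} - (c_t - \alpha c_{t+1})}{\pi_{t+\lambda} + h_{t+\lambda}}$ (Linear Cost, Nonstationary Case). 

How does $\bar{y}_t$ change as $\pi_{t+\lambda}$ increases? As $h_{t+\lambda}$ increases? How do the purchase 
costs affect the value of $\bar{y}_t$? 

If in addition all the economic parameters and probability distributions are 
stationary, then (7) reduces to 

(8) $\sum_{Q=0}^{\bar{y}} p^{(\lambda+1)}(Q) \ge \dfrac{\pi - (1 - \alpha)c}{\pi + h}$ (Linear Cost, Stationary Case). 

How do you think $\bar{y}$ is likely to change as the lag $\lambda$ increases? In answering, you 
might think of what happens if the probability on the left-hand side of (8) is 
approximated by a cumulative Normal distribution. Suppose $\alpha = 1$. What effect 
does the purchase cost $c$ have on $\bar{y}$? How reasonable, then, is this policy for, say, 
the last period in a finite horizon problem? 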

Therefore, the algorithm to find the optimal single critical numbers can be 
stated as: 

Step 1. Calculate $\bar{y}_t$, for $t = 1, 2, \ldots, T$, according to (3), or according to 
(7) or (8) if appropriate. 

Step 2. Verify whether $\bar{y}_t \le \bar{y}_{t+1}$, for all $t = 1, 2, \ldots, T - 1$. If so, let 
$s_t = S_t = \bar{y}_t$. Otherwise, apply the algorithm in Sec. II.4. 

For many important cases, the inequalities in Step 2 will always be satisfied. 
For example, the inequalities hold if you postulate full stationarity (8); then all 
the $\bar{y}_t$ are identical. Consequently, once inventory falls below the level $\bar{y}$ as specified 
in (8), always order the amount demanded in the previous period. Further, 
in this case the value of $\bar{y}$ found by (8) is also optimal for an unbounded horizon 
and all values of $\alpha$, where $0 \le \alpha \le 1$. We return to this point in the next section. 

If you assume both the probability distribution of demand and the unit purchasing 
costs $c_t$ are stationary, but that the penalty costs $\pi_{t+\lambda}$ in (7) are rising over 
time relative to $h_{t+\lambda}$, then all $\bar{y}_t \le \bar{y}_{t+1}$. Give a verbal explanation of the reason 
why. Now, how much do you order in each period? 

As was pointed out above, Step 1 is no more complex than the optimization 
process required in the one-period (static) model of Sec. 19.4. In fact, when 
$\lambda = 0$ and $\alpha = 0$ in the model above, the two methods are identical. 
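The two steps of the single-critical-number algorithm can be sketched as follows for linear costs. This is a minimal illustration, not the book's procedure verbatim: it assumes a zero delivery lag, takes the critical ratio in (7) to be (π - c_t + α c_{t+1}) / (π + h), and uses hypothetical function names and Poisson demand purely as an example.

```python
import math

def poisson_pmf(mean, qmax):
    """Poisson probabilities p(0), ..., p(qmax), built up iteratively."""
    probs = [math.exp(-mean)]
    for q in range(1, qmax + 1):
        probs.append(probs[-1] * mean / q)
    return probs

def critical_number(pmf, pi, h, c_now, c_next, alpha=1.0):
    """Step 1: smallest y whose cumulative probability reaches the assumed ratio."""
    ratio = (pi - (c_now - alpha * c_next)) / (pi + h)
    cumulative = 0.0
    for y, prob in enumerate(pmf):
        cumulative += prob
        if cumulative >= ratio:
            return y
    raise ValueError("pmf was truncated too early")

def single_critical_numbers(pmfs, pis, hs, cs, alpha=1.0):
    """Steps 1 and 2: critical numbers per period, or None if the Step 2 test fails.

    `cs` holds c_1, ..., c_T followed by the terminal cost c_{T+1}.
    """
    T = len(pmfs)
    ybar = [critical_number(pmfs[t], pis[t], hs[t], cs[t], cs[t + 1], alpha)
            for t in range(T)]
    if all(ybar[t] <= ybar[t + 1] for t in range(T - 1)):
        return ybar
    return None  # fall back to the general algorithm of Sec. II.4

pmf = poisson_pmf(9, 60)
print(single_critical_numbers([pmf] * 3, [4, 9, 99], [1, 1, 1], [2, 2, 2, 2]))
```

With penalty costs rising over time, as in the usage line, the critical numbers are nondecreasing and the Step 2 test passes; reversing the order of the penalty costs makes the test fail.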

►The material at the end of Sec. II.3 pointed out that when $\lambda = 0$, the lost sales case 
could also be handled by a simple dynamic programming recursion. Assuming further 
that all the $K_t = 0$, a single-critical-number type of policy is optimal. If $L_t(y)$ is given by 
(6), and $\bar{y}_t$ is the smallest integer such that 

(i) $\sum_{q=0}^{\bar{y}_t} p_t(q) \ge \dfrac{\pi_t - c_t}{\pi_t + h_t - \alpha c_{t+1}}$ (Immediate Delivery, Linear Cost, Lost Sales Case), 

then $\bar{y}_t$ is the optimal single critical number provided (4) holds. 

We now prove the validity of the single-critical-number algorithm. Specifically, let $\bar{y}_t$ 
be a value of $y$ such that 

(ii) $\min_{y}\ [(c_t - \alpha c_{t+1})\, y + L_t(y)] = (c_t - \alpha c_{t+1})\, \bar{y}_t + L_t(\bar{y}_t)$ for $t = 1, 2, \ldots, T$. 

We show that if $\bar{y}_t \le \bar{y}_{t+1}$, for $t = 1, 2, \ldots, T - 1$, then $s_t = S_t = \bar{y}_t$. 

The argument can be established inductively. Start at $t = T$. Then by assumptions 
(1) and (2) 

(iii) $f_T(j) = \min_{y \ge j}\ \left[ c_T (y - j) + L_T(y) - \alpha \sum_{q=0}^{\infty} c_{T+1} (y - q)\, p_T(q) \right]$, 

where, as usual, $f_T(j)$ represents the expected cost of an optimal policy when entering 
inventory is $j$ at Period $T$. Given that a single-critical-number policy is optimal for (iii), 
($s_T = S_T$) is a minimizing value of $y$ found from 

(iv) $\min_{y}\ [c_T \cdot y + L_T(y) - \alpha c_{T+1}\, y]$, 

which agrees with (ii). Hence, $\bar{y}_T = s_T = S_T$, as claimed by the algorithm. 
Consequently, 

(v) $f_T(j) = \begin{cases} c_T (S_T - j) + L_T(S_T) - \alpha \sum_{q=0}^{\infty} c_{T+1} (S_T - q)\, p_T(q) & \text{for } j \le S_T \\ L_T(j) - \alpha \sum_{q=0}^{\infty} c_{T+1} (j - q)\, p_T(q) & \text{for } j > S_T. \end{cases}$ 

Define 

(vi) $d_T(j) = c_T (S_T - j) + L_T(S_T) - \alpha \sum_{q=0}^{\infty} c_{T+1} (S_T - q)\, p_T(q) = -c_T\, j + g_T(S_T)$. 

Given that $\bar{y}_T$ is derived from (iv), verify that 

(vii) $d_T(j) = f_T(j)$ for $j \le \bar{y}_T$, and $d_T(j) \le f_T(j)$ for $j > \bar{y}_T$. 

Hence, 

(viii) $\min_{y \ge j}\ \left[ c_{T-1} (y - j) + L_{T-1}(y) + \alpha \sum_{q=0}^{\infty} d_T(y - q)\, p_{T-1}(q) \right]$ 

(ix) $\le \min_{y \ge j}\ \left[ c_{T-1} (y - j) + L_{T-1}(y) + \alpha \sum_{q=0}^{\infty} f_T(y - q)\, p_{T-1}(q) \right]$ 

(x) $= f_{T-1}(j)$, 

where, as usual, $f_{T-1}(j)$ represents the expected cost of an optimal policy when 
entering inventory is $j$ at Period $T - 1$. 

Substitute the expression (vi) into (viii) and check that for $j$ sufficiently small in 
(viii), a minimizing value of $y$ can be found from 

(xi) $\min_{y}\ [c_{T-1}\, y + L_{T-1}(y) - \alpha c_T\, y]$. 

By hypothesis, $\bar{y}_{T-1}$ is a solution to (xi). 

Also, $\bar{y}_{T-1} \le S_T$ by assumption, and therefore $\bar{y}_{T-1} - q \le S_T$, for $q \ge 0$; a fortiori, 

(xii) $d_T(\bar{y}_{T-1} - q) = f_T(\bar{y}_{T-1} - q)$ for $q \ge 0$. 

We are now justified in writing (viii) and (ix) as: 

(xiii) $c_{T-1} (\bar{y}_{T-1} - j) + L_{T-1}(\bar{y}_{T-1}) + \alpha \sum_{q=0}^{\infty} f_T(\bar{y}_{T-1} - q)\, p_{T-1}(q)$ 

$\le \min_{y \ge j}\ \left[ c_{T-1} (y - j) + L_{T-1}(y) + \alpha \sum_{q=0}^{\infty} f_T(y - q)\, p_{T-1}(q) \right]$, 

so that $\bar{y}_{T-1}$ is an optimal value of $y$ in the right-hand expression in (xiii), for sufficiently 
small $j$; a fortiori, since a single-critical-number policy is optimal, $\bar{y}_{T-1} = s_{T-1} = S_{T-1}$, 
as was to be shown. 

The correctness of the entire algorithm is established by continuing in the same 
fashion for $t = T - 2, T - 3, \ldots, 1$. ◄ 

II.6 UNBOUNDED HORIZON MODEL 

Here we postulate the cost assumptions made in Sec. II.4, except now we 
assume that the economic data and demand distributions are stationary over an 
unbounded horizon. Specifically, the cost assumptions are 

(1) $c(x) = \begin{cases} 0 & \text{for } x = 0 \\ K + cx & \text{for } x > 0 \end{cases}$ where $K \ge 0$ and $c > 0$ 

(2) $L(y)$ is convex 

(3) $c(1 - \alpha)\, y + L(y) \to \infty$ as $|y| \to \infty$. 

We can show the following proposition. 

Optimality of a Stationary $(s, S)$ Policy. Given the model represented by the 
dynamic programming recursion (9) in Sec. II.3 in which the economic parameters 
satisfy (1), (2), and (3), where $0 \le \alpha \le 1$, there is an optimal policy of the $(s, S)$ type 
in which $s_t = s$ and $S_t = S$, for all $t$, over an unbounded horizon. Further, if all 
$K_t = K = 0$, then $s = S$. 

{The theorem remains true when (2) is weakened to $-[c(1 - \alpha)\, y + L(y)]$ is 
unimodal, defined in Sec. 14.3.} 

The numerical procedures for calculating an optimal $(s, S)$ policy are involved, 
except when all $K_t = 0$, and are presented in the next section. In the remainder 
of this section, you will find a sensitivity analysis of optimal $(s, S)$ policies to changes 
in the economic parameters and demand probability distributions. You will also 
see how to calculate the optimal ($s = S$) when all $K_t = 0$, and how to compute 
a nearly optimal $(s, S)$ policy using a Normal distribution approximation. 

► For $\alpha < 1$, a unique solution exists to the dynamic programming functional equation 

(i) $f(j) = \min_{y \ge j}\ \left[ c(y - j) + L(y) + \alpha \sum_{q=0}^{\infty} f(y - q)\, p(q) \right]$ $(0 \le \alpha < 1)$, 

which is the appropriate generalization of the finite horizon recursion (9) in Sec. II.3. 
Assuming the theorem applies so that the optimal form of the policy is $(s, S)$, the function 
$f(j)$ must satisfy, for an optimal $(s, S)$ policy, 

(ii) $f(j) = K + c \cdot (S - j) + L(S) + \alpha \sum_{q=0}^{\infty} f(S - q)\, p(q)$ for $j < s$ 

$f(j) = L(j) + \alpha \sum_{q=0}^{\infty} f(j - q)\, p(q)$ for $j \ge s$. 

For $\alpha = 1$, the analogous extremal equation is 

(iii) $g(j) + g = \min_{y \ge j}\ \left[ c(y - j) + L(y) + \sum_{q=0}^{\infty} g(y - q)\, p(q) \right]$, 

where $g$ represents the expected cost per period of an optimal policy. Assuming the 
theorem applies, the function $g(j)$ and $g$ must satisfy, for an optimal $(s, S)$ policy, 

(iv) $g(j) + g = K + c \cdot (S - j) + L(S) + \sum_{q=0}^{\infty} g(S - q)\, p(q)$ for $j < s$ 

$g(j) + g = L(j) + \sum_{q=0}^{\infty} g(j - q)\, p(q)$ for $j \ge s$. ◄ 

Sensitivity analysis. Assume $\alpha = 1$ and that the expected holding 
and penalty cost function is 

(4) $L(y) = \begin{cases} \sum_{Q=0}^{y} h \cdot (y - Q)\, p^{(\lambda+1)}(Q) + \sum_{Q>y} \pi \cdot (Q - y)\, p^{(\lambda+1)}(Q) & \text{for } y \ge 0 \\ \sum_{Q=0}^{\infty} \pi \cdot (Q - y)\, p^{(\lambda+1)}(Q) & \text{for } y < 0. \end{cases}$ 

Since all demands are eventually filled, $c_t = c$, and $\alpha = 1$, the value of $c$ does not 
influence the optimality of a policy and therefore can be omitted in the discussion 
below. Further, you can normalize one of the cost parameters, say 

(5) $h = 1$, 

because a policy that is optimal for the costs $(K, h, \pi)$ is also optimal for any 
multiple of the costs, such as $(2K, 2h, 2\pi)$. 

Over a wide range of economic parameters and demand probability distributions, 

$s$, $S$, $(S - s)$ increase as $\lambda$ increases; 

$S$, $(S - s)$ increase and $s$ decreases as $K$ increases; 

$s$ increases and $(S - s)$ decreases as $\pi$ increases; and 

$s$ increases as $E[q]$ increases. 

As the example in Fig. II.5 shows, $S$ need not increase with $\pi$ or $E[q]$, and 
$(S - s)$ need not increase with $E[q]$. A second illustration showing that $(S - s)$ 
need not increase with $E[q]$ is exhibited in Fig. II.6. Notice there is a moderate 
decrease in $(S - s)$ when $E[q]$ goes from 22 to 23 and a large decrease when 
$E[q]$ goes from 61 to 63. 


(6) 

Poisson Distribution, K = 64, h = 1, λ = 0 

            π = 4            π = 99 
E[q]        s, S, S-s        s, S, S-s 
49          35, 100, 65      56, 116, 60 
64          46, 131, 85      72, 83, 11 

Figure II.5. Sensitivity Analysis. 

Poisson Distribution, K = 64, h = 1, π = 9, λ = 0 

E[q]        s, S, S-s 
21          16, 65, 49 
22          17, 68, 51 
23          18, 52, 34 
24          19, 54, 35 
59          52, 126, 74 
61          53, 131, 78 
63          55, 73, 18 
64          56, 74, 18 

Figure II.6. Variation in (S - s). 

The fact that $(S - s)$ does drop substantially as $E[q]$ gets large demonstrates 
a significant difference between the discrete review models in this appendix and 
the continuous review model in Chap. 19. In the continuous review case, as the 
mean demand rate increases, the order quantity $(S - s)$ always increases, but the 
interval between successive orders decreases. In the discrete review case, as $E[q]$ 
grows, eventually the probability is nearly 1 that a replenishment order is placed 
at every period. Consequently, the system behaves almost as if $K = 0$, that is, as if 


►In the following special case, the optimal $(S - s)$ is precisely the EOQ, formula (8) 
for the deterministic lot-size model in Sec. 19.5. Assume $\alpha = 1$, $\lambda = 0$, and $p(q)$ is the 
exponential density function 

(i) piq) = m*-*” for 5 ^ 0 E[q\ = 

and $L(y)$ is given by (4), except that integrals are used instead of summations. Then the 
optimal values for $s$ and $S$ are determined by 

(ii) = ^ 

Case of $K = 0$. If $K = 0$, then the calculation for the common value $s = S$ 
is identical to that in Sec. II.5. Specifically, if $L(y)$ is given by (4) above, then 
$\bar{y} = s = S$ is the smallest integer such that 

(7) $\sum_{Q=0}^{\bar{y}} p^{(\lambda+1)}(Q) \ge \dfrac{\pi - (1 - \alpha)c}{\pi + h}$ $(0 \le \alpha \le 1)$. 

For more general $L(y)$, $\bar{y}$ is found by 

(8) $\min_{y}\ [(1 - \alpha)\, c\, y + L(y)]$. 

Case of $K > 0$. A complete algorithm to compute policies efficiently for 
an unbounded horizon is briefly sketched in the advanced material in the next 
section. For many practical situations, however, you can apply a very simple 
approximation procedure. 

Assume the expected holding and penalty function is given by (4). Also let $\alpha = 1$, 
which is an adequate assumption for most applications of the model, as can be seen 
in Fig. II.7. This table indicates values for $\alpha$ when, say, the annual interest 
rate is $i$ and there are 12, 52, 365 review intervals per year. 

$\alpha = (1 + i)^{-1/T}$ 

i       T = 12    T = 52    T = 365 
.01     .9992     .9998     .9999 
.05     .9959     .9991     .9999 
.15     .9884     .9973     .9996 
.20     .9849     .9965     .9995 
.25     .9816     .9957     .9994 
.30     .9784     .9950     .9993 

Figure II.7. Compound Discount Factors. 

The technique uses a Normal distribution approximation in a manner analogous to that 
employed for the continuous review model in Sec. 19.6. Let 

(9) $E[q] = M$, $\mathrm{Var}[q] = V$, 

so that 

(10) $E[q_t + \cdots + q_{t+\lambda}] = (\lambda + 1) M$, $\mathrm{Var}[q_t + \cdots + q_{t+\lambda}] = (\lambda + 1) V$, 

and the standard deviation of demand over these periods is $\sqrt{(\lambda + 1) V}$. Define 

(11) $u = \dfrac{s - (\lambda + 1) M}{\sqrt{(\lambda + 1) V}}$, 

so that 

(12) $s = (\lambda + 1) M + u \sqrt{(\lambda + 1) V}$. 

Denote the cumulative standardized Normal distribution as 

(13) $P_N(u) = \displaystyle\int_{-\infty}^{u} \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}\, dt$ 

and the standardized Normal loss integral as 

(14) $I_N(u) = \displaystyle\int_{u}^{\infty} (t - u)\, \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}\, dt$. 

Both $P_N(u)$ and $I_N(u)$ are tabled in Fig. 19.6 (p. 819). 
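The entries of Fig. II.7 can be regenerated with a few lines. The sketch assumes the per-period factor is compounded as α = (1 + i)^(-1/T), where T is the number of review intervals per year; that reading reproduces the tabled values to within the last printed digit. The function name is hypothetical.

```python
def discount_factor(annual_rate, reviews_per_year):
    """Per-period discount factor, assuming alpha = (1 + i) ** (-1 / T)."""
    return (1.0 + annual_rate) ** (-1.0 / reviews_per_year)

for i in (0.01, 0.05, 0.15, 0.20, 0.25, 0.30):
    print(i, [round(discount_factor(i, T), 4) for T in (12, 52, 365)])
```

Even at an annual rate of 30 percent with monthly review, the factor stays above .97, which is the sense in which setting α = 1 is described as adequate.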
The approximation algorithm is as follows. 

Step 1. Calculate 

(15) $EOQ_s = \sqrt{\dfrac{2KM}{h}}$. 

Step 2. Compute 

(16) $R_s = \dfrac{h \cdot EOQ_s}{\pi \sqrt{(\lambda + 1) V}}$ 

and find the value for $u_s$ such that 

(17) $I_N(u_s) = R_s$. 

Step 3. If $M < .8888\, K/h$, then let $(s, S)$ be determined by $s$ in (12) and 
$S = s + EOQ_s$. Otherwise go to Step 4. 

Step 4. Compute 

(18) $R = \dfrac{\pi}{h + \pi}$ 

and find the value for $v_s$ such that 

(19) $P_N(v_s) = R$. 

Define 

(20) $w_s = \text{minimum}\,(u_s, v_s)$ 

and let $(s, S)$ be determined by 

(21) $s = (\lambda + 1) M + w_s \sqrt{(\lambda + 1) V}$ 

(22) $S = (\lambda + 1) M + \text{minimum}\,[u_s \sqrt{(\lambda + 1) V} + EOQ_s,\ v_s \sqrt{(\lambda + 1) V}]$. 


Justification for Steps 1 and 2 involves mathematical arguments too advanced for 
an introductory text, but which are based on an analysis of how the expected 
cost function behaves as $K$ and $\pi$ increase. The calculation test ($M < .8888\, K/h$) 
in Step 3 establishes whether $(EOQ_s / M) > 1.5$ periods. If $(EOQ_s / M) \le 1.5$, then 
Step 4 calculates in (18) and (19) a Normal approximation to the optimal single-critical-number 
policy, as given by (7). The approximate $s$ in (21) is the smaller 
of the trial $s$ indicated by $u_s$ in (17) and by $v_s$ in (19). Similarly, the approximate 
$S$ in (22) is the smaller of the trial $S$ implied by $u_s$ and the $EOQ_s$, and by $v_s$. 

Several examples using the approximation are given in Fig. II.8. (The data 
employed permit you to compare the results with those in Fig. 19.7, p. 821, which 
exhibited a Normal approximation method for the continuous review model.) 
To see how the algorithm works, consider data for Case 1 in Fig. II.8: 

(23) $h = 1$, $\pi = 9$, $K = 32$, $\lambda = 0$, $M = 9$, $V = 9$. 

According to Step 1, 

(24) $EOQ_s = \sqrt{\dfrac{2(32)(9)}{1}} = 24$. 

Then in Step 2, 

(25) $R_s = \dfrac{1 \cdot 24}{9 \sqrt{9}} = .889$, 

and so from Fig. 19.6, 

(26) $u_s = -.75$. 

Since $9 < .8888(32/1)$, 

(27) $EOQ_s = S - s = 24$, $s = 9 - .75\sqrt{9} \approx 7$, $S = 31$. 

As Fig. II.8 shows, the optimal policy is (7, 28). 

                                 Normal Approximation     Optimal 
Case   Varied Parameter          s      S      S-s        s      S      S-s 
1      None                      7      31     24         7      28     21 
       π = 99                    12     36     24         13     33     20 
       K = 64                    5      39     34         6      37     31 
       π = 99, K = 64            11     45     34         12     42     30 
       M = 16                    13     45     32         14     37     23 
       λ = 4                     45     69     24         45     68     23 
       V = 90 (Neg. Bin.)        11     35     24         10     33     23 
2      None                      103    148    45         105    148    43 
       π = 9                     82     127    45         82     128    46 
       K = 32                    105    137    32         107    137    30 
       π = 9, K = 32             86     118    32         85     117    32 
       M = 8                     56     88     32         58     90     32 
       λ = 0                     24     69     45         25     65     40 
       V = 16 (Poisson)          91     136    45         92     131    39 

Nonvaried Parameter Values: 
Case 1: h = 1, π = 9, K = 32, c = 0, λ = 0, Poisson, M = 9, V = 9 
Case 2: h = 1, π = 99, K = 64, c = 0, λ = 4, Negative Binomial, M = 16, V = 48 

Figure II.8. Approximately Optimal (s, S) Policies. 

Now suppose $M = V = 36$. Then 

(28) $EOQ_s = 48$, $R_s = .889$, $u_s = -.75$. 

Since $36 > .8888(32/1)$, calculate 

(29) $R = .9$, $v_s = 1.3$, 

so that 

(30) $w_s = \text{minimum}\,(-.75, 1.3) = -.75$. 

Then 

(31) $s = 36 - .75(6) \approx 33$ 

$S = 36 + \text{minimum}\,[-.75(6) + 48,\ 1.3(6)] \approx 44$, 

which is the exactly optimal policy if the demand distribution is Poisson. 
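Steps 1 through 4 of the Normal approximation can be sketched in code as follows. This is an illustration under stated assumptions rather than the book's tabular procedure: the loss integral is evaluated from the identity I_N(u) = φ(u) - u[1 - Φ(u)] and inverted by bisection instead of being read from Fig. 19.6, the formulas (15) through (22) are used as read from the steps above, and all function names are hypothetical. The function returns unrounded values; rounding to integers is left to the caller.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()

def normal_loss(u):
    """Standardized Normal loss integral, using phi(u) - u * (1 - Phi(u))."""
    return STD_NORMAL.pdf(u) - u * (1.0 - STD_NORMAL.cdf(u))

def solve_loss(target, lo=-10.0, hi=10.0):
    """Find u with normal_loss(u) = target by bisection (the loss decreases in u)."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if normal_loss(mid) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def approximate_policy(K, h, pi, lam, M, V):
    """Unrounded (s, S) from the four approximation steps."""
    sigma = math.sqrt((lam + 1) * V)
    mean = (lam + 1) * M
    eoq = math.sqrt(2.0 * K * M / h)              # Step 1, (15)
    u = solve_loss(h * eoq / (pi * sigma))        # Step 2, (16) and (17)
    if M < 0.8888 * K / h:                        # Step 3
        s = mean + u * sigma
        return s, s + eoq
    v = STD_NORMAL.inv_cdf(pi / (h + pi))         # Step 4, (18) and (19)
    w = min(u, v)                                 # (20)
    s = mean + w * sigma                          # (21)
    S = mean + min(u * sigma + eoq, v * sigma)    # (22)
    return s, S

s, S = approximate_policy(K=32, h=1, pi=9, lam=0, M=9, V=9)
print(round(s), round(S))
```

Because the bisection solves (17) more finely than a two-decimal table lookup, the unrounded values can differ slightly from hand calculations; for the Case 1 data they round to the same policy as in the text.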
►If there are a finite number of possible values for $j$, then the several techniques of 
successive approximation as well as the linear programming approach in Chap. 18 
can be used to solve the functional equations (i) and (iii) in the advanced material at 
the beginning of this section. Since Secs. 18.3 through 18.5 analyze an inventory model 
by these means, no further illustrations are given here. Such solution methods are 
attractive when the purchase cost function, $c(x)$, and the expected holding and 
penalty cost function, $L(y)$, do not satisfy the assumptions of the model in Sec. II.4. 
When the theorem that the form of an optimal policy is $(s, S)$ applies, you can reformulate 
the optimization model in such a way as to exploit the special structure when using successive 
approximations and linear programming techniques. Another optimizing 
technique is outlined below. ◄ 


*II.7 MARKOV CHAIN ANALYSIS 


Chapter 18 introduced Markov chains as a means of analyzing probabilistic 
dynamic structures. Here you will see how that approach can be applied to characterize 
the behavior of an inventory system operating under a stationary $(s, S)$ 
type of policy. We can postpone making any assumptions about the cost structure 
until we discuss a method for finding optimal policies. 

Recall that 

$y_t$ = level of inventory on hand plus on order in Period $t$ 
after the ordering decision but before demand. 

Given a specific $(s, S)$ policy, not necessarily optimal, the probability distribution 
of demand induces a probability distribution on $y_t$. In particular, the transition 
law is 

(1) $y_{t+1} = \begin{cases} y_t - q_t & \text{if } y_t - q_t \ge s \\ S & \text{otherwise,} \end{cases}$ 

where $s \le S$, according to the $(s, S)$ policy. 

Assuming the demand distribution is stationary, the implied matrix of one-period 
transition probabilities is shown in Fig. II.9. For expository convenience, 
the state $y = S$ has been designated as the last row and column. Observe two 
important features of the matrix. First, the probability entries do not depend on 
$s$ and $S$ separately, but only on their difference, 

(2) $D = S - s$. 

Second, the probability entries do not depend on $\lambda$. 

Let 

(3) $r(y)$ = stationary probability of $y$. 

Then the values $r(y)$, for $y = s, s + 1, \ldots, S$, are uniquely characterized by the 
$D + 1$ linear equations, 

(4) $r(S - z) = \sum_{q=1}^{z} r(S - q)\, p(z - q) + r(S)\, p(z)$ for $1 \le z \le D$ 

(5) $\sum_{z=0}^{D} r(S - z) = 1$. 

Give a verbal interpretation to the equations in (4). [The stationary equation for 
$r(S)$ has been omitted, since it is linearly dependent on the set of equations (4).] 
The system (4) and (5) is easily solved, as you would guess after noticing the 
triangular structure in the first $(S - 1)$ rows and columns of Fig. II.9. The 
approach is first to calculate recursively the quantities 

(6) $m(0) = \dfrac{p(0)}{1 - p(0)}$ 

$m(z) = \dfrac{p(z) + \sum_{q=0}^{z-1} p(z - q)\, m(q)}{1 - p(0)}$ for $z = 1, 2, \ldots$, 

and for any $z$ let 

(7) $M(z) = m(0) + m(1) + \cdots + m(z)$. 

Then 

(8) $r(S - z) = \dfrac{m(z)}{1 + M(D)}$ for $1 \le z \le D$ 

$r(S) = \dfrac{1 + m(0)}{1 + M(D)}$ for $z = 0$. 

As a check, substitute the quantities in (8) into the equations (4) for $z = 1, 2$, 
and 3. 

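It can be shown that 

(9) average number of periods between successive replenishments $= \dfrac{1}{\text{probability of a replenishment}} = 1 + M(D)$. 

To see how the algorithm works, consider the example 

(10) $p(q) = \tfrac{1}{3}$ for $q = 0, 1, 2$. 

Then, for $z = 0, 1, 2$, and 3, 

(11) $m(0) = \dfrac{1/3}{1 - 1/3} = \dfrac{1}{2}$ 

$m(1) = \dfrac{\tfrac{1}{3} + \tfrac{1}{3}\left(\tfrac{1}{2}\right)}{1 - 1/3} = \dfrac{3}{4}$ 

$m(2) = \dfrac{\tfrac{1}{3} + \tfrac{1}{3}\left(\tfrac{1}{2}\right) + \tfrac{1}{3}\left(\tfrac{3}{4}\right)}{1 - 1/3} = \dfrac{9}{8}$ 

$m(3) = \dfrac{0 + \tfrac{1}{3}\left(\tfrac{3}{4}\right) + \tfrac{1}{3}\left(\tfrac{9}{8}\right)}{1 - 1/3} = \dfrac{15}{16}$. 

Suppose $D = 2$, so that 

(12) $M(2) = \dfrac{1}{2} + \dfrac{3}{4} + \dfrac{9}{8} = \dfrac{19}{8}$, $1 + M(2) = \dfrac{27}{8}$. 

Then 

(13) $r(S - 1) = \dfrac{6}{27} = \dfrac{2}{9}$, $r(S - 2) = \dfrac{9}{27} = \dfrac{1}{3}$. 

Consider, instead, $D = 3$ and verify that 

(14) $1 + M(3) = \dfrac{69}{16}$, $r(S - 1) = \dfrac{4}{23}$, $r(S - 2) = \dfrac{6}{23}$, $r(S - 3) = \dfrac{5}{23}$, $r(S) = \dfrac{8}{23}$. 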
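The recursion for m(z) and the resulting stationary probabilities are easy to reproduce exactly with rational arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the demand distribution is the example's p(q) = 1/3 for q = 0, 1, 2.

```python
from fractions import Fraction

def renewal_quantities(p, D):
    """m(0), ..., m(D) from the recursion in (6); p[q] is the probability of demand q."""
    def prob(q):
        return p[q] if q < len(p) else Fraction(0)
    m = [prob(0) / (1 - prob(0))]
    for z in range(1, D + 1):
        numerator = prob(z) + sum(prob(z - q) * m[q] for q in range(z))
        m.append(numerator / (1 - prob(0)))
    return m

def stationary_distribution(p, D):
    """[r(S), r(S-1), ..., r(S-D)] from (8), indexed by z = 0, ..., D."""
    m = renewal_quantities(p, D)
    denominator = 1 + sum(m)          # 1 + M(D)
    return [(1 + m[0]) / denominator] + [m[z] / denominator for z in range(1, D + 1)]

demand = [Fraction(1, 3)] * 3
print(renewal_quantities(demand, 3))
print(stationary_distribution(demand, 2))
print(stationary_distribution(demand, 3))
```

Using `Fraction` keeps every intermediate value exact, so the output can be compared digit for digit with hand calculations such as those in (11) through (14).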


► It is also possible to develop the recursion (6) and the stationary probabilities (8) by 
recourse to a renewal type of argument. This approach is insightful as it will provide you 
with meaningful interpretations of the quantities $m(z)$ and $M(z)$ in (6) and (7) as well 
as a deeper understanding of the probabilistic behavior of an $(s, S)$ type inventory rule. 

Suppose in the current period a replenishment order is placed so that $y = S$. Then 
the next replenishment will not occur until cumulative demand exceeds $D$. The expected 
number of periods for this event can be found as follows. 

Let 

(i) $Q_t$ = cumulative demand for $t$ consecutive periods, 

that is, $Q_t = q_1 + q_2 + \cdots + q_t$, and denote 

(ii) $M(z)$ = expected number of periods that cumulative demand 
remains less than or equal to $z$. 

To compute $M(z)$, define the indicator function 

$I_t(z) = \begin{cases} 1 & \text{if } Q_t \le z \\ 0 & \text{otherwise,} \end{cases}$ 

so that 

(iii) $M(z) = E\left[ \sum_{t=1}^{\infty} I_t(z) \right] = \sum_{t=1}^{\infty} E[I_t(z)] = \sum_{t=1}^{\infty} P^{(t)}(z)$, 

where $P^{(t)}(z)$ is the cumulative distribution function of the $t$-fold convolution of $p(q)$. 

Let 

$m(z)$ = expected number of periods that cumulative demand equals $z$, 

and, consequently, 

(iv) $M(z) = m(0) + m(1) + \cdots + m(z) = M(z - 1) + m(z)$. 

Then, from (iv), 

(v) $m(z) = \sum_{t=1}^{\infty} P^{(t)}(z) - \sum_{t=1}^{\infty} P^{(t)}(z - 1) = \sum_{t=1}^{\infty} p^{(t)}(z)$, 

and $m(z)$ satisfies the so-called renewal equation 

(vi) $m(z) = p(z) + \sum_{q=0}^{z} p(z - q)\, m(q)$, 

because 

(vii) $m(z) = \sum_{t=1}^{\infty} p^{(t)}(z) = p(z) + \sum_{t=2}^{\infty} p^{(t)}(z)$ 

$= p(z) + \sum_{t=2}^{\infty} \sum_{q=0}^{z} p(z - q)\, p^{(t-1)}(q)$ 

$= p(z) + \sum_{q=0}^{z} p(z - q) \sum_{t=1}^{\infty} p^{(t)}(q) = p(z) + \sum_{q=0}^{z} p(z - q)\, m(q)$. 

As you can verify, starting with $z = 0$, the renewal equation (vi) can be solved recursively 
according to the formulas in (6). 
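The interpretation of m(z) as a sum of t-fold convolution probabilities can be checked numerically against the recursive solution. The sketch below is illustrative: it truncates the infinite sum after a fixed number of terms (an assumption that is harmless here because the terms decay geometrically), uses the demand distribution p(q) = 1/3 for q = 0, 1, 2, and all function names are hypothetical.

```python
def convolve(a, b, zmax):
    """Convolution of two probability lists, keeping only the values 0, ..., zmax."""
    out = [0.0] * (zmax + 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            if i + j <= zmax:
                out[i + j] += ai * bj
    return out

def m_by_convolution(p, zmax, terms=400):
    """m(z) as the (truncated) sum over t of the t-fold convolution probabilities."""
    m = [0.0] * (zmax + 1)
    power = [p[z] if z < len(p) else 0.0 for z in range(zmax + 1)]  # one-fold
    for _ in range(terms):
        for z in range(zmax + 1):
            m[z] += power[z]
        power = convolve(power, p, zmax)
    return m

def m_by_recursion(p, zmax):
    """m(z) from the renewal equation solved recursively, starting at z = 0."""
    prob = lambda q: p[q] if q < len(p) else 0.0
    m = [prob(0) / (1 - prob(0))]
    for z in range(1, zmax + 1):
        m.append((prob(z) + sum(prob(z - q) * m[q] for q in range(z))) / (1 - prob(0)))
    return m

p = [1 / 3, 1 / 3, 1 / 3]
print(m_by_convolution(p, 3))
print(m_by_recursion(p, 3))
```

Truncating each convolution at zmax loses nothing for the values of interest, since demands are nonnegative and probability mass can only move upward.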

Since the next replenishment occurs in the period just after cumulative demand 
exceeds $D$, 

(viii) expected number of periods between successive replenishments $= 1 + M(D)$. 

Therefore the fraction of time that the system remains in the states S — 1,5 — 2, 

S — z during this interval is simply 

(») -TT^ 

so that 

(x) $r(S - z) = \dfrac{m(z)}{1 + M(D)}$ for $1 \le z \le D$. 

Since 

(xi) $r(S) + \sum_{z=1}^{D} r(S - z) = 1$, 

it follows that 

(xii) $r(S) = \dfrac{1 + m(0)}{1 + M(D)}$. ◄ 


Policy operating characteristics. Given the stationary distribution 
$r(y)$, the probability distributions of other system characteristics are easily computed. 
For example, define 

$x(j)$ = stationary probability that the level of inventory 
on hand plus on order at the start of Period $t$ 
prior to the ordering decision equals $j$. 

Then 

(15) $x(S - z) = \begin{cases} \dfrac{m(z)}{1 + M(D)} & \text{for } 0 \le z \le D \\ \dfrac{p(z) + \sum_{q=0}^{D} p(z - q)\, m(q)}{1 + M(D)} & \text{for } z > D. \end{cases}$ 

As another example, define 

$w(i)$ = long-run fraction of periods that the level of inventory 
on hand at the start of the period equals $i$. 


Then 


(16) w{i) 


- i) + - i - z)miz) 

'' r+ M{D) 


for I = 5,— 1,5 — 2,. 


Note that $w(i)$ does depend on $\lambda$. 

Because all demands are eventually filled, 

(17) expected amount ordered in any period $= E[q]$, 
expected purchase quantity when a replenishment occurs $= E[q] \cdot [1 + M(D)]$. 

*Optimization. Given the cost assumptions of the model in Sec. II.6, 
expressed in (1) and (2), the objective function is simply 

(18) $\dfrac{K}{1 + M(D)} + \sum_{z=0}^{D} L(S - z)\, r(S - z)$, 

which represents the expected cost per period. The average purchase cost per 
period is a constant $cE[q]$, and therefore has been omitted from (18). 

For the sake of completeness, we outline an algorithm for optimizing (18) by 
a search on $D$ and $S$. A computationally attractive alternative is to solve an 
optimization model like those treated in Chap. 18, where the formulation exploits 
the $(s, S)$ form of an optimal policy. Either successive approximations or linear 
programming can be applied as a solution technique to the appropriate formulation. 
To obtain an optimal $(s, S)$ policy, you must minimize (18) with respect 
to $D$ and $S$. The general algorithm is: 

Step 1. Calculate lower and upper bounds on both $s$ and $S$, and thereby 
determine lower and upper bounds for an optimal $D$. 

Step 2. For each value of $D$ between the upper and lower bounds found in 
Step 1, compute the stationary distribution $r(y)$, and find a corresponding 
optimal $S$ by minimizing $\sum_{z=0}^{D} L(S - z)\, r(S - z)$. 

Step 3. For each of the pairs of $D$ and $S$ obtained in Step 2, calculate the 
associated value of (18) and select a pair that gives the minimum for (18). 
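The search over D and S can be sketched as a plain grid search. Several assumptions are made here and should be read as such: the objective is taken to be K / [1 + M(D)] plus the sum of L(S - z) r(S - z) over z = 0, ..., D (the displayed objective is not legible in this copy), the lag is zero, L(y) is the linear holding-and-penalty function, demand is Poisson, and a fixed grid replaces the bounds of Step 1. Function names are hypothetical.

```python
import math

def poisson_pmf(mean, qmax):
    """Poisson probabilities p(0), ..., p(qmax), built up iteratively."""
    probs = [math.exp(-mean)]
    for q in range(1, qmax + 1):
        probs.append(probs[-1] * mean / q)
    return probs

def holding_penalty(y, pmf, h, pi):
    """Linear L(y) for zero lag: expected holding plus penalty cost at level y."""
    return sum(prob * (h * max(y - q, 0) + pi * max(q - y, 0))
               for q, prob in enumerate(pmf))

def renewal_quantities(pmf, D):
    """m(0), ..., m(D) from the recursive solution of the renewal equation."""
    m = [pmf[0] / (1 - pmf[0])]
    for z in range(1, D + 1):
        m.append((pmf[z] + sum(pmf[z - q] * m[q] for q in range(z))) / (1 - pmf[0]))
    return m

def cost_per_period(D, S, K, m, L):
    """Assumed objective: K / (1 + M(D)) + sum over z of L(S - z) * r(S - z)."""
    denominator = 1 + sum(m[: D + 1])
    r = [(1 + m[0]) / denominator] + [m[z] / denominator for z in range(1, D + 1)]
    return K / denominator + sum(r[z] * L[S - z] for z in range(D + 1))

def best_policy(K, h, pi, mean, d_max=60, s_max=80):
    """Grid search over D = 0..d_max and S = 0..s_max; returns (s, S, cost)."""
    pmf = poisson_pmf(mean, 150)
    m = renewal_quantities(pmf, d_max)
    L = {y: holding_penalty(y, pmf, h, pi) for y in range(-d_max, s_max + 1)}
    cost, D, S = min((cost_per_period(D, S, K, m, L), D, S)
                     for D in range(d_max + 1) for S in range(s_max + 1))
    return S - D, S, cost

print(best_policy(K=32, h=1, pi=9, mean=9))
```

The usage line takes the Case 1 data of Fig. II.8, so the result can be compared with the optimal policy listed there. A full implementation would restrict the grid with the Step 1 bounds rather than scanning a fixed rectangle.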

Further detail on the algorithm is given in the advanced material below. But 
we point out here that if $L(y)$ has the simple linear form (4) in Sec. II.6, then, in 
Step 2, an optimal $S$ will be the smallest value such that 

(19) $\sum_{i=0}^{S} w(i) \ge \dfrac{\pi}{\pi + h}$, 

where $w(i)$ is given in (16). Thus the system incurs no backlog for at least $\pi/(\pi + h)$ 
of the periods, or equivalently, incurs a backlog for no more than $h/(\pi + h)$ of 
the periods. 


►The reason that all the possible values for $D$ need to be tested in Steps 2 and 3 is that 
as $D$ gets progressively larger, the associated values of the objective function may 
exhibit local but not global minima. 

The formulas developed above are easily generalized to encompass the discounted-stationary-cost 
case where $\alpha < 1$. Specifically, the only changes required are to compute 


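(i) $m(0) = \dfrac{\alpha\, p(0)}{1 - \alpha\, p(0)}$ for $z = 0$ 

$m(z) = \dfrac{\alpha \left[ p(z) + \sum_{q=0}^{z-1} p(z - q)\, m(q) \right]}{1 - \alpha\, p(0)}$ for $1 \le z \le D$ 

instead of (6); and 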

where 


(iii) $G(y) = (1 - \alpha)\, c\, y + L(y)$, 

instead of (18). Note that when $\alpha = 1$, (i) and (6) are identical, as are (ii) and (18). 
The analogue of (19) is 

(iv) $\sum_{i=0}^{S} w(i) \ge \dfrac{\pi - (1 - \alpha)c}{\pi + h}$, 

where $w(i)$ are computed in (16) using the values from (i). 

Let $(\underline{s}, \underline{S})$ and $(\bar{s}, \bar{S})$ be the lower and upper bounds in Step 1, so that a lower bound 
for $D$ is $(\underline{S} - \bar{s})$ and an upper bound is $(\bar{S} - \underline{s})$. The bounds are determined by the 
following: 

(v) $\underline{S}$ is the smallest integer that minimizes $G(y)$ 

(vi) $\bar{S}$ is the smallest integer not less than $\underline{S}$ for which 
$G(\bar{S} + 1) \ge G(\underline{S}) + \alpha K$ 

(vii) $\underline{s}$ is the smallest integer for which $G(\underline{s}) \le G(\underline{S}) + K$ 

(viii) $\bar{s}$ is the smallest integer for which $G(\bar{s}) \le G(\underline{S}) + (1 - \alpha) K$. 

Since $\sum_{z=0}^{D} L(S - z)\, r(S - z)$ is a convex function of $S$, the minimization operation in 
Step 2 can be performed rapidly. A local minimum for this sum is also a global minimum, 
so that given $D$, you can let $S$ be the smallest integer for which 

(ix) $\sum_{z=0}^{D} L(S + 1 - z)\, r(S + 1 - z) \ge \sum_{z=0}^{D} L(S - z)\, r(S - z)$. 

When $\alpha < 1$, in certain cases you must record in Step 2 any additional values for $S$ 
that minimize the summation and then include these values in Step 3. This additional 
work ensures that you obtain an $(s, S)$ policy that is optimal for any initial level of 
inventory on hand plus on order. ◄ 
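The Step 1 bounds can be computed by direct enumeration of G(y). The sketch below is illustrative and rests on assumptions: the four bound definitions are used as read from (v) through (viii), L(y) is the linear holding-and-penalty function with zero lag, demand is Poisson, and the search range for y is an arbitrary window. Function names are hypothetical.

```python
import math

def poisson_pmf(mean, qmax):
    """Poisson probabilities p(0), ..., p(qmax), built up iteratively."""
    probs = [math.exp(-mean)]
    for q in range(1, qmax + 1):
        probs.append(probs[-1] * mean / q)
    return probs

def G(y, pmf, h, pi, c, alpha):
    """G(y) = (1 - alpha) * c * y + L(y), with linear L(y) and zero lag."""
    L = sum(prob * (h * max(y - q, 0) + pi * max(q - y, 0))
            for q, prob in enumerate(pmf))
    return (1 - alpha) * c * y + L

def policy_bounds(K, h, pi, c, alpha, pmf, y_lo=-50, y_hi=200):
    """Return (s_low, s_high, S_low, S_high) following the assumed reading of (v)-(viii)."""
    values = {y: G(y, pmf, h, pi, c, alpha) for y in range(y_lo, y_hi + 2)}
    grid = range(y_lo, y_hi + 1)
    S_low = min(grid, key=lambda y: (values[y], y))                       # (v)
    S_high = next(y for y in range(S_low, y_hi + 1)
                  if values[y + 1] >= values[S_low] + alpha * K)          # (vi)
    s_low = next(y for y in grid if values[y] <= values[S_low] + K)       # (vii)
    s_high = next(y for y in grid
                  if values[y] <= values[S_low] + (1 - alpha) * K)        # (viii)
    return s_low, s_high, S_low, S_high

pmf = poisson_pmf(9, 150)
s_low, s_high, S_low, S_high = policy_bounds(K=32, h=1, pi=9, c=0, alpha=1.0, pmf=pmf)
print("s in", (s_low, s_high), "S in", (S_low, S_high),
      "D in", (S_low - s_high, S_high - s_low))
```

For the Case 1 data of Fig. II.8 the resulting ranges should bracket the optimal policy listed in that figure, which gives a quick plausibility check on the bounds.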


REVIEW EXERCISES 

1 Consider an inventory model like that described in Sec. II.2. Assume λ = 2, the level 
of inventory at the start of Period 1 equals 10, and no replenishment orders are outstanding. 
Suppose the demands for the next 10 periods are 

^1 = 7 ^2=7 ^3 = 5 ^4 = 3 ^5=2 

^0 = 6 ^7 = 3 ^73=^ eo = 9 ^10 = 7. 

In each part below, find the values for kt^ and^^, for / = 1, 2, . .., 10, using 

the policy (r, S) = (9, 17). 

(a) Assume the backlog case. 

(b) Assume the lost demand case. 

(c) Answer parts (a) and (b) for X = 1 and X = 3. 

2 (a) Explain in detail the justification for (2) in Sec. II.2. 

(b) Explain why p_{t,t+λ−1}(q) also represents the probability that k_{t+λ} has the value 
(y_t − q). 

(c) Explain in detail the justification for (16) in Sec. II.2. 

*3 (a) Derive the convolution of Poisson distributions as given by (6) in Sec. II.2. 
(Hint: You may find it helpful to use an inductive proof.) 

(b) Derive the convolution of negative binomial distributions as given by (10). 

4 In each part below, use (4) and (8) in Sec. II.2 to find the indicated convolutions. 

(a) Assume p_1(q) = 1/3, for q = 0, 1, 2; p_2(q) = 1/4, for q = 0, 1, 2, 3; p_3(q) = 1/2, for 
q = 2, 4. Calculate p_{1,2}(q), p_{1,3}(q), and p_{2,3}(q). 

(b) Assume/>t(0) = J,/>f(l) = J,/>r(2) = ^,foreve’ry/. Calculate/>^((i) for X = 2,3,4. 

5 (a) Describe, verbally, the calculation in (4) of Sec. II.3. 

(b) Explain how (7) follows from (6) when H_t(k, q) has the form in (3). 

(c) Rewrite (7) for the case that all the p_t(q) are identical. 

(d) Explain how depends on^^, given the backlog assumption. 

(e) Explain why the term multiplies both ci{yi — jt) and Li{yi) in (8).(///n/; 

review the definitions and economic assumptions.) Also explain why the text 
states that (8) ignores the holding and penalty costs in Periods 1, 2, ..., λ − 1. 

(f) Interpret f_t(j) in (9), verbally, and state why the recursion is an appropriate 
representation of the optimization problem. Explain why it is legitimate to let j 
(the level of inventory on hand plus on order at the start of the period prior to 
the ordering decision) be the state variable. 

*6 Apply (9) in Sec. II.3 to the data in (10) and check the results in Fig. II.2, for 

(a) K = 4.5 and π = 12. 

(b) K = 1 and π = 5. 

*7 (a) Consider the model that maximizes profit, as described in the special material at 
the end of Sec. II.3. Justify (i) as the appropriate modification. 

(b) Consider the model in which unfilled demand is lost, as described in the special 
material at the end of Sec. II.3. Justify (ii) and (iii) as the appropriate recursions 
for λ = 0 and λ > 1, respectively. Write the appropriate recursion for λ = 1, 
and compare it to (ii). 

8 (a) Explain why (4) in Sec. II.4 yields an optimal S_t. 

(b) Justify the expression for f_t(j) in (5), given that the form of an optimal policy is 
(s, S). 

(c) Explain why (6) and (7) determine an optimal s_t. 

9 Consider the example (13) and (14) in Sec. II.4. 

(a) Verify the calculations in (15), (18), (19), (21), and (23). What assumption is 
being made in (11) and (12) about unfilled demand at the end of the horizon? 

(b) Perform the algorithm to find an optimal policy and F_n(j) for n = 3, using the 
data for n = 2 in Fig. II.3. (Check your results with those in Fig. II.3.) 

(c) Answer part (b) for n = 7, using the data for n = 6. 

(d) Assume that the holding cost h = 4, instead of h = 2. Apply the algorithm for 
n = 1, 2, 3, 4, and check that your answer agrees with the policies given in (26). 

10 Consider the example (13) and (25) in Sec. II.4. Verify the policy shown in Fig. II.4, 
for 

(a) λ = 0, α = 1, and n = 1, 2, 3. 

(b) λ = 1, α = 1, and n = 2, 3, 4. 

(c) Give a plausible explanation of why S_n oscillates the way it does in Fig. II.4 for 
the case λ = 0 and α = 1. 

11 (a) Give an economic interpretation of the assumption (2) in Sec. II.5 relating to the 
cost of excess inventory or unfilled demand at the end of the horizon. 

(b) Derive the formula for y_t given by (7) when the holding and penalty cost functions 
are linear, according to (6). 

(c) Consider the formula for y_t given by (7). Describe how the value of y_t changes as 
π_{t+λ} increases. As h_{t+λ} increases. Explain how the purchase costs c_t and c_{t+1} affect 
the value of y_t. 

(d) Consider the formula for y given by (8). What effect does the purchase cost c 
have on y when α = 1? Is the resultant policy "reasonable" for the last period 
in a finite horizon problem? Explain and indicate how the assumption in (2) 
gives rise to this result. 

(e) Explain how you think y given by (8) is likely to vary when the delivery lag λ 
increases. [Suggestion: assume that p(q) can be well-enough approximated by a 
Normal distribution with mean 10 and variance 4. Suppose that the fraction on 
the right of (8) is .9. Use the tabled values in Fig. 19.6 to find y for λ = 0, 1, 2.] 

(f) Suppose the penalty costs π_{t+λ} in (7) are rising over time relative to h_{t+λ}. Show that 
y_t ≤ y_{t+1}; verbally explain why. State how much you order in each period, 
assuming that initial inventory on hand plus on order at the start of the horizon 
is 0. 

(g) Verify that when λ = 0 and α = 0, the value of y is the same as that obtained 
in the one-period (static) model of Sec. 19.4. Be sure you compare the formulas 
for the linear holding and penalty cost case. 

*12 Consider the lost-sales case model (where λ = 0) in the special material at the end of 
Sec. II.5. Examine the formula determining y_t in (i) and (7) and state whether y_t is 
larger in one case than in the other. 

*13 Consider the proof of the single-critical-number algorithm in the special material at the 
end of Sec. II.5. Justify each step in the proof in detail. 

14 In each part below, apply the Normal approximation algorithm described in Sec. II.6 
to find a nearly optimal (s, S) inventory policy. 

(a) Case 2 in Fig. II.8 (show the detailed calculations). 

(b) The case E[q] = 64 and π = 4 in Fig. II.5. Do the same for π = 99. 

(c) The case E[q] = 21 in Fig. II.6. Do the same for E[q] = 23. For E[q] = 59. For 
E[q] = 64. 

(d) Verify that the test M < .8889K/h in Step 3 establishes whether EOQ/M > 1.5 
periods. 


*15 Consider the Markov analysis of an (s, S) policy, as given in Sec. II.7. 

(a) Justify the validity of the entries in Fig. II.9. Explain why the number of rows 
and columns and an entry in the ith row and jth column are the same for the 
policies (3, 8) and (9, 14). 

(b) Justify the statement that the values of the stationary probabilities r(j) are 
determined by (4) and (5). Interpret the equations in (4) verbally. Write the 
equation for r(S) and show that it is linearly dependent on the set of equations (4). 

(c) Substitute the quantities in (8) into the equations (4) for z = 1, 2, and 3, and check 
that the formulas for r(S − z) do, in fact, satisfy the stationary equations. 

(d) Consider the example (10). Verify the results in (14) for D = 3. Find the stationary 
probabilities for D = 4. 

(e) Find the stationary probabilities r(j), for j = s, s + 1, ..., S, for the policy 
(2, 6), where p(q) = 1/4, for q = 0, 1, 2, 3. 

(f) Justify the validity of the formula for v(i) given by (15). 

(g) Justify the validity of the formula for w(i) given by (16). 

(h) Justify the validity of the expected cost formula given by (18). 


16 Explain your understanding of the following terms: 


immediate delivery 
delivery lag 

continuous review model 
discrete (periodic) review model 
backlog case 
lost demand case 


convolution 
♦inductive proof 
♦K-convexity 

optimal single critical number 
♦indicator function 
♦renewal equation. 
FORMULATION AND COMPUTATIONAL EXERCISES 

17 Consider the formulas for the expected holding and penalty cost function L_t(y) in 
(7) in Sec. II.3. Assume that the economic functions and probability distributions are 
stationary. 

(a) Rewrite (7) to exhibit the simplifications resulting from assuming stationarity. 

(b) Let L(y | λ) denote your answer in part (a), where the symbol λ is introduced in 
the notation to stress the dependence on the delivery lag. Show that you can 
calculate L(y | λ) recursively as follows: 

$L(y \mid \lambda) = \sum_{q=0}^{\infty} L(y - q \mid \lambda - 1)\,p(q)$ for λ ≥ 1. 

Also, interpret the recursion verbally. 

18 Suppose the inventory holding and shortage penalty cost function H_t(k, q), as defined 
by (2) in Sec. II.3, is calculated as h times the average level of inventory on hand 
during Period t weighted by the fraction of the period that inventory is on hand, plus 
π times the average level of shortage during Period t weighted by the fraction of the 
period that inventory is "negative." For simplicity, assume that the fraction 
of the period that inventory is on hand is the smaller of 1 and the ratio 

(inventory on hand at the start of period) / (total demand). 

If inventory on hand at the start of the period is "negative," assume the ratio equals 
0. Write the appropriate formula for H_t(k, q). 

19 Consider the inventory model characterized by the dynamic programming recursion 
(9) in Sec. II.3. Assume the horizon T = 2; the demand distribution in each period 
is ^(2) = .2 and ^(3) = .8; the ordering cost is Ki f- x for x > 0; the delivery lag 
X = 0; the inventory holding and shortage penalty cost is linear with h =-6;7' and 
7C = 26; and a = .9. (Assume there is no salvage value for inventory at the end of 
Period 2 and unfilled demand at that time remains unfilled forever.) Suppose iTx; = 24 
and jlCj = 5, so that the proposition in Sec. II.4 on the optimality of an (r, iS^ policy 
does not apply. (Why?) Use (9) to calculate an optimal policy. 

*20 Consider the data in Fig. II.4, and assume α = .8. In each part below, use (ii) and the 
special material at the end of Sec. II.3 to find an optimal policy for n = 1, 2, 3. 

(a) Assume λ = 0. 

(b) Assume λ = 1. 

21 Uncertain Horizon. Consider the inventory model yielding the recursion (9) in Sec. 
II.3. Suppose the horizon length for the inventory system is uncertain, because a much 
better substitute product may possibly be developed and marketed. Specifically, let 
β_t be the conditional probability that the item will be needed to satisfy demand in 
Period t + λ + 1, given that it is needed to meet demand in Period t + λ. Show the 
appropriate modification of (9). Explain verbally how to implement an optimal 
policy found from this recursion. 

22 Perishable Commodity. Consider the following modification of the inventory model in 
Sec. II.3. Assume that any item delivered in Period t perishes (cannot be used to 
meet demand) beyond Period t + 2. Also assume that the delivery lag λ = 0. Derive 
an appropriate dynamic programming recursion. Be sure to define all the symbols 
you employ. 

*23 Consider a so-called (t, T) inventory policy, which is defined as: order the quantity 
T if the level of inventory on hand is smaller than t, and do not order otherwise. 
Assume delivery lag λ = 0. Suppose the probability distribution of demand is 

p(0) = .2   p(1) = .4   p(2) = .3   p(3) = .1, 

and that unfilled demand is completely lost. 

(a) Find the stationary probabilities for the states of the system when the policy is 
(1, 2). Is (2, 1). 

(b) Assume that the ordering cost is 3 + 2T; the holding cost, assessed on inventory at 
the end of the period, is 1 per unit; and the penalty cost for each unit of unfilled 
demand is 4. Based on your answers in part (a), is (1, 2) or (2, 1) the better policy? 

*24 (k, Q) Policy (Exercise 40 in Chap. 19). Suppose you utilize a (k, Q) policy over an 
unbounded horizon, and assume unfilled demand is completely backlogged. 

(a) Consider (k, Q) = (8, 5). Construct a Markov transition matrix for this policy, 
analogous to Fig. II.9. Label the rows and columns with their exact values. 

(b) Determine the stationary probabilities r(j) for part (a). (Hint: check the value of 
the column sums.) 

*25 In each part below, find an optimal (s, S) policy, using the algorithm in Sec. II.7 
and the calculations suggested in the special material at the end of that section. 
Consider the data in Fig. II.4, where 

(a) λ = 0 and α = 1. 

(b) λ = 0 and α = .8. 

(c) λ = 1 and α = 1. 

*26 Use the algorithmic techniques at the end of Sec. II.7 to find an optimal (s, S) policy 
for the data below. 

(a) Let p(q) = 1/10, for q = 1, 2, ..., 10; α = 1; λ = 0; h = 5; π = 10; c = .1; 
and K = 5. 

(b) Rework part (a), letting K = 10. 

(c) Rework part (a), letting α = .5. 

(d) Rework part (a), letting p(q) = 1/5, for q = 1, 2, ..., 5, and λ = 1. 


MIND-EXPANDING EXERCISES 

27 Uncertain Delivery Lag. Consider the inventory model resulting in the dynamic programming 
recursion (9) in Sec. II.3. In that model, the delivery lag λ is assumed known. 
Suppose, instead, that λ is a random variable. Assume that there is never a "crossover" 
in deliveries, that is, an order placed in Period t is never delivered before an order 
placed in an earlier period (it may be delivered in the same period). Also assume f_l, 
for l = 0, 1, ..., N, is the probability that all the orders that were placed l or more 
periods ago and have not yet been delivered arrive in Period t; every order arrives 
within N periods. (Note that f_l is assumed to be independent of the order sizes and the 
number of orders not yet delivered.) Finally, assume that the ordering cost c_t(x) is 
incurred in Period t, when the order is placed. Show the appropriate modification of 
(9). [Hint: let the state of the system be (i, x_1, ..., x_N), where i is the level of inventory 
on hand at the start of Period t, and x_l is the amount ordered l periods ago and not 
yet delivered. Note that if x_l = 0, then so do x_{l+1}, ..., x_N. Also define L_t(k) as the 
expected holding and penalty cost in Period t, given that inventory on hand after 
delivery and before demand equals k.] 

28 A military supply depot faces the following inventory problem. The demand for the 
item in Period t is p_t(q), where q = 0, 1, 2, .... The depot may purchase x new items 
from a vendor, and they are delivered immediately so that the delivery lag λ = 0. 
At the start of each period, the depot has in stock i "serviceable" items and k "reparable" 
items, where the former are items in good working order and the latter are items 
that can, at a cost, be put into good working order. For simplicity, assume that the 
repair action is also immediate. The depot's decision involves whether to repair items, 
to purchase new items from the vendor, or to scrap some of the available serviceables 
and reparables. Assume that the cost parameters are such that it is always optimal to 
scrap reparables before scrapping serviceables, to scrap reparables instead of storing 
them, and to repair before purchasing new items. The planning horizon is T periods. 

Let y be the number of serviceable items after the decision to repair, purchase, or 
scrap. If demand q exceeds y, then the depot places an emergency order and pays a 
penalty price for doing so. Let L_t(y) represent the expected holding and penalty cost 
in Period t, given that the depot's decision in Period t leads to the level y. Also assume 
that the demand q in Period t yields q reparables at the beginning of Period t + 1. 
Let the purchase cost function be c_t(x), the repair cost be b_t per unit, and the scrap 
values be v_s and v_r per serviceable and reparable item, respectively. Assume the discount 
factor α = 1. Formulate a dynamic programming recursion for finding an 
optimal policy. [Hint: let the state be (i, k) and distinguish the possibilities y > i + k, 
i < y ≤ i + k, and y ≤ i, as well as q ≤ y and q > y.] 

*29 Assume the inventory policy is (s, S) = (3, 6) and λ = 3. Suppose all unfilled demand 
is lost. Explain why there is never more than one replenishment order outstanding. 
Construct a Markov transition matrix analogous to Fig. II.9. (Suggestion: define the 
state of the system by three numbers, namely, the amount of inventory on hand 
at the start of the period, the size of any order outstanding, and the number of periods 
between the present date and an outstanding order’s due date.) 

*30 Determine the probability distributions r(j) and v(j), as described in Sec. II.7, for a 
geometric demand distribution p(q) = p(1 − p)^q, where q = 0, 1, ..., and 0 < p < 1. 

*31 Separable Markovian Decision Formulation. Consider an inventory model satisfying the 
Optimality of a Stationary (s, S) Policy proposition in Sec. II.6. Assume the discount 
factor satisfies 0 ≤ α < 1. The appropriate dynamic programming recursion is given 
by (i) in the special material. Suppose you have lower and upper bounds on optimal s 
and S, namely $\underline{s}$, $\bar{s}$, $\underline{S}$, $\bar{S}$; see (v) through (viii) in the special material ending Sec. II.7 
for techniques of computing these bounds. Also suppose that there is 
only one optimal policy. 

(a) Show how to adapt the idea in exercise 51, part (b), of Chap. 18 to characterize 
the extremal equations. (Hint: let the special state correspond to j = $\underline{s} - 1$, 
so that a replenishment order must be placed, and the system is moved to one of 
the states $\underline{S}, \ldots, \bar{S}$, at an appropriate cost. Notice that no replenishment 
order is placed for j = $\bar{s}, \ldots, \bar{S}$. The decisions at j = $\underline{s}, \ldots, \bar{s} - 1$ are either 
not to order, or to order up to the level that was used at state j = $\underline{s} - 1$.) 

(b) Write the primal linear programming model, analogous to that in (1) and (2) 
of Sec. 18.5, for the extremal equations in part (a). Determine the number of 
constraints and variables. 

(c) Determine whether the simplifications in part (d) of exercise 51 in Chap. 18 
can be applied here. If so, state the resultant number of equations and variables 
in the linear programming model of part (b) above. 


appendix 


Advanced Techniques for 
Waiting Line Models 


III.1 PRELIMINARIES 


This appendix provides additional results for rather simple queuing models† 
and, more importantly, introduces you to several useful mathematical techniques 
for treating such models. We begin by presenting two devices that are frequently 
employed in waiting line analysis. The first one is a generating function, and the 
second is a Laplace-Stieltjes transform. 

Consider a random variable taking on the values j = 0, 1, 2, ..., with the 
associated probabilities P_0, P_1, P_2, .... Define the generating function 
for this probability distribution as 

(1) $P^*(s) = P_0 + sP_1 + s^2P_2 + \cdots = \sum_{j=0}^{\infty} s^j P_j$, 

where the variable s in (1) satisfies −1 ≤ s ≤ 1, but has no particular interpretative 
significance. Observe that 

(2) $P^*(1) = 1 \qquad P^*(0) = P_0$ 

(3) $\frac{dP^*(s)}{ds} = \sum_{j=0}^{\infty} j s^{j-1} P_j$ 

so that 

(4) $\frac{dP^*(s)}{ds} = \sum_{j=0}^{\infty} j P_j = E[j]$ for s = 1. 

Verify that 

(5) $\frac{d^j P^*(s)}{ds^j} = j!\,P_j$ for s = 0. 


†This appendix requires a knowledge of elementary differential and integral calculus. 
Consequently, by knowing P*(s), you can compute the expectation of the random 
variable according to (4), as well as the individual terms of the distribution 
according to (5). 

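As you can check by elementary algebraic manipulation, if 

(6) $P_j = (1 - \rho)\rho^j$ (geometric distribution; 0 < ρ < 1) 

then 

(7) $P^*(s) = \frac{1 - \rho}{1 - \rho s}$; 

if 

(8) $P_j = \frac{e^{-\rho}\rho^j}{j!}$ (Poisson distribution), 

then 

(9) $P^*(s) = e^{-\rho(1 - s)}$; 

if 

(10) $P_j = \binom{n}{j} p^j (1 - p)^{n-j}$ (binomial distribution; 0 < p < 1) 

then 

(11) $P^*(s) = (1 - p + ps)^n$; 

and if 

(12) $P_j = \binom{m + j - 1}{j} p^m (1 - p)^j$ (negative binomial distribution; 0 < p < 1) 

then 

(13) $P^*(s) = \left[\frac{p}{1 - (1 - p)s}\right]^m$. 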

When m = 1, (12) is simply the geometric distribution with ρ = 1 − p. Check 
that when you take the nth derivative of P*(s) in (7), (9), (11), and (13) and let 
s = 0, you obtain n!P_n as stated in (5). 
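The properties in (2) through (4) can be checked numerically. The following is a minimal sketch, assuming a Poisson distribution truncated at 60 terms and an illustrative parameter value of 2; the function names are hypothetical and chosen only for this example.

```python
import math

def gen_fn(probs, s):
    """Generating function: sum of s**j * P_j over a finite (truncated) list."""
    return sum(p * s**j for j, p in enumerate(probs))

def gen_fn_derivative(probs, s):
    """First derivative of the generating function: sum of j * s**(j-1) * P_j."""
    return sum(j * p * s**(j - 1) for j, p in enumerate(probs) if j > 0)

# Illustrative Poisson parameter (an assumption, not a value from the text).
rate = 2.0
poisson = [math.exp(-rate) * rate**j / math.factorial(j) for j in range(60)]

total = gen_fn(poisson, 1.0)             # close to 1, as in (2)
p_zero = gen_fn(poisson, 0.0)            # equals P_0, as in (2)
mean = gen_fn_derivative(poisson, 1.0)   # expectation of the variable, as in (4)
closed_form = math.exp(-rate * (1 - 0.5))  # Poisson generating function at s = 0.5
```

Truncating the series at 60 terms is harmless here because the omitted Poisson tail is negligible for a parameter of 2; a heavier-tailed distribution would need more terms.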

Next consider a nonnegative random variable x and let 

(14) $G(t) = P[x \le t]$ 

be its distribution function. Define the Laplace-Stieltjes transform of 
G(t) as 

(15) $G^*(s) = \int_0^{\infty} e^{-st}\,dG(t)$, 

where s is a real number contained in a bounded interval around 0, say, 
−S ≤ s ≤ S; here again s has no particular interpretative significance. Sometimes 
this transform is called a moment generating function, since 

(16) $(-1)^n \frac{d^n G^*(s)}{ds^n} = E[t^n]$ for s = 0. 

To illustrate (15), suppose 

(17) $G(t) = \int_0^t \mu e^{-\mu w}\,dw$ (exponential distribution); 

then 

(18) $G^*(s) = \frac{\mu}{\mu + s}$. 

If x and y are independent random variables, the Laplace-Stieltjes transform 
for the distribution function of their sum is equal to the product of the transforms 
for each of the distribution functions. Consequently, if 

(19) $G(t) = \int_0^t \frac{\mu(\mu w)^{M-1} e^{-\mu w}}{(M - 1)!}\,dw$ (gamma distribution), 

which is the distribution of a sum of M identically and independently exponentially 
distributed variables, then 

(20) $G^*(s) = \left[\frac{\mu}{\mu + s}\right]^M$. 

Or if you replace μ by Mμ in (19), and thereby use the form for an Erlang 
distribution, 

(21) $G(t) = \int_0^t \frac{M\mu(M\mu w)^{M-1} e^{-M\mu w}}{(M - 1)!}\,dw$ (Erlang distribution), 

which has 

(22) $E[t] = \frac{1}{\mu}$ and $\mathrm{Var}[t] = \frac{1}{M\mu^2}$, 

then 

$G^*(s) = \left[\frac{M\mu}{M\mu + s}\right]^M$. 

A constant service time distribution can be viewed as the limiting case of M → ∞ 
in (21); then $G^*(s) = e^{-s/\mu}$. 
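The limiting statement can be illustrated numerically. The sketch below, with illustrative values of the rate and of s that are assumptions rather than data from the text, evaluates the Erlang transform for increasing M and compares it with the transform of a constant service time; it also estimates the mean from the first derivative at s = 0, as (16) suggests.

```python
import math

def erlang_transform(s, mu, m):
    """Transform of an Erlang distribution with mean 1/mu and m phases."""
    return (m * mu / (m * mu + s)) ** m

def constant_transform(s, mu):
    """Transform of a constant service time equal to 1/mu."""
    return math.exp(-s / mu)

mu, s = 2.0, 0.7  # illustrative values only

# Gap between the Erlang transform and its limit as the number of phases grows.
gaps = [abs(erlang_transform(s, mu, m) - constant_transform(s, mu))
        for m in (1, 10, 100, 1000)]

# Central-difference estimate of -dG*(s)/ds at s = 0, which should be E[t] = 1/mu.
step = 1e-5
mean_estimate = -(erlang_transform(step, mu, 3)
                  - erlang_transform(-step, mu, 3)) / (2 * step)
```

The gap shrinks roughly in proportion to 1/M, so a moderately large number of phases already mimics a constant service time closely.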

The next two sections analyze a single-server queuing system in which either 
the interarrival or the service time distribution is not exponentially distributed. 
(The case of both distributions being general is treated in the final section of this 
appendix.) The generating function and Laplace-Stieltjes transform provide the 
principal tools for deriving many of the important stationary operating characteristics. 
To specify the formulas for the steady-state probabilities, the queuing 
system's behavior will be described by a Markov chain. Hence you may find it 
helpful to reread pp. 752 through 756, which give several illustrative examples of 
Markov chains and the calculation of stationary probabilities. 

III.2 SINGLE-SERVER MODEL WITH POISSON 

INPUT AND GENERAL SERVICE DISTRIBUTION 

In Sec. 20.5 you studied a system composed of a single server with interarrivals 
exponentially distributed and service times generally distributed (designated as 
M/G/1); formulas (29) through (32) gave the values for several long-run averages. 
You should review that material, including the proof at the end of the section. 

Recall that this proof considered only those instants in time when a service had 
just been completed and the associated customer had departed. These are termed 
regeneration points, because at such instants the only information needed 
to predict the system's future is the number of customers then present; the detailed 
history of the system prior to these instants is not required. Consequently, the 
probabilistic behavior of the system at the regeneration points can be described 
by means of a so-called imbedded Markov chain. The adjective "imbedded" 
refers to the fact that the system is examined only at the regeneration 
points. The corresponding steady-state probabilities refer to such instants. An 
important fact is that it can be proved that for the M/G/1 system, these probabilities 
at regeneration points also represent the proportion of all time over an 
unbounded horizon that the system is in each state. 

Continuing with the notation introduced in the advanced material ending 
Sec. 20.5, let n be the number of persons in the system immediately after a departure, 
say, at Time T, and n' the number of persons in the system immediately 
after the next departure, say, at Time T'. Let j be the number of arrivals during 
the next interval of service. Then, as was established in Sec. 20.5, 

(1) $n' = \begin{cases} j & \text{if } n = 0 \\ n - 1 + j & \text{if } n > 0. \end{cases}$ 

If the length of the next interval of service is h, 

(2) $P[j \text{ arrivals} \mid \text{interval } h] = \frac{e^{-\lambda h}(\lambda h)^j}{j!}$, 

given that the arrival process is Poisson with rate λ. (Note how the memoryless 
property of the exponential distribution is being used.) Therefore the unconditional 
probability is 

(3) $P[j \text{ arrivals}] = \int_0^{\infty} \frac{e^{-\lambda h}(\lambda h)^j}{j!}\,dG(h) \equiv k_j$, 

where G(h) is the distribution function for service time. As shown below, for 
certain G(h), such as a gamma distribution, the value of k_j is easily derived by 
mathematical analysis. In general, however, you can always calculate k_j by using 
standard methods of numerical integration. 
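As an illustration of the numerical route, the sketch below approximates the integral in (3) by the composite Simpson rule for a gamma service distribution and compares it with the negative binomial closed form that the analysis yields for that case. The parameter values, the truncation point `upper`, and the function names are assumptions made for this example.

```python
import math

def service_density(h, mu, m):
    """Gamma service-time density with m exponential phases of rate mu."""
    return mu * (mu * h) ** (m - 1) * math.exp(-mu * h) / math.factorial(m - 1)

def k_numeric(j, lam, mu, m, upper=30.0, steps=6000):
    """Composite Simpson approximation of (3); `upper` truncates the infinite range."""
    def integrand(h):
        poisson = math.exp(-lam * h) * (lam * h) ** j / math.factorial(j)
        return poisson * service_density(h, mu, m)
    width = upper / steps  # steps must be even for Simpson's rule
    total = integrand(0.0) + integrand(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * width)
    return total * width / 3

def k_closed_form(j, lam, mu, m):
    """Negative binomial probability with p = mu / (mu + lam)."""
    p = mu / (mu + lam)
    return math.comb(m + j - 1, j) * p**m * (1 - p) ** j

lam, mu, m = 1.0, 3.0, 2  # illustrative values only
numeric = [k_numeric(j, lam, mu, m) for j in range(6)]
exact = [k_closed_form(j, lam, mu, m) for j in range(6)]
```

For a service distribution without a convenient closed form, only `service_density` would need to change; the truncation point should then be chosen so that the neglected tail of the integrand is negligible.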

The states of the imbedded Markov chain are characterized by the number 
of customers in the system at a regeneration point. Hence, given n, the proba¬ 
bility of n' in (1) is provided by the probability of j arrivals, according to (3), 
and the resultant Markov chain in matrix form is 


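(4) Rows: n customers at Time T. Columns: n' customers at Time T'. 

$\begin{array}{c|ccccc}
 & 0 & 1 & 2 & 3 & \cdots \\ \hline
0 & k_0 & k_1 & k_2 & k_3 & \cdots \\
1 & k_0 & k_1 & k_2 & k_3 & \cdots \\
2 & 0 & k_0 & k_1 & k_2 & \cdots \\
3 & 0 & 0 & k_0 & k_1 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \vdots &
\end{array}$ 

(T and T' are regeneration points). 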
Steady-state analysis. Let 

P_r = Stationary probability of r customers in the system 
at a regeneration point. 

To ensure the existence of a steady-state distribution, assume that 

(5) ρ = λE[service time] < 1. 

Then, given (4), the P_r must satisfy the steady-state equations 

(6) $P_0 = P_0 k_0 + P_1 k_0$ 

$P_1 = P_0 k_1 + P_1 k_1 + P_2 k_0$ 

$\vdots$ 

$P_r = P_0 k_r + P_1 k_r + P_2 k_{r-1} + \cdots + P_{r+1} k_0 = P_0 k_r + \sum_{j=1}^{r+1} P_j k_{r+1-j}$. 

Rearranging terms in (6) yields the recursive formulas 

(7) $P_1 = \frac{P_0(1 - k_0)}{k_0}$ 

$P_2 = \frac{P_1(1 - k_1) - P_0 k_1}{k_0}$ 

$\vdots$ 

$P_{r+1} = \frac{P_r(1 - k_1) - \sum_{j=1}^{r-1} P_j k_{r+1-j} - P_0 k_r}{k_0}$. 

Notice that to actually compute the P_r using (7), you need to have the value of 
P_0, the probability that the server is idle. It will be shown below that 

(8) P_0 = 1 − ρ, where ρ is given by (5). 
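The recursion in (7), started from the value of P_0 in (8), is straightforward to program. The sketch below is one possible arrangement, with a hypothetical function name; it is exercised on exponential service times, for which the probabilities of j arrivals during a service are geometric and the stationary distribution is known to be geometric as well. The arrival and service rates are illustrative assumptions.

```python
def stationary_from_k(k, rho, count):
    """Return P_0, ..., P_{count-1} from recursion (7), starting at P_0 = 1 - rho."""
    probs = [1.0 - rho]
    for r in range(count - 1):
        # Rearranged steady-state equation for P_r:
        # P_{r+1} * k_0 = P_r - P_0 * k_r - sum over j = 1..r of P_j * k_{r+1-j}
        tail = sum(probs[j] * k[r + 1 - j] for j in range(1, r + 1))
        probs.append((probs[r] - probs[0] * k[r] - tail) / k[0])
    return probs

# Exponential service: k_j = p * (1 - p)**j with p = mu / (mu + lam).
lam, mu = 1.0, 2.0            # illustrative rates only
rho = lam / mu
p = mu / (mu + lam)
k = [p * (1 - p) ** j for j in range(8)]

probs = stationary_from_k(k, rho, 8)
expected = [(1 - rho) * rho**r for r in range(8)]
```

Because each step subtracts nearly equal quantities and divides by k_0, rounding error can accumulate when many terms are computed; for long tails a check that the probabilities sum to nearly 1 is a sensible safeguard.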

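To prove (8) as well as to characterize the stationary probability distribution 
P_r, we derive the generating function for P_r. Take the expression for P_r in (6) and 
multiply both sides by s^r, giving 

(9) $s^r P_r = s^r P_0 k_r + \sum_{j=1}^{r+1} s^r P_j k_{r+1-j}$. 

Define the generating function $K^*(s) = \sum_{j=0}^{\infty} s^j k_j$. Then sum the terms in (9) for 
r = 0, 1, 2, ..., to yield 

(10) $P^*(s) = P_0 K^*(s) + \sum_{r=0}^{\infty} \sum_{j=1}^{r+1} s^r P_j k_{r+1-j}$ 

$= P_0 K^*(s) + \sum_{j=1}^{\infty} \sum_{r=j-1}^{\infty} s^r P_j k_{r+1-j}$ 

$= P_0 K^*(s) + \sum_{j=1}^{\infty} s^{j-1} P_j \sum_{i=0}^{\infty} s^i k_i$ 

$= P_0 K^*(s) + \frac{1}{s}\left[P^*(s) - P_0\right] K^*(s)$, 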
so that 

(11) $P^*(s) = \frac{P_0 K^*(s)(1 - s)}{K^*(s) - s}$ (generating function for the probability distribution 
of the number of customers in system). 


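Hence, P*(s) has been expressed in terms of the generating function for k_j. 

The function K*(s) can, in turn, be expressed by means of the Laplace-Stieltjes 
transform of G(t) as follows: 

(12) $K^*(s) = \sum_{j=0}^{\infty} s^j k_j = \sum_{j=0}^{\infty} \int_0^{\infty} \frac{e^{-\lambda h}(\lambda h s)^j}{j!}\,dG(h)$ 

$= \int_0^{\infty} e^{-\lambda h(1 - s)}\,dG(h) = G^*[\lambda(1 - s)]$. 

Notice that given (12), 

(13) $\frac{dK^*(s)}{ds} = -\lambda \frac{dG^*(z)}{dz}$, where $z = \lambda(1 - s)$, 

and therefore at s = 1, (13) yields the result 

(14) $\left.\frac{dK^*(s)}{ds}\right|_{s=1} = \rho$, 

since $\left.\frac{dG^*(z)}{dz}\right|_{z=0} = -(1)E[t]$ according to (16) in the preceding section. 

To derive P_0, we take the limit of P*(s) in (11) as s → 1. This gives, on applying 
L'Hôpital's rule, 

(15) $1 = \lim_{s \to 1} P^*(s) = \lim_{s \to 1} \frac{P_0 K^*(s)(1 - s)}{K^*(s) - s} = \frac{P_0}{1 - \left.\frac{dK^*(s)}{ds}\right|_{s=1}}$. 

Using (14), you can simplify (15) to 

(16) $1 = \frac{P_0}{1 - \rho}$, 

and consequently the probability that the server is idle is (P_0 = 1 − ρ), in accordance 
with (5). 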

By a similar approach, you can derive the Laplace-Stieltjes transform for the 
time spent in the system and the time spent in line. Assume the queue discipline 
is first come, first served. Consider a regeneration point. Let U(t) be the distribution 
function for the time the departing customer has spent in the system. Then 
the number of customers he leaves behind is the number who arrived during 
the time he remained in the system, that is, after he entered. Therefore an alternative 
formula to (6) for P_r is 

(17) $P_r = \int_0^{\infty} \frac{e^{-\lambda t}(\lambda t)^r}{r!}\,dU(t)$ 

so that 

(18) $P^*(s) = \sum_{r=0}^{\infty} s^r \int_0^{\infty} \frac{e^{-\lambda t}(\lambda t)^r}{r!}\,dU(t) = U^*[\lambda(1 - s)]$. 

Letting [v = λ(1 − s)] and using (5) and (12), check that you can rewrite (11) as 

(19) $U^*(v) = \frac{(1 - \rho)\,v\,G^*(v)}{v - \lambda + \lambda G^*(v)}$ (transform for the distribution of 
time a customer spends in system). 

[This is sometimes called the Pollaczek-Khintchine equation, and it 
leads to formulas (32) in Sec. 20.5.] 

Observe that taking the derivative with respect to s on both sides of (18) and 
setting s = 1 yield 

(20) E[number of customers in system] = λE[time in system], 

as was stated in (33) of Sec. 20.5. 

The time a customer spends in the system is the sum of the time he spends in 
line and the time he spends in service. Letting W(t) represent the distribution 
function of the time he spends waiting in line, you can write 

(21) $U^*(v) = W^*(v) \cdot G^*(v)$. 

Therefore from (19) you have 

(22) $W^*(v) = \frac{(1 - \rho)\,v}{v - \lambda + \lambda G^*(v)}$ (transform for the distribution of 
time a customer spends in line). 

Examples. The computational usefulness of the generating function (11) 
and the Laplace-Stieltjes transforms (19) and (22) depends on whether they 
represent distributions that are easily calculated. 

To illustrate, suppose the service time distribution is 

(23) $G(t) = \int_0^t \frac{\mu(\mu w)^{M-1} e^{-\mu w}}{(M - 1)!}\,dw$ (gamma distribution; 
mean service time = M/μ) 

so that, according to (20) in the preceding section, 

(24) $G^*(s) = \left[\frac{\mu}{\mu + s}\right]^M$. 

Then, from (12), 

(25) $K^*(s) = \left[\frac{\mu}{\mu + \lambda - \lambda s}\right]^M$. 

Letting 

(26) $p = \frac{\mu}{\mu + \lambda}$, 

you can write (25) as 

(27) $K^*(s) = \left[\frac{p}{1 - (1 - p)s}\right]^M$, 

so that from (13) in Sec. III.1, you can conclude 

(28) $k_j = \binom{M + j - 1}{j} p^M (1 - p)^j$ (negative binomial distribution) 

with 

(29) $E[j] = \frac{M\lambda}{\mu} = \rho \qquad \mathrm{Var}[j] = \frac{M\lambda}{\mu}\left[\frac{\lambda}{\mu} + 1\right]$. 

Then from (11), after some manipulation, 

(30) $P^*(s) = \frac{(1 - \rho)(1 - s)\,p^M}{p^M - s\left[1 - (1 - p)s\right]^M}$. 

When M = 1 in (23), giving the exponential distribution, then (30) reduces 
after simplification to 

(31) $P^*(s) = \frac{1 - \rho}{1 - \rho s}$ where $\rho = \frac{\lambda}{\mu}$, 

so that 

(32) $P_j = (1 - \rho)\rho^j$ (geometric distribution), 

in accordance with (12) of Sec. 20.5. Further, when M = 1, you have from (19) 
that 

(33) $U^*(v) = \frac{\mu(1 - \rho)}{\mu(1 - \rho) + v}$, 

which is the transform of an exponential distribution with mean 1/[μ(1 − ρ)]. 
Consequently, the waiting time density is 

(34) $\mu(1 - \rho)e^{-\mu(1 - \rho)t}$ for t ≥ 0, 

in agreement with (19) of Sec. 20.5. 


► If you want to express the service time distribution as an Erlang distribution according 
to (21) in Sec. III.1, then replace μ by μM in (24), (25), (26), and (30). 

In the case of constant service time of 1/μ, which can be treated formally as the limiting 
case of an Erlang distribution with M → ∞, you have 

(i) $k_j = \frac{e^{-\rho}\rho^j}{j!}$ (Poisson distribution), 

where ρ = λ/μ, so that 

(ii) $K^*(s) = e^{-\rho(1 - s)}$. 

Then 

(iii) $P^*(s) = \frac{(1 - \rho)(1 - s)}{1 - s\,e^{\rho(1 - s)}}$ (generating function for the probability distribution 
of the number of customers in an M/D/1 system). 

It can be shown that (iii) implies 




+ 


(r-«- l)!j 


(iv) 


for r >0 
Further 

(v) $G^*(v) = e^{-v/\mu}$ (transform for constant service time distribution), 

so that, by (22), 

(vi) $W^*(v) = \frac{(1 - \rho)\,v}{v - \lambda + \lambda e^{-v/\mu}}$ (transform for the distribution of time 
a customer spends in line for M/D/1). 

It can be shown that (vi) implies 

(vii) . W{t) = (1 - 

where [μt] represents the integral part of μt. 

For the sake of completeness, we mention how to find the steady-state probabilities 
for the case of constant service time 1/μ and S servers (designated as M/D/S). Consider 
the system at instants t, t + (1/μ), t + (2/μ), ..., where the value of t is arbitrary, say, 
t = 0. If there are j ≤ S customers in the system at Time T, then all of these will 
have been served by Time T + (1/μ). Consequently, the number of customers, i, in 
the system at Time T + (1/μ) is simply the number of arrivals, given by the Poisson 
distribution with parameter ρ. If instead of the above there are j > S customers in the 
system at Time T, then the number at Time T + (1/μ) is j − S plus the number of 
arrivals. These observations lead to the Markov chain matrix describing the system's 
behavior at two instants spaced 1/μ apart: 


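(viii)  Rows: number in system at Time $T$. Columns: number in system at Time $T + (1/\mu)$.

$$
\begin{array}{c|cccccccc}
 & 0 & 1 & 2 & \cdots & S & S+1 & S+2 & \cdots \\ \hline
0 & k_0 & k_1 & k_2 & \cdots & k_S & k_{S+1} & k_{S+2} & \cdots \\
1 & k_0 & k_1 & k_2 & \cdots & k_S & k_{S+1} & k_{S+2} & \cdots \\
2 & k_0 & k_1 & k_2 & \cdots & k_S & k_{S+1} & k_{S+2} & \cdots \\
\vdots & \vdots & \vdots & \vdots & & \vdots & \vdots & \vdots & \\
S & k_0 & k_1 & k_2 & \cdots & k_S & k_{S+1} & k_{S+2} & \cdots \\
S+1 & 0 & k_0 & k_1 & \cdots & k_{S-1} & k_S & k_{S+1} & \cdots \\
S+2 & 0 & 0 & k_0 & \cdots & k_{S-2} & k_{S-1} & k_S & \cdots \\
\vdots & \vdots & \vdots & \vdots & & \vdots & \vdots & \vdots & \\
2S & 0 & 0 & 0 & \cdots & k_0 & k_1 & k_2 & \cdots
\end{array}
$$
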

where, as in (i),

(ix)  $k_j = \dfrac{\rho^j e^{-\rho}}{j!}$  with $\rho = \dfrac{\lambda}{\mu}$.
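
As one illustration of such a numerical approach, the chain for constant service and $S$ servers can be truncated to a finite number of states and iterated to its stationary distribution. This is a minimal sketch under stated assumptions: the truncation size, the lumping of the overflow probability into the last state, and the use of repeated multiplication (power iteration) are illustrative choices, and the function names are hypothetical.

```python
import math


def poisson_pmf(rho, count):
    """Return k_0..k_{count-1} for a Poisson distribution with parameter rho."""
    k = [math.exp(-rho)]
    for a in range(1, count):
        k.append(k[-1] * rho / a)
    return k


def mds_transition_matrix(servers, rho, size):
    """Truncated transition matrix for instants spaced one service time apart."""
    k = poisson_pmf(rho, size)
    matrix = []
    for j in range(size):
        carried = max(j - servers, 0)  # customers not yet served after one service time
        row = [0.0] * size
        for i in range(carried, size - 1):
            row[i] = k[i - carried]
        row[size - 1] = 1.0 - sum(row)  # lump the truncated tail into the last state
        matrix.append(row)
    return matrix


def stationary_distribution(matrix, sweeps=600):
    """Approximate the stationary row vector by repeated multiplication."""
    size = len(matrix)
    pi = [1.0 / size] * size
    for _ in range(sweeps):
        pi = [sum(pi[j] * matrix[j][i] for j in range(size)) for i in range(size)]
    return pi


pi = stationary_distribution(mds_transition_matrix(1, 0.5, 40))
print(pi[:4])
```

With one server and $\rho = 0.5$ the first entry should be close to $1 - \rho$, which gives a simple check before trying $S > 1$ (where $\rho < S$ is needed for a steady state).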
Numerical techniques for finding stationary probabilities for a Markov transition matrix
can be applied to obtain 

Unfortunately, an analysis of the multiple-server model with a general service time
distribution (designated as M/G/S) is too involved to be included in this text. A


III.3 SINGLE-SERVER MODEL WITH GENERAL INPUT
AND EXPONENTIAL SERVICE DISTRIBUTION 


The approach to analyzing this model (designated as GI/M/1) is again to
examine the system only at regeneration points. But in contrast to the method 
used in the preceding section, a regeneration point is defined here to be the 
moment immediately after an arrival. 

Specifically, let $n$ be the number of customers in the system in front of the
person arriving at Time $T$, and let $n'$ be the number of customers in front of the
person arriving at Time $T'$. Let $j$ be the number of customers served during
this interval, including the person arriving at Time $T$ if his service is completed
by Time $T'$. Verify that

(1)  $n' = n + 1 - j$,

and note that $0 \le n' \le n + 1$. If the length of the interval between two successive
arrivals is $h$,

(2)  $P[j \text{ services} \mid \text{interval } h] = \dfrac{(\mu h)^j e^{-\mu h}}{j!}$  for $j \le n$,

given that the service distribution is exponential with rate $\mu$. (Note how the
memoryless property of the exponential distribution is being used.) Therefore
the unconditional probability is

(3)  $P[j \text{ services}] = \int_0^\infty \dfrac{(\mu h)^j e^{-\mu h}}{j!}\,dF(h) \equiv k_j$  for $j \le n$,

where $F(h)$ is the distribution function for interarrival times.

To represent the system in terms of an imbedded Markov chain, let the states
be characterized as the number of customers in front of a person when he arrives.
Hence, given $n$, the probability of $n'$ in (1) is indicated by the probability of $j$
departures, according to (3), and the resultant Markov chain in matrix form is

(4)  Rows: $n$ customers in front of a person arriving at Time $T$. Columns: $n'$ customers in front of a person arriving at Time $T'$ ($T$ and $T'$ are regeneration points).

$$
\begin{array}{c|cccccc}
 & 0 & 1 & 2 & 3 & 4 & \cdots \\ \hline
0 & 1-\Sigma_0 & k_0 & 0 & 0 & 0 & \cdots \\
1 & 1-\Sigma_1 & k_1 & k_0 & 0 & 0 & \cdots \\
2 & 1-\Sigma_2 & k_2 & k_1 & k_0 & 0 & \cdots \\
3 & 1-\Sigma_3 & k_3 & k_2 & k_1 & k_0 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots &
\end{array}
$$

where we have defined the symbol

(5)  $\Sigma_n \equiv \sum_{j=0}^{n} k_j$.
Steady-state analysis. Let 

$P_r$ = Stationary probability of $r$ customers in front of
a person arriving at a regeneration point.

To ensure the existence of a steady-state distribution, assume that

(6)  $\dfrac{1}{\rho} \equiv E[\text{interarrival time}]\,\mu > 1$.

Then, given (4), the $P_r$ must satisfy the steady-state equations

(7)  $P_r = \sum_{i=0}^{\infty} P_{i+r-1}\,k_i$  for $r \ge 1$

(8)  $P_0 = \sum_{i=0}^{\infty} P_i\,(1 - \Sigma_i)$.

It is easily shown that a solution to (7) is of the form

(9)  $P_r = c\,s^r$,

where the values of $c > 0$ and $s$ are determined so that both (7) and (8) are
satisfied.

To obtain $s$, substitute (9) into (7) to give

(10)  $c\,s^r = \sum_{i=0}^{\infty} c\,s^{i+r-1}\,k_i$

so that

(11)  $1 = \sum_{i=0}^{\infty} s^{i-1} k_i = \dfrac{1}{s}\sum_{i=0}^{\infty} s^{i} k_i$.

Thus $s$ must satisfy

(12)  $s = K^*(s)$,

where $K^*(s)$ is the generating function for $k_j$. Now

(13)  $K^*(s) = \int_0^\infty e^{-\mu h(1-s)}\,dF(h) = F^*[\mu(1-s)]$,

and therefore (12) can be written as

(14)  $s = F^*[\mu(1-s)]$,

so that it is only necessary to calculate the Laplace-Stieltjes transform of the
interarrival distribution $F(h)$. It can be shown that there always exists a unique
solution $s_0$ to (14) for which $0 < s_0 < 1$.

To obtain $c$, you can use (8), or equivalently, employ the equation that the
probabilities must sum to 1:

(15)  $\sum_{r=0}^{\infty} P_r = c\sum_{r=0}^{\infty} s_0^{\,r} = \dfrac{c}{1 - s_0} = 1$,

so that $c = 1 - s_0$. Therefore,

(16)  $P_r = (1 - s_0)\,s_0^{\,r}$  for $r = 0, 1, 2, \ldots$
      [geometric distribution; mean $= s_0/(1 - s_0)$].

Note that $P_0 = 1 - s_0$.
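
The root $s_0$ usually has to be found numerically. The sketch below assumes a simple successive-substitution scheme started at $s = 0$ (one of several workable root-finding methods, not a procedure given in the text) and three illustrative interarrival transforms; the helper names and the parameter values are hypothetical.

```python
import math


def solve_s0(interarrival_lst, mu, tol=1e-13, max_iter=100000):
    """Iterate s <- F*[mu(1 - s)] from s = 0; the iterates rise toward the root in (0, 1)."""
    s = 0.0
    for _ in range(max_iter):
        s_next = interarrival_lst(mu * (1.0 - s))
        if abs(s_next - s) < tol:
            return s_next
        s = s_next
    return s


def exponential_lst(lam):
    """Transform of an exponential interarrival distribution with rate lam."""
    return lambda z: lam / (lam + z)


def gamma_lst(lam, shape):
    """Transform of a gamma interarrival distribution with mean shape / lam."""
    return lambda z: (lam / (lam + z)) ** shape


def constant_lst(lam):
    """Transform of a constant interarrival time equal to 1 / lam."""
    return lambda z: math.exp(-z / lam)


mu = 10.0
s0_poisson = solve_s0(exponential_lst(9.0), mu)
s0_gamma2 = solve_s0(gamma_lst(18.0, 2), mu)   # same mean interarrival time, 1/9
s0_constant = solve_s0(constant_lst(9.0), mu)
probs_gamma2 = [(1.0 - s0_gamma2) * s0_gamma2 ** r for r in range(5)]
print(s0_poisson, s0_gamma2, s0_constant)
```

All three inputs have the same mean interarrival time, so the comparison isolates the effect of arrival variability: the more regular the arrivals, the smaller the root and the shorter the geometric tail.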

Given that a newly arriving person finds $n$ customers in front of him, the time he
must spend in the system, including his own service, is described by a gamma
density with distribution function

(17)  $U(t \mid n) = \int_0^t \dfrac{\mu(\mu w)^n e^{-\mu w}}{n!}\,dw$,

since service times are exponentially distributed. Hence,

(18)  $U(t) = \sum_{n=0}^{\infty} P_n\,U(t \mid n) = \int_0^t \mu(1 - s_0)\,e^{-\mu(1 - s_0)w}\,dw = 1 - e^{-\mu(1 - s_0)t}$
      [exponential distribution; mean $= 1/\mu(1 - s_0)$].

Consider the probability of $r$ customers in the system at any Time $T$; then it can
be proved that as Time $T \to \infty$, this probability tends to a limit:

(19)  $P_0^* = 1 - \rho$
      $P_r^* = \rho\,P_{r-1} = \rho(1 - s_0)\,s_0^{\,r-1}$  for $r = 1, 2, \ldots$
      $E[\text{number of customers in system}] = \dfrac{\rho}{1 - s_0}$,

where $\rho$ is given by (6).

Thus you have observed an important result: $P_r^* = P_r$ only when $s_0 = \rho$,
which occurs if the arrival distribution is Poisson, as is shown below. Otherwise,
the proportion of all time over an unbounded horizon during which the system has
$r$ customers, namely $P_r^*$, does not equal the long-run fraction of regeneration points
in which the system has $r$ customers, namely $P_r$. Hence the systems M/G/1 and
GI/M/1 differ essentially in this respect.
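
The difference between the two sets of probabilities is easy to tabulate once $\rho$ and $s_0$ are known. The sketch below assumes the geometric form at regeneration points and the limiting form $P_0^* = 1 - \rho$, $P_r^* = \rho(1 - s_0)s_0^{\,r-1}$; the function names and the numerical values $\rho = 0.9$, $s_0 = 0.8$ are hypothetical and serve only as an illustration.

```python
def arrival_epoch_probs(s0, r_max):
    """P_r: probabilities seen by arriving customers (regeneration points)."""
    return [(1.0 - s0) * s0 ** r for r in range(r_max + 1)]


def time_average_probs(rho, s0, r_max):
    """P_r*: long-run fraction of time with r customers in the system."""
    probs = [1.0 - rho]
    probs += [rho * (1.0 - s0) * s0 ** (r - 1) for r in range(1, r_max + 1)]
    return probs


rho, s0 = 0.9, 0.8            # illustrative values with s0 different from rho
at_arrivals = arrival_epoch_probs(s0, 5)
over_time = time_average_probs(rho, s0, 5)
mean_at_arrivals = s0 / (1.0 - s0)
mean_over_time = rho / (1.0 - s0)

poisson_case_a = arrival_epoch_probs(0.9, 5)
poisson_case_b = time_average_probs(0.9, 0.9, 5)
print(at_arrivals[0], over_time[0], mean_at_arrivals, mean_over_time)
```

Setting $s_0 = \rho$ in both functions reproduces the same list, which is the Poisson-input case.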


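Examples. Suppose the interarrival distribution is

(20)  $F(t) = \int_0^t \dfrac{\lambda(\lambda z)^{M-1} e^{-\lambda z}}{(M-1)!}\,dz$  (gamma distribution; mean $= M/\lambda$).

Then

(21)  $k_j = \int_0^\infty \dfrac{(\mu z)^j e^{-\mu z}}{j!}\cdot\dfrac{\lambda(\lambda z)^{M-1} e^{-\lambda z}}{(M-1)!}\,dz = \dbinom{M+j-1}{j}\,p^{M}(1-p)^{j}$
      (negative binomial distribution; $E[j] = M\mu/\lambda$),

where

(22)  $p = \dfrac{\lambda}{\mu + \lambda}$.

Therefore,

(23)  $K^*(s) = \left[\dfrac{p}{1 - (1-p)s}\right]^{M} = \left[\dfrac{\rho}{\rho + 1 - s}\right]^{M}$, where $\rho$ here denotes the ratio $\lambda/\mu$.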
When $M = 1$ in (20), giving the exponential distribution, $s_0 = \rho$ is a solution
to (12), and the steady-state distribution (16) and distribution of time a
customer spends in the system (18) agree with the results in Sec. 20.5.


► If you want to express the interarrival time distribution as an Erlang distribution,
where $M\lambda$ replaces $\lambda$ in (20), then substitute $M\lambda$ for $\lambda$ in (20), (21) and (22), and $M\rho$
for $\rho$ in (23).

In the case of a constant arrival time (denoted as D/M/1),

(i)  $K^*(s) = e^{-(\mu/\lambda)(1-s)}$.

*III.4 MULTIPLE-SERVER MODEL WITH GENERAL

INPUT AND EXPONENTIAL SERVICE DISTRIBUTION 


The derivation of steady-state probabilities for this model (denoted as GI/M/S)
generalizes the approach used in the preceding section. In particular, it employs 
the instants of new arrivals as regeneration points. Although the details of the 
analysis are involved, the resulting formulas can be stated in a straightforward 
fashion, and are given here for completeness. 

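Analogously to (14) in Sec. III.3, let $s_0$ be the unique solution for $0 < s_0 < 1$
of the equation

(1)  $s = F^*[\mu S(1 - s)]$,

where $F^*$ is the Laplace-Stieltjes transform of the interarrival distribution,
and $S$ is the number of servers. Define and calculate

(2)  $F_j^* = F^*(\mu j)$  for $j = 0, 1, 2, \ldots, S$

(3)  $C_j = \prod_{i=1}^{j} \dfrac{F_i^*}{1 - F_i^*}$  for $j = 1, 2, \ldots, S$ and $C_0 = 1$

(4)  $D_j = \sum_{i=j+1}^{S} \dfrac{\binom{S}{i}}{C_i(1 - F_i^*)}\left[\dfrac{S(1 - F_i^*) - i}{S(1 - s_0) - i}\right]$  for $j = 0, 1, 2, \ldots, S - 1$

(5)  $M = \left[\dfrac{1}{1 - s_0} + D_0\right]^{-1}$.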


To ensure the existence of stationary distributions, assume that $\lambda < \mu S$, where
$1/\lambda$ is the mean interarrival time.

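Then it can be proved that

(6)  $P_r = \begin{cases} \sum_{j=r}^{S-1} (-1)^{j-r}\dbinom{j}{r}\,M C_j D_j & \text{for } r = 0, 1, \ldots, S - 1 \\ M s_0^{\,r-S} & \text{for } r = S, S + 1, \ldots \end{cases}$

Notice that for $r \ge S$, the form of $P_r$ is geometric.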
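
The multiple-server formulas lend themselves to direct computation. The following sketch assumes the definitions $F_j^* = F^*(\mu j)$, $C_j = \prod_{i \le j} F_i^*/(1 - F_i^*)$, the sum $D_j$ over $i = j+1, \ldots, S$, and $M = [1/(1 - s_0) + D_0]^{-1}$; it finds $s_0$ by successive substitution, which is an illustrative choice. The function names are hypothetical. The check case uses Poisson input, where the answers must coincide with the familiar M/M/S probabilities. Parameter values for which $S(1 - s_0)$ is an integer make one term of $D_j$ take the form 0/0 and need separate handling, which this sketch does not attempt.

```python
import math


def solve_root(interarrival_lst, rate, tol=1e-13, max_iter=100000):
    """Iterate s <- F*[rate(1 - s)] from s = 0 until successive values agree."""
    s = 0.0
    for _ in range(max_iter):
        s_next = interarrival_lst(rate * (1.0 - s))
        if abs(s_next - s) < tol:
            return s_next
        s = s_next
    return s


def gims_arrival_probs(interarrival_lst, mu, servers, r_max):
    """Return (s0, M, [P_0..P_{r_max}]) at arrival instants for a GI/M/S system."""
    s0 = solve_root(interarrival_lst, mu * servers)
    f = [interarrival_lst(mu * j) for j in range(servers + 1)]  # f[0] equals 1
    c = [1.0]
    for j in range(1, servers + 1):
        c.append(c[-1] * f[j] / (1.0 - f[j]))

    def term(i):
        weight = math.comb(servers, i) / (c[i] * (1.0 - f[i]))
        ratio = (servers * (1.0 - f[i]) - i) / (servers * (1.0 - s0) - i)
        return weight * ratio

    d = [sum(term(i) for i in range(j + 1, servers + 1)) for j in range(servers)]
    scale_m = 1.0 / (1.0 / (1.0 - s0) + d[0])
    probs = []
    for r in range(r_max + 1):
        if r < servers:
            probs.append(sum((-1) ** (j - r) * math.comb(j, r) * scale_m * c[j] * d[j]
                             for j in range(r, servers)))
        else:
            probs.append(scale_m * s0 ** (r - servers))
    return s0, scale_m, probs


lam, mu = 1.5, 1.0
s0, scale_m, probs = gims_arrival_probs(lambda z: lam / (lam + z), mu, 2, 60)
print(s0, scale_m, probs[:3])
```

For $\lambda = 1.5$, $\mu = 1$, $S = 2$ the first three probabilities should equal $1/7$, $1.5/7$, and $1.125/7$, the M/M/2 values.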

Using an approach similar to that in (17) and (18) in the preceding section,
you can derive that

(7)  $W(t) \equiv P[\text{time a customer spends in line} \le t] = 1 - \dfrac{M}{1 - s_0}\,e^{-\mu S(1 - s_0)t}$

and

(8)  $E[\text{time in line}] = \dfrac{M}{\mu S(1 - s_0)^2}$.

The probability of $r$ customers in the system at Time $T$ tends, as Time $T \to \infty$,
to the limit

(9)  $P_r^* = \begin{cases} \dfrac{\lambda}{r\mu}\,P_{r-1} & \text{for } r = 1, 2, \ldots, S - 1 \\ \dfrac{\lambda}{S\mu}\,P_{r-1} & \text{for } r = S, S + 1, \ldots \end{cases}$


*III.5 SINGLE-SERVER MODEL WITH GENERAL

INPUT AND GENERAL SERVICE DISTRIBUTION 


The key idea for analyzing this model (designated as GI/G/1) is to work directly
with the distribution of the time spent in line. Let $F(h)$ and $G(h)$ be the distribution
functions of interarrival and service times, respectively, and as usual, let $1/\lambda$
and $1/\mu$ be the corresponding means. To ensure the existence of limiting distributions
assume that $\rho = \lambda/\mu < 1$.

Suppose the system begins at Time $t = 0$, when the first customer arrives.
Define:

$w_r$ = time that Customer $r$ spends in line
$s_r$ = service time for Customer $r$
$a_r$ = interarrival time between Customer $r$ and Customer $r + 1$.

Since the line is empty just before Time $t = 0$,

(1)  $W_1(t) \equiv P[w_1 \le t] = \begin{cases} 0 & \text{for } t < 0 \\ 1 & \text{for } t \ge 0. \end{cases}$

The $w_r$ can be characterized recursively by

(2)  $w_{r+1} = \begin{cases} 0 & \text{if } a_r \ge w_r + s_r \\ w_r + s_r - a_r & \text{if } a_r < w_r + s_r. \end{cases}$

Give a verbal interpretation of (2).
Then for $t \ge 0$,

(3)  $W_{r+1}(t) = P[w_{r+1} = 0] + P[0 < w_{r+1} \le t]$
     $= P[w_r + (s_r - a_r) \le 0] + P[0 < w_r + (s_r - a_r) \le t]$
     $= P[w_r + (s_r - a_r) \le t]$.
Note that $s_r$ and $a_r$ are independently distributed and the distribution of $(s_r - a_r)$
is independent of $r$. Let

(4)  $K(t) \equiv P[s_r - a_r \le t] = \int_0^\infty G(t + h)\,dF(h)$  for $-\infty < t < \infty$,

where, in any specific case, the integral on the right of (4) can be calculated by
numerical methods.

Consequently, you can write (3) as a convolution of $K(h)$ and $W_r(h)$:

(5)  $W_{r+1}(t) = \int_0^\infty K(t - h)\,dW_r(h)$  for $t \ge 0$.

Starting with (1), you can in principle use (5) to calculate all the distribution
functions, $W_r(t)$, for $r = 2, 3, \ldots$.

Given that $\rho < 1$, the functions $W_r(t)$ approach a function $W(t)$ as $r \to \infty$,
where $W(t)$ satisfies the so-called Wiener-Hopf integral equation:

(6)  $W(t) = \int_0^\infty K(t - h)\,dW(h)$  for $t \ge 0$.

One way of calculating $W(t)$ is to perform the recursive computations implied by
(5) up to a large value of $r$. For any $t$, $W_{r+1}(t) \le W_r(t)$, so that the distribution
functions converge monotonically from above. [Another approach is to use a
discrete and finite approximation. For example, you may consider $t$ only at
Times $0, \epsilon, 2\epsilon, 3\epsilon, \ldots, M\epsilon$; this method leads to a set of simultaneous linear
equations for the approximate values of $W(t)$ at these points.] For special models, such as
Erlang interarrival and service times (denoted by $E_n/E_m/1$), it is possible to apply
advanced analytic methods for the solution of (6), and thereby derive explicit
formulas for $W(t)$.
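
The recursion for successive waiting times can also be applied directly to sampled service and interarrival times, which gives a quick empirical picture of the distribution of time in line. The sketch below is only an illustration: it assumes the recursion $w_{r+1} = \max(0,\ w_r + s_r - a_r)$ with $w_1 = 0$, uses exponential samples purely as an example of input data, and the function names, seed, and sample size are hypothetical choices.

```python
import random


def lindley_waits(service_times, interarrival_times):
    """Apply w_{r+1} = max(0, w_r + s_r - a_r), starting from w_1 = 0."""
    waits = [0.0]
    for s, a in zip(service_times, interarrival_times):
        waits.append(max(0.0, waits[-1] + s - a))
    return waits


def empirical_cdf(samples, t):
    """Fraction of the samples that do not exceed t."""
    return sum(1 for x in samples if x <= t) / len(samples)


# A small hand-checkable case: waits are 0, 2, 0, 1.
example = lindley_waits([3.0, 1.0, 2.0], [1.0, 5.0, 1.0])

# A sampled case; any service and interarrival distributions could be used here.
rng = random.Random(1)
lam, mu, customers = 0.5, 1.0, 20000
services = [rng.expovariate(mu) for _ in range(customers)]
gaps = [rng.expovariate(lam) for _ in range(customers)]
waits = lindley_waits(services, gaps)
no_wait_estimate = empirical_cdf(waits, 0.0)
print(example, no_wait_estimate)
```

The estimate at $t = 0$ approximates the long-run fraction of customers who do not wait; estimates of this kind are subject to sampling error and to the influence of the empty starting condition.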

Operating characteristics. Having the distribution $W(t)$, you can find
$E[\text{time in line}]$, and then by multiplying this average by you will have the
expected number of customers in the system in front of an entering customer. It 
also can be shown that as the system operates over an unbounded horizon, the 
limiting probability that the server is idle is 1 — p. Note that this value may differ 
from that given by (16) in Sec. III.3. There the value $1 - s_0$ gives the probability
that the server is idle at a regeneration point when a new customer enters; these two
probabilities agree only when the arrival distribution is Poisson, for then $s_0 = \rho$.


REVIEW EXERCISES 

1 In each part below, verify the indicated statement or formula involving the generating
function $P^*(s)$, given by (1) in Sec. III.1. Parts (e) and (f) are on p. A74.

(a) (2), (3), (4), and (5). 

(b) Geometric distribution (6) and (7).

(c) Poisson distribution (8) and (9). 

(d) Binomial distribution (10) and (11). 
(e) Negative binomial distribution (12) and (13).

(f) Check that when you take the nth derivative of $P^*(s)$ in (7), (9), (11), and (13)
and let $s = 0$, you obtain $n!\,P_n$.

2 In each part below, verify the indicated statement or formula involving the
Laplace-Stieltjes transform $G^*(s)$, given by (15) in Sec. III.1.

(a) Moment generating property (16). 

(b) Exponential distribution (17) and (18). 

(c) If X and Y are independent random variables, the Laplace-Stieltjes transform for
the distribution function of their sum is equal to the product of the transforms for 
each of the distribution functions. 

(d) Gamma distribution (19) and (20). 

(e) Erlang distribution (21) and (23), and the constant service time distribution. 

3 Consider the single-server model with Poisson input and general service distribution, 
discussed in Sec. III.2. 

(a) Give a detailed justification for (1), (2), and (3). Explain how the memoryless 
property of the exponential model is exploited in (2). 

(b) Justify in detail the construction of the Markov chain matrix (4). 

(c) Verify that the steady-state equations for the Markov matrix (4) are those given 
in (6). 

(d) Perform the intermediate algebraic steps to verify that the steady-state probabilities 
$P_r$ can be computed by the formulas in (7).

(e) Perform the intermediate algebraic steps to justify each equality for the generating
function $P^*(s)$ in (10) and (11).

(f) Justify in detail the derivation of the Laplace-Stieltjes transform in (12). 

(g) Perform the intermediate steps to justify the equality in (13). 

(h) Justify in detail the equalities in (14). 

(i) Perform the intermediate steps to justify the derivation of Pq (15) and (16). 

(j) Justify in detail the derivation of the Pollaczek-Khintchine equation (19). 

(k) Verify the derivation of (20). 

(l) Justify in detail the equalities in (21) and (22) relating to the time a customer 
spends in the system, in line, and in service. 

4 Consider the single-server model with Poisson input and Gamma service distribution, 
given by (23) in Sec. III.2.

(a) Verify the equalities in (25), and derive the form (27). 

(b) Perform the intermediate steps to derive $P^*(s)$ in (30).

(c) Examine the exponential case $M = 1$; show how to obtain $P^*(s)$ in (31).

(d) Perform the intermediate steps to derive the transform for the waiting time density, 
given in (33). 

*5 Consider the single-server model with Poisson input and Erlangian service, as discussed 
in the special material ending Sec. III.2. 

(a) Rewrite (24), (25), (26), and (30), where you replace $\mu$ by $\mu M$.

(b) Examine the constant service case, in which $M \to \infty$. Verify (i), (ii), and (iii).
6 Consider the single-server model in Sec. III.3 with general input and exponential
service distribution. 

(a) Verify the transition relation (1). 

(b) Justify (2) and explain how the memoryless property of the exponential distribution
is exploited. 

(c) Justify in detail the construction of the Markov chain matrix (4). 

(d) Verify that the steady-state equations for the Markov matrix are those given in
(7) and (8). 

(e) Substitute the solution for $P_r$ given in (9) into the stationary equations (7), and
verify the derivation in (10), (11), and (12). 

(f) Justify in detail the derivation in (13) and (14). 

(g) Justify in detail the derivation of the service time distribution in (17) and (18). 

7 Consider the single-server model with a gamma interarrival distribution and exponential
service, given by (20) in Sec. III.3.

(a) Verify the equalities in (21) and (23). 

(b) Show that when $M = 1$, a solution to (12) is $s_0 = \rho$, and the steady-state distribution
(16) and distribution of time a customer spends in the system (18) agree
with the results in Sec. 20.5.

*(c) Give a detailed justification for (i), which pertains to the case of a constant arrival 
time and is discussed at the end of Sec. III.3. 

8 Consider a single-server system with input as specified below and exponential service 
at mean rate $\mu = 10$. In each part, give explicit formulas for the probability that r
customers are in the system and for the expected number of customers in the system 
at both regeneration points and at a Time T that is an arbitrarily long distance into 
the future. 

(a) Gamma input distribution, given by (20) in Sec. III.3, where $M = 1$ and $\lambda = 9$.

(b) Rework part (a), letting $M = 2$.

(c) Rework part (a), letting $M = 3$.

(d) Rework part (a), letting $M = 2$ and $\lambda = 18$.

*(e) Constant arrival time, where $\lambda = 9$.

9 Explain your understanding of the following terms: 

generating function 
Laplace-Stieltjes transform 
moment generating function 
regeneration point 
imbedded Markov chain 
Pollaczek-Khintchine equation 
*Wiener-Hopf integral equation.


APPENDIX 


Tables 


                      S
    ρ       1       2       3       4
   .1    .1000
   .15   .1500   .0104
   .2    .2000   .0181
   .25   .2500   .0277
   .3    .3000   .0391
   .35   .3500   .0521
   .4    .4000   .0666
   .45   .4500   .0826   .0113
   .5    .5000   .1000   .0151
   .55   .5500   .1186   .0195
   .6    .6000   .1384   .0246
   .65   .6500   .1594   .0304
   .7    .7000   .1814   .0369
   .75   .7500   .2045   .0441
   .8    .8000   .2285   .0520
   .85   .8500   .2535   .0606   .0117
   .9    .9000   .2793   .0700   .0143
   .95   .9500   .3059   .0801   .0171
  1.00           .3333   .0909   .0204

Note: ρ = λ/μ = traffic intensity
      λ = arrival rate
      μ = service rate
      S = number of servers
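
The table carries no formula, so the following is an inference rather than something stated with the table: the tabulated values appear to agree with the probability that all $S$ servers are busy (the probability of delay) in a multiple-server model with Poisson input and exponential service, with $\rho = \lambda/\mu$. Under that assumption, a short sketch such as the one below extends the table to other values of $\rho$ and $S$. The function name is hypothetical, and the computation goes through the usual recursion for the loss probability.

```python
def delay_probability(servers, rho):
    """Probability that an arrival must wait, assuming an M/M/S model with rho = lambda/mu."""
    if rho >= servers:
        raise ValueError("a steady state requires rho < servers")
    blocking = 1.0
    for n in range(1, servers + 1):
        # recursion for the loss probability with n servers
        blocking = rho * blocking / (n + rho * blocking)
    return blocking / (1.0 - (rho / servers) * (1.0 - blocking))


for servers, rho in [(1, 0.1), (2, 1.0), (3, 2.0)]:
    print(servers, rho, delay_probability(servers, rho))
```

Entries that would fall below .01, and entries with $\rho \ge S$, are left blank in the table.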
                                          S
    ρ       2       3       4       5       6       7       8       9      10
  1.0    .3333   .0909   .0204
  1.2    .4499   .1411   .0370
  1.4    .5764   .2033   .0603   .0153
  1.6    .7111   .2737   .0906   .0258
  1.8    .8526   .3547   .1285   .0404   .0111
  2.0            .4444   .1739   .0597   .0180
  2.2            .5421   .2267   .0839   .0274
  2.4            .6471   .2870   .1135   .0399   .0125
  2.6            .7588   .3544   .1486   .0558   .0187
  2.8            .8766   .4286   .1895   .0754   .0270
  3.0                    .5094   .2361   .0991   .0376   .0129
  3.2                    .5964   .2885   .1271   .0508   .0184
  3.4                    .6893   .3466   .1595   .0669   .0256
  3.6                    .7877   .4103   .1965   .0862   .0346   .0127
  3.8                    .8914   .4795   .2382   .1088   .0456   .0175
  4.0                            .5541   .2847   .1351   .0590   .0237
  4.2                            .6337   .3359   .1650   .0749   .0313   .0121
  4.4                            .7183   .3919   .1988   .0935   .0407   .0164
  4.6                            .8077   .4525   .2365   .1150   .0518   .0217
  4.8                            .9016   .5177   .2783   .1395   .0650   .0282
  5.0                                    .5875   .3241   .1672   .0805   .0361

Note: ρ = λ/μ = traffic intensity
      λ = arrival rate
      μ = service rate
      S = number of servers
IV


                                              S
    ρ       6       7       8       9      10      11      12      13      14      15
  5.0    .5875   .3241   .1672   .0805   .0361   .0150
  5.2    .6616   .3740   .1982   .0983   .0455   .0196
  5.4    .7401   .4279   .2327   .1186   .0565   .0252   .0105
  5.6    .8227   .4859   .2706   .1415   .0694   .0319   .0137
  5.8    .9094   .5479   .3120   .1673   .0843   .0398   .0176
  6.0            .6138   .3569   .1959   .1012   .0492   .0224
  6.2            .6836   .4055   .2275   .1204   .0600   .0281   .0124
  6.4            .7572   .4576   .2622   .1420   .0725   .0349   .0158
  6.6            .8345   .5133   .2999   .1660   .0868   .0428   .0199
  6.8            .9155   .5725   .3408   .1925   .1029   .0520   .0248   .0112
  7.0                    .6353   .3849   .2217   .1211   .0626   .0306   .0141
  7.2                    .7015   .4322   .2536   .1413   .0746   .0373   .0177
  7.4                    .7711   .4827   .2882   .1637   .0883   .0451   .0219   .0100
  7.6                    .8441   .5363   .3256   .1884   .1036   .0541   .0268   .0126
  7.8                    .9204   .5932   .3659   .2154   .1208   .0644   .0326   .0156
  8.0                            .6533   .4091   .2449   .1398   .0759   .0392   .0193
  8.2                            .7165   .4552   .2769   .1608   .0890   .0469   .0235
  8.4                            .7828   .5042   .3114   .1838   .1036   .0556   .0284
  8.6                            .8522   .5561   .3484   .2090   .1198   .0655   .0342
  8.8                            .9246   .6110   .3881   .2364   .1377   .0767   .0407
  9.0                                    .6687   .4304   .2660   .1575   .0891   .0482
  9.2                                    .7293   .4754   .2979   .1790   .1030   .0567
  9.4                                    .7927   .5231   .3322   .2025   .1184   .0662
  9.6                                    .8590   .5734   .3688   .2280   .1353   .0769
  9.8                                    .9281   .6264   .4078   .2556   .1538   .0888
 10.0                                            .6821   .4493   .2852   .1741   .1020

Note: ρ = λ/μ = traffic intensity
      λ = arrival rate
      μ = service rate
      S = number of servers
Lump-sum penalty cost, in inventory model,
798, A25 


M

Machine interference model, 873 
Machine maintenance and repair, 502, 776, 
781,873 

Machine scheduling (sequencing), 447,506, 
507 

Maintenance models, 502, 776, 837 
Management games, 918
Management science, 3, 9 
Manne expansion problem, 501 
Manpower planning, 70, 187, 443, 502, 633,
685,699, 731,775 
Manual simulation, 894 
Market share, 640, 648 
Markov chain, 753 
imbedded, A62, A68 
recurrence time, 780, A45 
(s, S) inventory model, A44 
stationary probabilities, 753 
steady-state probabilities, 753 
transition matrix, 753 
Markovian arrivals (input), 852, A61 
Markovian decision model (process), 739
Markov process, 846 
Markov property, 343, 740,870 
Markov renewal programming, 763,779 
deterministic models, 375 
Markov separable decision problem, 781, 782,
A55 

Massive degeneracy, 465 
Master program:
computer simulation, 908 
decomposition algorithm, 614, 619 
Material balance equations, 192 
Matrix form: 

Dual Theorem, 139 
simplex method, 121
Matrix games, 161 
Max Flow/Min Cut Theorem, A7 
Maximin strategy, 162 
Maximum flow, A3, A9 
Maximum of a function, 521,530,560 
M/D/1 queue, 861, A66
M/D/S, A67

Media allocation, 66, 337, 724
Membership relationships, 896 
Memoryless property, 843 
Metals industry, 66, 73, 203, 328
M/G/∞ queue, 866
M/G/1 queue, 856, 861, A61
Minimal cost/maximum flow algorithm, A13
Minimal path problem, 54, 177 (see also
Shortest path)

Minimax objective in regression, 558 
Minimax route, 355 
Minimax strategy, 161 
Minimax Theorem, 162
Minimum-duration path, 178
Minimum (maximum) of a function, 521, 530,
560

Mixed-integer programming, 415, 486
Mixed strategy, 161
M/M/∞ queue, 866, 872
M/M/1 queue, 854, 871, A66, A70
M/M/S queue, 865, 873
Model building, 10 (see also Formulating a
model)

Modified gradient, 537 
Modified simplex algorithm, for quadratic 
programming, 606
Modulus arithmetic, 899 
Moment generating function, A60 
Monotonic sequence: 
decreasing, 381,407 
increasing, 380, 407
Monte Carlo method, 914 
More-for-less phenomenon, 231
Most stringent problem, 666
Multicommodity network, 194
Multiechelon inventory system, 834
Multi-index transportation problem, 74, 501
Multiplant expansion problem, 501 
Multiple chain, 414,420, 756 
Multiple criteria (objective functions), 61,132 
Multiple optima, in linear programming, 39,
83, 87, 109, 126

Multiple right-hand sides, 133 
Multiple-server model, 864, A67, A71
Multiplicative congruential method, 899 


Multiproduct inventory problems, 329,352, 
516, 834 

Multistage analysis, 342 
Multistage decision process, 342 
Multistage linear model, 678 
nonlinear case, 699 

Multivariate analysis, in simulation, 915 
Myopic rule, 534 


N

Nearly optimal, 370, 418,420 
Necessary and/or sufficient condition: 
optimality in Markovian model, 757,758 
optimality in nonlinear programs, 530, 538,
539, 586, 600, 603

Negative binomial distribution, 920, A22 
Negative exponential distribution, 843 
Negative semidefinite, 550,605 
Nested problem, 338 
Net inventory position, 797 
Network models, 50, 190
acyclic, 191, 235 
assignment problem, 176, A7 
flow, 190, A3 

generalized transportation problem, 67, 193 
graphs, 190 

Integrality Theorem, 192, 230
maximum flow, A3 
multicommodity flow, 194
optimum route, 54,177 
probabilistic, 696, 730, 740 
shortest-route problem, 54, 177, 231, 235,

387 

transportation problem, 52, 166, A17
transshipment problem, 171, A17 
triangularity in, 220,225,230 
New product introduction, 703,734 
Newsboy problem, 792 
Newspaper advertising, 66 
Newton-Raphson method, 537 
Next imminent event, 909 
Neyman-Pearson lemma, 697 
Node, 178

Node-arc incidence matrix, 191 
No feasible solution, 84, 111, 127 
Nonbasic variable, 103
Nonlinear constraints, 592
Nonlinear equations, solution of, 530 
Nonlinear programming, 513 
with integer variables, 487,514 
Nonnegative variable, 39 
Nonstandard basis, 606 
Normal distribution, 818
loss integral, standardized, 818, A41
multivariate, 905 
random numbers from, 904 
in simulation, 911 
Normalization restriction, 761
Normalizing equation, 429 
Northwest corner rule, 52,237 
Number of customers in system, 856,861, 867, 
A63, A69, A71 


O

Objective function, 42 
sensitivity analysis in linear programming, 
130, 140,574 

Obsolescence, 721,724, A26 
Oil industry, 33,59,67,327,516, 837, 889,930 
One-period inventory model, 792 
One-potato, two-potato problem, 15,29,125, 
157,571,634,695 

Operating characteristics, 754, 839, A47 
Operational gaming, 918 
Operations planning, 889 
Operations research: 
definition, 4 
history, 7 

managerial use, 926 
project planning, 931 
staff, 935 

Operations Research Society, 3 
Optimal decisions under uncertainty, 648 
Optimal feasible step size, 579 
Optimality conditions, in nonlinear 
programs, 600,603,604 
geometric interpretation, 632 
Optimal policy, 260,370,752 
Optimal single critical number, 794, A27, A28, 
A33, A35,A37,A38 
Optimal solution, 78 
Order quantity, 815 
Orthogonal, 535 
Overhaul, 356 


P

Parallel components, 725 
Parametric linear programming, 132 
Parking problem, auto, 735 
Partial enumeration, 480 
Partial solution, 481 
Partial tours, method of, 477 
Path, 191 
longest, 184 
shortest, 177,231,235 
stochastic problem, 696,730,740 
Pay structure, 572
Penalty cost: 

Big M Method, 112

inventory model, 789,806,809, A23, A26 
linear, 793, A28, A36, A37, A40 
lump-sum, 798, A25 


Penalty function approach, in nonlinear 
programming, 609 

Perfect information, value of, 658,666,677 
Performance measures, 897 
Periodic arrivals, 848, A71 
Periodic review inventory model, A19 
Period of random number generator, 900 
Perishable commodity problem, 288, A54 
Personnel (see also Employment scheduling): 
assignment of, 176, 211,731,775 
salary scale, 572 
training, 70,210,289 
PERT, 185 (see also Critical path) 

Petroleum industry, 33,59, 67,327,516,847, 
889,930 

Piecewise linear approximation, 457,511,551, 
575 

Pivot operation (pivoting), 98,104,608 
Planar (linear) graphs, 190 
Planned replacement, 716,764 
Planning horizon, 45,407 
inventory models, 300,309 
Planning models, 929 
Plant: 

expansion of, 492, 501, 640,648, 698, 731 
layout, 505,507 
location of, 451,503 
Point, in graph, 190 
Poisson distribution, 846 
arrivals (input), 846, A61 
demand, A21 
process, 846 
random numbers, 904 
Policy, 361,405, 747,752,757 
Policy iteration, 383,390,401,409,412,420, 
750,758 

simplex method, 427 
suboptimal policy, 442 
Policy space, 383, 409, 412
Pollaczek-Khintchine equation, 861, A65 
Polygonal approximation, 457,511,551,575 
Portfolio selection, 518 
Positive semidefinite, 550 
Positive weighted average, 110,580 
Postoptimality analysis, 129 
Potentials, method of, 216 
Preemptive priority, 863,884 
Present discounted value (worth), 365,651 
Preventive inspection, 777,781 
Preventive maintenance, 502 
Price breaks, 303, 833 
Pricing problem, 733 
Primal problem, 134 
Principle of optimality, 257 
Priorities, preemptive, 863,884 
Priority discipline, 863,868 
Probability distributions, assessment of, 653 
(see also Distribution, probability) 

Process control, 516 
Product introduction, 703, 734
Production lot sizes, 810, 832 
Production planning, 45 
Production scheduling, 261, 291, 680, 713 (see 
also Inventory models) 

Production smoothing, 187, 288, 311, 328,729 

Product-mix selection problem, 37 

Profit loss, 789 

Projection, 591 

Project network, 185 

Project scheduling, 185 

Promotional spending, 66, 337, 724, 728, 733, 
773, 774

Property of adjacent weights, 457, 511,553 
Pseudo-random numbers, 899 
Purchasing, 162, 289, 326, 698 
Pure-birth model, 846, 871
Pure-death model, 850, 871
Pure integer programming, 445 

Q

Quadratic assignment problem, 505, 507
Quadratic constraint, 571 
Quadratic criterion function, 541,684 
linear decision rule, 685 
Quadratic form, 550, 605
Quadratic programming problem, 540,605 
Quadratic Simplex Algorithm, 544
Quality control models, 777, 782 
Quantitative analysis, 12 
Quantity discounts, 303, 833 
Quasi-concave function, 540 
Quasi-convex function, 524 
Queues, 837, A59 
birth-and-death process, 869 
bulk models, 840, 875,884 
busy period, 856,861, A73 
busy probability, 855, 866, A63 
delay, probability of, 866 
discipline, 840 
finite queue, 859,872 
finite source, 872 
idle period, 856 
interarrival time, 842 
multiple-server, 865,873, A67, A71 
optimization model, 731, 734, 775 
output, 867 
preemptive, 863, 884 
priority, 863,868 
self-service, 851,866,872 
service time, 848 
simulated system, 908 

single-server model, 853, 871, A61, A68, A72
tandem, 867,881,884 
transient behavior, 854,856,860 
waiting line, 856, 861, 867, A63, A69, A71 
waiting time, 857, 861, 867, A65, A70, A72


R

RAND Corporation, 27,899 
Random arrivals, 843, A61 
Randomized strategy (decision rule), 651 
Random number generation, 898
Random service discipline (order), 840, A68, 
A71 

Ranging analysis, 132 
Rate of return, 362 
Ratio of linear forms, 559, 575 
Raw materials inventory, 811 
Recurrent event, 843 
Recursion, 258

Recursive programming, 254, 331, 375, 403
Recursive (causal) structure, 897 
Reduced gradient, method of, 584 
Reduced partial derivative, 546, 586 
Redundant relation, 57, 127
Refinery models, 41,67 

Regeneration event (point), 304, 375, A61, A68,
A71 

Regeneration models: 
deterministic, 375, 808
Horizon Theorem, 380 
stochastic, 715, 763 
Regression, 559 
Chebyshev criterion, 559,573 
minimax criterion, 558 
minimum absolute deviation, 558,573 
nonlinear, 572 
Regular arrivals, 848, A71 
Regularization postulate, 578 
Reject allowance problem, 446, 453, 709, 737 
Relative costs, 109, 228 
Relative partial derivative, 586 
Reliability models, 725, 736, 776, 777, 778, 781 
Reneging, 840, 875, 884 
Renewal equation, A47 
Renewal process, 843, A46 
Reorder cost, 789 
Reorder level, 793 
Reorder point, 793, 816 
Reorder quantity, 815 
Reorder time, A45 
Repairman problem, 873 
Repair model, 776,781 

Reparable-serviceable inventory problem, A55 
Replacement models, 180, 340, 715, 729, 763, 
777,778,782 

Replenishment models (see Inventory models) 

Replication, in simulation, 912 

Requirements, space, 86 

Research and development, 338

Research project selection, 338 

Response surface, 916 

Restricted-basis-entry rule, 553 

Restricted master program, 614 

Returns to scale, 295,303 
Revised simplex method, 121
Right-hand side, sensitivity analysis of, 132, 
140,148,574 
Rolling schedule, 273 
Rosen gradient projection method, 591 
Route: 
longest, 184 

shortest, 54,177,231,235 
stochastic problem, 696,730, 740 
Routing problem, 67, 202, 205,503 

S

Saddle-point: 
game theory, 162 
nonlinear programming, 602 
Safety component, 712. 

Safety stock, 516, 816
Salary model, 572 
Sales forecasting, 713 
Salvage value, 794 
Sample size, in simulation, 913 
Sawtooth pattern of inventory, 804 
Scheduling (see Employment scheduling and 
Machine scheduling)

Scrap allowance problem, 446, 453, 709, 737 
Search, 524 
bisection method, 527 
Fibonacci method, 432 
golden sections method, 525 
Search problems, 724, 726, 727,732,735,736, 

737 

Seasonal variation, 170 
Secondary constraints, 147 
Second-best route, 356 
Second-order method, 537, 582, 598
Secretary problem: 
hiring, 18, 29, 735 
typing pool, 858, 861,866 
Seed, 899 

Self-service model, 851,866,872 
Semidefinite quadratic form, 550, 605
Semi-Markov model, 391, 745
Sense of an inequality, 78 
Sense of optimization, 78 
Sensitivity analysis: 
dynamic programs, 271 
inventory models, 808,821, A39 
linear programs, 129,160,574 
networks, 231 
nonlinear programs, 603 
queuing models, 857,867 
Separable function, 551,554,556 
Separable Markovian decision problem, 781, 
782, A55

Separable programming, 551,616 
Sequencing problem, 447,506,507,616 
Sequential Unconstrained Maximization Technique (SUMT), 610


Serial correlation, 911 
Servers, 841 
in parallel, 841 
in series (tandem), 841 

Serviceable-reparable inventory problem, A55 
Service distribution, 841 
arbitrary, 860, A61, A72 
exponential, 850, A68 
rate, 850 

time constant, 861, A66 
times, 848 

Service level in inventory model, 816, A48 
Setup cost: 

in inventory models, 303, 789, A26 
in linear programs, 446, 451 
Shadow costs, 109 
Shadow prices, 109,140 
Ship routing, 202, 205 
Shortages, 789, 797, A21 
Shortest path (route), 54, 177, 231, 235, 387 
approach to assignment model, A15 
stochastic versions, 696, 730, 740 
Sign convention, 43, 47 
Simplex algorithm, 102 
for assignment model, A7 
for general network, 250 
for transportation problem, 215, 226 
Simplex tableau, 120 
SIMSCRIPT, 917 
Simulation, 887 
Simulation examples: 
inventory replenishment, 905 
stock market, 893 
telephone operator, 22, 31 
waiting line, 908 

Simulation languages, 891, 902, 916 
Single critical number policy, 794, A27, A28, 
A33, A35, A37, A38 
Single decision inventory model, 792 
Single-server models, 853, 871, A61, A68, A72 
Sink node (point), 171, A3 
Site location problem, 327, 451, 503 
Size-of-backlog penalty cost, 806 
Skew-symmetric game, 162 
Slack variable, 79 
Small-step gradient algorithm, 604 
Smoothing cost, 288, 311 
Smoothing problems, 187, 685, 729 
Solution characteristics, 709 
Solution set, 83 
Solution space, 82, 85, 592 
simplex method in, 108 
Source node (point), 171, A3 
Spanned node, A4 
Spare parts problem, 725, 830 
Spectral analysis, 912 
Spoilage of inventory, 288, A54 
(s, S) policies, 793, 800, A27, A33, A38, A55 
Stage, 257 
Stagecoach problem, 256 
Standard basis, 606 

Standardized Normal loss integral, 818 

Starting basis, 111, 228 

State: 

in dynamic programming, 257 
in simulation, 897 
Static inventory model, 792 
Stationarity assumption, 359, 432 
Stationary Gaussian process, 911 
Stationary point, 537, 538, 581,594, 597 
Stationary policy (strategy), 361,405, 747, 757 
Theorem of, 405, 412, 747, 752, 779, 780 
Stationary probabilities, 753, 855, 859, 865, 
871, 911, A44, A63, A69, A71 
Stationary queues, 871 
Stationary time process, 843, 911 
Statistical decision theory, 653 
Statistical equilibrium, 753, 855 
Steady-state solution, 855, 859, 865, 871, A44, 
A63, A69, A71 

Steel industry, 66, 73, 203, 328, 516 
Steepest ascent (descent), 534, 579, 611 
Step size, 531 
optimal, 533, 579 
Stochastic approximation, 916 
Stochastic linear programming model, 659 
Stochastic regeneration model, 715 
Stochastic shortest-route model, 740 
Stock market problems, 353, 731, 732 
Stock market simulation, 893 
Stockout: 

penalty for, 789, A23, A25 
probability of, 816, A48 

Stopping rule problems, 18, 726, 732, 735, 736 
Storage cost, 268, 276, 294, 303, 789, A23 
Store location, 503 
Strategic planning, 888 
Strategy, 344, 645, 663 
in game theory, 161 
randomized, 651 

Strictly convex (concave) function, 525, 539, 
581, 593 

Subprogram, 615 
Subrelaxation, 537 
Subtour, 455 

Successive approximation, 379, 406 
finite horizon in the limit, 379, 407, 418 
function space (value iteration), 380, 397, 
407, 748 

policy space (policy iteration), 383, 390, 401, 
409, 412, 750 

suboptimal policy, 442, 771 
Sufficient condition for maximum, 539 
SUMT algorithm, 610 
Supremum, 523 
Surplus variable, 79 
Surrogate constraint, 483 
Switching network, 74 


Systems: 
design, 929, 934 
synthesis, 842 


•T• 

Tabular method in simulation, 903 
Tandem queue, 867, 881, 884 
Tangent, 531 

Tangent hyperplane, 535, 593 
Taxicab example, 847 

Technological coefficient, sensitivity analysis 
of, 142 

Telephone line leasing, 882 

Telephone switching networks, 74, 837 

Telephone system simulation, 22 

Television industry, 68, 727 

Terminal node, 178 

Theil certainty equivalence, 685 

Timber harvesting model, 372 

Time compression, 890 

Time homogeneous Markov process, 869 

Time-integrated average, 911 

Time in line, 857, 861, 867, A65, A70, A72 

Time sharing, 926, 937 

Time-staged problem, 46 

Time-step incrementation, 905 

Tour, 454 

Traffic intensity, 855, 861 
Transform, A60 

Transient correction, in Markov chains, 754 
Transient solution, in queuing, 854, 856, 860 
Transient state, 755 

Transportation equipment routing, 202, 205, 
776, 889, 929 

Transportation problem, 166 
degeneracy in, 226 
generalized model, 193, 642, A17 
initial basis, 228 
nonlinear costs, 357 
sensitivity analysis, 231 
simplex method, 215 
solution by maximum flow, A17 
stochastic version, 517, 671, 698 
triangularity property, 220 
Transshipment model, 171, 224 
Transshipment point, 171 
Traveling salesman problem, 184, 454, 471, 505 
Tree: 

decision, 704 
in network, 191, 230 
Triangle inequality, 178 
Triangularity property (Theorem), in networks, 225 

Trim loss problem, 68-69, 357, 504, 573, 696 
Truck delivery problem, 503 
Truck dispatching problem, 354, 776 
Truncated Poisson, 851 
(t, T) inventory policy, A54 
Turnover ratio, 827 
Turnpike problem, 401 
Two-bin policy, 811 
Two-person constant-sum games, 161 
Two-stage linear model with uncertainty, 658, 
691 

nonlinear case, 699 


•U• 

Unbounded horizon, 359, 745 
Unbounded optimal solution, 44, 83, 87, 111, 
126, 523 

Uncertain actions, 649 
Uncertain horizon, 721, 724, A53 
Uncertainty, cost of, 658, 666, 677 
Unconstrained maximization (minimization), 
529, 610 

Unconstrained in sign, 39, 80 
Uniform distribution, 898 
Uniform random numbers, 898 
Unimodal function, 432, 525, A27, A38 
Unrestricted in sign, 39, 80 
Upper-bounded variable, 148, 170, 210, 228, 
A17 

Upper-bound substitution, 149 
Utility of money, 363 
Utilization factor, 856, A63 


•V• 

Validity and reliability, 897 
Value determination equations (routine), 409, 
413, 757, 759 
Value of a game, 162 

Value iteration, 380, 388, 397, 407, 748, 779 
suboptimal policy, 442 
Value of perfect information, 658, 666, 667 
Variable-time increments, 905 
Variance, in simulation, 912 
Variance-reduction, 914 


Veinott Optimal Inventory Policy Theorem, 
A27 
Vertex: 

in network, 190 
in solution space, 83, 85, 108 
Virtual delay, 866 

Von Neumann’s Fundamental Theorem of 
Games, 162 


•W• 

Wagner-Whitin lot-size model, 303, 335, 561 
Wait-and-see problem, 665 
Waiting line length, 856, 861, 866, A63, A69, 
A71 

Waiting line models, 837, A59 (see also 
Queues) 

Waiting time, 857, 861, 867, A65, A70, A72 
Warehouse location model, 451, 736 
Warehouse problem, 162, 289, 698 
Weighted average, 110, 580 
Weighted distribution problem, 193 
Weighting factor, 110 
Wiener-Hopf integral equation, A73 
Where-or-when production problem, 20, 354, 
633 

Wilson lot size, 21, 30, 352, 378, 807 
Wolfe dual, 604 

Wolfe-Frank nonlinear programming algorithm, 580 

Wolfe quadratic programming algorithm, 606 
Wolfe reduced gradient method, 584 
Wood products industry, 34, 65, 68-69, 372, 
660, 667, 774 


•Z• 

Zangwill concave simplex method, 584 
Zero-one programming, 480 
Zero-one variable, 450 
Zero-sum game, 162 
Zigzagging, 537 

Zoutendijk method of feasible directions, 598 